Question

我正在尝试将以下C代码转换为汇编：

void write (int bitpos, unsigned short sample)
{
    int pos = bitpos / 16;
    int posA = bitpos - pos * 16;
    unsigned short write1 = sample >> posA;
}

我一直在换班操作中遇到错误。我看了一本书中的一些例子，但我不明白什么是错的。我想这可能是因为我想要转移的数量是一个变量。我想知道实现这个目的的正确方法是什么？

以下是我的尝试：

//int pos = bitpos / 16;
mov eax, 0
mov eax, [bitpos] // eax= bitpos
cdq
mov ecx, 16         
idiv ecx        //ecx = pos

//int posA = bitpos - pos * 16;
mov ebx, ecx    //ebx = pos
imul ebx, 16    // ebx = pos*16
sub eax, ebx    // eax = posA

//unsigned short write1 = sample >> posA;
mov bx, [sample]
shr bx, eax // This is the part that is not working.

错误说明：错误的操作数类型。错误代码：C2415

Answer 1

您的write()函数没有返回值且没有副作用（没有写入任何全局变量，没有系统调用，只设置一些在函数返回时被丢弃的局部变量）。您可以而且应该将其优化为空函数just like gcc does。

global write
write:
    ret

让我们假装你的函数返回write1变量，所以你必须计算它。

gcc -Og（针对调试进行优化）使得可读的asm始终不会从内存中存储/重新加载。 gcc -m32 -Og -fverbose-asm -masm=intel emits：

# see the godbolt link for colour-coded mapping of source lines to asm lines
write(int, unsigned short):
    mov edx, DWORD PTR [esp+4]  # bitpos, bitpos
    lea eax, [edx+15]   # tmp98,
    test    edx, edx    # bitpos
    cmovns  eax, edx    # tmp98,, bitpos, bitpos
    sar eax, 4  # tmp99,
    neg eax # tmp101
    sal eax, 4  # tmp102,
    mov ecx, eax    # tmp102, tmp102
    add ecx, edx    # posA, bitpos
    movzx   eax, WORD PTR [esp+8]   # D.2591, sample
    sar eax, cl # D.2591, posA
    ret

注意它是如何从堆栈加载函数参数的，因为它们是函数参数，而不是全局变量。（您的代码引用[bitpos]，一个全局的，而不是返回地址后的第一个位置[esp+4]。）64位ABI在寄存器中传递args，因此您可以获得更清晰的代码。

条件移动代码在那里，因为负数的整数除法的C语义给出了算术右移的不同结果（它们以不同的方式舍入）。由于idiv与班次相比非常昂贵，因此仍需要使用额外的指令来设置班次。如果bitpos未签名，则可以使用shr。

通过全面优化，gcc找到了一种更有效的方法，并将一些算法叠加在一起。（即除以16，然后乘以16，舍入到最接近的16的倍数，用单个and实现，以掩盖这些位。）

故事的道德：您可以随时查看编译器输出，以获取有关如何做某事的灵感，并且经常会看到您最初没有想到的技巧。

装配翻译

1 个答案: