Question

我正在用C编写面向嵌入式平台的例程。
在例程中，我需要对128位值执行按位>>> from mpl_toolkits.basemap import Basemap >>> import numpy as np >>> import matplotlib.pyplot as plt >>> # read in topo data (on a regular lat/lon grid) >>> etopo = np.loadtxt('etopo20data.gz') >>> lons = np.loadtxt('etopo20lons.gz') >>> lats = np.loadtxt('etopo20lats.gz') >>> # create Basemap instance for Robinson projection. >>> m = Basemap(projection='robin',lon_0=0.5*(lons[0]+lons[-1])) >>> # compute map projection coordinates for lat/lon grid. >>> x, y = m(*np.meshgrid(lons,lats))和XOR操作。
目标架构没有SSE2，因此不支持本机128位操作。
我遇到了this答案，该答案模拟了软件中的SHIFT RIGHT操作。
我的问题是，是否有更好的方法可以做到这一点，我的意思是与使用递归相比，具有更好的数据结构来表示128位值以及模拟SHIFT和XOR操作的最佳方法（如链接中的答案所示））。我希望最大程度地减少有限堆栈内存的使用。

Answer 1

您可以使用以下结构存储128位数据

typedef struct
{
    uint32_t a;
    uint32_t b;
    uint32_t c;
    uint32_t d;
} Type_128bit;

然后您可以编写左移函数，如下所示

int leftshift(Type_128bit in, Type_128bit out, int value)
{
    int val;
    if (value >= 128)
    {
        return (-1); // error condition
    }
    else if (value < 32)
    {
        out->a = (in->a << value) | (in->b >> value);
        out->b = (in->b << value) | (in->c >> value);
        out->c = (in->c << value) | (in->d >> value);
        out->d = in->d << value;
    }
    else if (value < 64)
    {
        val = value - 32;
        out->a = (in->b << val) | (in->c >> val);
        out->b = (in->c << val) | (in->d >> val);
        out->c = (in->d << val);
        out->d = 0x00;
    }
    else if (value < 96)
    {
        val = value - 64;
        out->a = (in->c << val) | (in->d >> val);
        out->b = (in->d << val);
        out->c = 0x00;
        out->d = 0x00;
    }
    else // value < 128
    {
        val = value - 96;
        out->a = (in->d << val);
        out->b = 0x00;
        out->c = 0x00;
        out->d = 0x00;
    }
    return (0); //success
}

这将避免递归提到的解决方案，并提供更好的运行时。但是代码大小会增加，您需要仔细测试代码。

Answer 2

uint32_t *shiftL(uint32_t *val, const size_t size, const size_t nbits)  // <= 32
{
    uint32_t mask = (1 << nbits) - 1;

    mask <<= 32 - nbits;

    for(size_t cword = size; cword - 1 ; cword --)
    {
        uint32_t temp = (val[cword - 2] & mask) >> nbits
        val[cword - 1] <<= nbits;
        val |= temp;
    }
    val[0] <<= nbits;
    return val;
}

非SSE2拱上的128位值的按位运算

2 个答案: