Question

我的任务是设计一个满足这些要求的功能：

函数应将给定一维数组的成员求和。但是，它应该只对二进制表示形式中的个数大于定义的阈值的成员求和（例如，如果阈值为4，则将计数255，而不会计数15）
数组长度是任意的
该功能应使用尽可能少的内存，并应以高效的方式编写
生产函数代码（'sum_filtered（）{..}'）不得使用任何标准C库函数（或任何其他库）
函数成功返回0，错误返回错误代码
数组元素是16位有符号整数类型，计算期间的溢出应视为失败
使用确保不同CPU之间具有可移植性的数据类型（因此在8/16/32位MCU上的计算将是相同的）
功能代码中的doxygen注释中应包含合理数量的注释

这是我的解决方法：

#include <iostream>
using namespace std;

int sum_filtered(short array[], int treshold)
{
    // return 1 if invalid input parameters
    if((treshold < 0) || (treshold > 16)){return(1);}

    int sum = 0;
    int bitcnt = 0;
    for(int i=0; i < sizeof(array); i++)
    {
        // Count one bits of integer
        bitcnt = 0;
        for (int pos = 0 ; pos < 16 ; pos++) {if (array[i] & (1 << pos)) {bitcnt++;}}

        // Add integer to sum if bitcnt>treshold
        if(bitcnt>treshold){sum += array[i];}
    }
    return(0);
}

int main()
{
 short array[5] = {15, 2652, 14, 1562, -115324};
 int result = sum_filtered(array, 14);
 cout << result << endl;

 short array2[5] = {15, 2652, 14, 1562, 15324};
 result = sum_filtered(array2, -2);
 cout << result << endl;
}

但是我不确定该代码是否可以在不同的CPU之间移植。

我不知道在计算过程中怎么会发生溢出，以及使用此功能处理数组时还会发生其他错误。

更有经验的人能给我他的意见吗？

Answer 1

好吧，我可以预见一个问题：

for(int i=0; i < sizeof(array); i++)

在这种情况下，

array是一个指针，因此在32位系统上可能是4，在64位系统上可能是8。您确实确实希望将一个计数变量（在本例中为5）传递给sum_filtered函数（然后您可以将计数作为sizeof（array）/ sizeof（short）传递）。

无论如何，这段代码：

    // Count one bits of integer
    bitcnt = 0;
    for (int pos = 0 ; pos < 16 ; pos++) {if (array[i] & (1 << pos)) {bitcnt++;}}

实际上，您正在此处执行一个弹出计数（）（可以在gcc / clang上使用__builtin_popcount或在MSVC上使用__popcnt来完成。它们是特定于编译器的，但通常归结为大多数CPU上的单个popcount CPU指令）< / em>。

如果您确实想以较慢的方式执行此操作，那么一种有效的方法是将计算视为按位SIMD操作的形式：

#include <cstdint> // or stdint.h if you have a rubbish compiler :) uint16_t popcount(uint16_t s) { // perform 8x 1bit adds uint16_t a0 = s & 0x5555; uint16_t b0 = (s >> 1) & 0x5555; uint16_t s0 = a0 + b0; // perform 4x 2bit adds uint16_t a1 = s0 & 0x3333; uint16_t b1 = (s0 >> 2) & 0x3333; uint16_t s1 = a1 + b1; // perform 2x 4bit adds uint16_t a2 = s1 & 0x0F0F; uint16_t b2 = (s1 >> 4) & 0x0F0F; uint16_t s2 = a2 + b2; // perform 1x 8bit adds uint16_t a3 = s2 & 0x00FF; uint16_t b3 = (s2 >> 8) & 0x00FF; return a3 + b3; }

我知道它说您不能使用stdlib函数（第4点），但这肯定不应该应用于标准整数类型吗？（例如uint16_t），如果没有，那么就无法保证跨平台的可移植性。你真倒霉

我个人只是使用64位整数作为总和。该应该减少任何溢出的风险*（即，如果阈值为零，并且所有值均为-128，那么如果数组大小超过0x1FFFFFFFFFFFF元素，则您将溢出[562,949,953,421,311十进制）。

#include <cstdint> int64_t sum_filtered(int16_t array[], uint16_t threshold, size_t array_length) { // changing the type on threshold to be unsigned means we don't need to test // for negative numbers. if(threshold > 16) { return 1; } int64_t sum = 0; for(size_t i=0; i < array_length; i++) { if (popcount(array[i]) > threshold) { sum += array[i]; } } return sum; }

在不同CPU上处理整数

1 个答案: