将较小结构的数组移动到C中较大结构的数组中

时间:2014-10-14 14:50:13

标签: c data-structures struct casting padding

今天我正在研究将一个较小的结构数组直接移动到一个较大结构数组(arrayNew)的问题(基本上是升级较小的结构以存储更多信息)。需要从一个读取操作中的HDD读取较小的结构到新的“升级的”较大结构的数组中,将调用一个函数来执行“升级”。此外,从硬盘驱动器读取的结构中的所有新字段都将设置为'0'。 我尝试过的其他更简单的解决方案是:

  • 创建旧结构的本地数组(arrayOld),将结构从HDD加载到其中然后只需循环遍历新结构的空数组(arrayNew)并手动将每个结构内容从arrayOld移动到arrayNew。 (例如arrayNew[i].x = arrayOld[i].x;) 这个问题是,在我的情况下,我使用的数组非常大而且对于堆栈来说太大(每个数组大约1mb),在调用升级函数的瞬间导致分段错误。

  • 另一个可行的解决方案是创建旧结构的动态数组(arrayDy)并将旧结构加载到arrayDy中,然后再次手动移动arrayDy中的每个结构内容进入arrayNew。 (例如arrayNew[i].y = arrayDy[i].y;)这解决了堆栈内存不足的问题。

实施第二个解决方案后。我决定尝试并开发一种不使用动态分配内存的解决方案,并在一次读取操作中将HHD中的旧结构数组直接加载到更大的结构arrayNew数组中,并操纵{{1}的内容在内存中填充由于数组更大而丢失的值。

我将在下面的解决版本中发布我的解决方案,使用以下结构作为我的示例:

arrayNew

2 个答案:

答案 0 :(得分:1)

是的,这是可能的 - 您可以使用union。 C99标准作出特殊保证,可用于实现您的要求:

  6.5.2.3-5:为了简化联合的使用,我们做了一个特殊的保证:如果一个联合包含几个共享一个共同初始序列的结构(见下文),并且如果联合对象当前包含其中一个结构,允许检查其中任何一个的共同初始部分,即可以看到完整类型的联合声明。

您的structA_structB_会共享一个共同的初始序列,因此创建union并通过它访问结构将会起到作用:

union {
    structA a;
    structB b;
} u;
memset(&u.b, 0, sizeof(structB)); // Zero out the bigger structB
loadFromHdd(&u.a); // Load structA part into the union
// At this point, u.b is valid, with its structA portion filled in
// and structB part zeroed out.

请注意,您无法对数组执行此操作(当然,除非您创建union s的数组)。每个structA都需要单独加载到union中,然后可以将其作为structB读取。

答案 1 :(得分:1)

我提出并用作解决方案的方法基本上将HDD的较小结构(在这种情况下为文件)加载到新的较大结构的数组中,然后重新排列内存块,以便可以正确访问每个字段。用于说明这一点的代码如下,并且是mcve

#include <stdio.h>
#include <string.h>

typedef struct INNER_STRUCT_ {

    int i_item1;
    int i_item2;
    char i_item3;

} INNER_STRUCT;

typedef struct SMALL_STRUCT_ {

    int item1;
    char item2;
    INNER_STRUCT item3;

} SMALL_STRUCT;

typedef struct BIG_STRUCT_ {

    int item1;
    char item2;
    INNER_STRUCT item3;
    INNER_STRUCT item4;
    /* 
    Note that the big struct is exactly the same as the small 
    struct with one extra field - Key to this method working 
    is the fact that the extension to the struct is appended
    at the end, in an array of the structs will be placed one 
    after the other in memory with no gaps*/

} BIG_STRUCT;

void printSmallStruct (SMALL_STRUCT *inStruct, int count) {
    // Print everything inside given small struct
    printf("\n\n Small struct %d, item1: %d \n",count,inStruct->item1);
    printf(" Small struct %d, item2: %c \n",count,inStruct->item2);
    printf(" Small struct %d, item3.i_item1: %d \n",count,inStruct->item3.i_item1);
    printf(" Small struct %d, item3.i_item2: %d \n",count,inStruct->item3.i_item2);
    printf(" Small struct %d, item3.i_item3: %c \n",count,inStruct->item3.i_item3);
}

void printBigStruct (BIG_STRUCT *inStruct, int count) {
    // Print everything inside given big struct
    printf("\n\n Big struct %d, item1: %d \n",count,inStruct->item1);
    printf(" Big struct %d, item2: %c \n",count,inStruct->item2);
    printf(" Big struct %d, item3.i_item1: %d \n",count,inStruct->item3.i_item1);
    printf(" Big struct %d, item3.i_item2: %d \n",count,inStruct->item3.i_item2);
    printf(" Big struct %d, item3.i_item3: %c \n",count,inStruct->item3.i_item3);
    printf(" Big struct %d, item4.i_item1: %d \n",count,inStruct->item4.i_item1);
    printf(" Big struct %d, item4.i_item1: %d \n",count,inStruct->item4.i_item2);
    printf(" Big struct %d, item4.i_item1: %c \n",count,inStruct->item4.i_item3);
}

int main() {


    SMALL_STRUCT smallStructArray[5];       // The array of small structs that we will write to a file then read

    BIG_STRUCT   loadedBigStructArray[5];   // The large array of structs that we will read the data from the file into

    int i;  // Counter that we will use

    FILE *pfile;    // pointer to our file stream

    void *secondary_ptr;    // void pointer that we will use to 'chop' memory into the size we want

    /* Fill the array of structs (smallStructArray) */
    for (i = 0; i < 5; i++) {
    /* We fill each field with different data do we can ID that the right data is in the right fields */
        smallStructArray[i].item1 = 111;
        smallStructArray[i].item2 = 'S';
        INNER_STRUCT*    temp = &smallStructArray[i].item3;
        temp->i_item1 = 777;
        temp->i_item2 = 999;
        temp->i_item3 = 'I';
    }


    /* Write the contents of smallStructArray to binary file then display it */
    pfile = fopen("test.dat","wb");
    if (pfile!=NULL){
    for (i = 0; i < 5; i++) {
        fwrite(&smallStructArray[i],sizeof(SMALL_STRUCT),1,pfile);
    }
    fclose(pfile);
    }
    else{
    printf("Unable to open file!");
    return 1;
    }

    for (i = 0; i < 5; i++) {
         printSmallStruct(&smallStructArray[i],i);
    }

    /* Clear array of big structs using memset  */
    memset(&loadedBigStructArray[0],0,sizeof(loadedBigStructArray));

    /* Here we read from the smallStructArray that was aved to file into the  loadedBigStructArray */
    pfile = fopen("test.dat","rb");
    if (pfile !=NULL){
    /*
    He we pass fread the following:     size_t fread(void *args1, size_t args2, size_t args3, FILE *args4)
    args1   - a pointer to the beginning of a block of memory, in our case the beginning of the 
          array loadedBigStructArray.

    args2   - the size of the ammout of bytes we wish to read, in our case the size of a SMALL_STRUCT, 
          the size one of the elments in the array saved to the file.

    args3   - the ammount of elements to read, in our case five (which is the number of elements the 
          array saved to the file has. 

    args4   - a pointer to a FILE that specifies our input stream.

    Essentially what fread will do here is read a block of bytes the size of the array we saved to 
    the file (smallStructArray) into the array in memory loadedBigStructArray from the 
    beggining of loadedBigStructArray. Fig 1 illustrates what this will look like in memory.
    */
    fread(&loadedBigStructArray,sizeof(SMALL_STRUCT),5,pfile);
    fclose(pfile);
    }
    else{
    printf("Unable to open file!");
    return 1;
    }
    /* 
    Due to the way the array on the file has been read into the array in memory, if we try 
    to access the data in loadedBigStructArray only the first 5 values will be valid, due to 
    the  memory not being in the order we want. We need to re-arrange the data in loadedBigStructArray
    */

    /* 
    Here we use a void pointer to point to  the beggining of the loadedBigStructArray.
    we will use this pointer to 'chop' the data loadedBigStructArray into SMALL_STRUCT 
    sized 'chunks' we can read from.

    Due to the way pointers and arrays work in C we can cast the void pointer to any type we want
    and get a chunk of memory that size begginnig from the pointer and its off set.
    E.g. : int temp = ((int *)void_ptr)[i];  
    This example above will give us an integer 'temp' that was taken from memory beggining from position
    void_ptr in memory and its offset i. ((int *)void_ptr) casts the pointer to type int and [i] dereferances
    the pointer to location i.
    */
    secondary_ptr = &loadedBigStructArray;

    /* 
    Not we are going through the array backwards so that we can rearange the data with out overwriting 
    data in a location that has data which we havent moved yet. As the bottom end of the loadedBigStructArray
    is essentially empty we can shift data down that way.
    */
    for (i = 5; i > -1; i=i-1) {


    SMALL_STRUCT temp = ((SMALL_STRUCT *)secondary_ptr)[i]; // dereference pointer to SMALL_STRUCT [i] inside loadedBigStructArray call it 'temp'

    /*
    Now that we have dereferenced a pointer a given SMALL_STRUCT inside loadedBigStructArray called 'temp'
    we can use temp to move the data inside temp to its corresponding position in loadedBigStructArray 
    which rearragnes the data.
    */
    loadedBigStructArray[i].item1 = temp.item1;
    loadedBigStructArray[i].item2 = temp.item2;
    loadedBigStructArray[i].item3.i_item1 = temp.item3.i_item1;
    loadedBigStructArray[i].item3.i_item2 = temp.item3.i_item2;
    loadedBigStructArray[i].item3.i_item3 = temp.item3.i_item3;

    /* We then fill the new field to be blank */
    loadedBigStructArray[i].item4.i_item1 = 0;
    loadedBigStructArray[i].item4.i_item2 = 0;
    loadedBigStructArray[i].item4.i_item3 = '0';
    }

    /* Print our new structures */
    for (i = 0; i < 5; i++) {
         printBigStruct(&loadedBigStructArray[i],i);
    }

    return 0;
}

技术可视化:

Visualization of technique

当fread将保存在磁盘上的数组的单个读取操作放入内存中的数组时由于它变小,它将占用内存中第一个数组的部分,但是底部是“#”; section可以是任何东西,如果我们尝试使用我们对数据的当前句柄访问新数组中的数据,我们将获得不准确的信息或坏的内存。在我们可以在数组中的结构上使用任何句柄之前,我们必须重新排列这些数据。