Question

使用VS 2012，.NET 4.5,64bit和CUDAfy 1.12，我有以下概念证明

using System;
using System.Runtime.InteropServices;
using Cudafy;
using Cudafy.Host;
using Cudafy.Translator;

namespace Test
{
[Cudafy(eCudafyType.Struct)]
[StructLayout(LayoutKind.Sequential)]
public struct ChildStruct
{
    [MarshalAs(UnmanagedType.LPArray)]
    public float[] FArray;
    public long FArrayLength;
}

[Cudafy(eCudafyType.Struct)]
[StructLayout(LayoutKind.Sequential)]
public struct ParentStruct
{
    public ChildStruct Child;
}

public class Program
{
    [Cudafy]
    public static void KernelFunction(GThread gThread, ParentStruct parent)
    {
        long length = parent.Child.FArrayLength;
    }

    public static void Main(string[] args)
    {
        var module = CudafyTranslator.Cudafy(
          ePlatform.x64, eArchitecture.sm_35,
          new[] {typeof(ChildStruct), typeof(ParentStruct), typeof(Program)});
        var dev = CudafyHost.GetDevice();
        dev.LoadModule(module);

        float[] hostFloat = new float[10];
        for (int i = 0; i < hostFloat.Length; i++) { hostFloat[i] = i; }

        ParentStruct parent = new ParentStruct
        {
            Child = new ChildStruct
            {
                FArray = dev.Allocate(hostFloat),
                FArrayLength = hostFloat.Length
            }
        };

        dev.Launch(1, 1, KernelFunction, parent);

        Console.ReadLine();
    }
}
}

当程序运行时，我在dev.Launch上收到以下错误：

Type 'Test.ParentStruct' cannot be marshaled as an unmanaged structure; no meaningful size or offset can be computed.

如果从ChildStruct中删除float数组，它将按预期工作。

过去曾在C / C ++ / Cli和CUDA C工作，我知道错误的本质。此错误的某些解决方案建议使用Size的{{1}}参数手动设置结构大小，但由于结构中的各种类型，这是不可能的。

我查看了生成的.cu文件，它正在生成一个MarshalAs的浮点数组，这正是我所期望的。

有没有办法将结构中的数组传递给内核？如果没有最好的第二种选择？这个问题在CUDA C中不存在而且只存在，因为我们正在从CLR编组。

Answer 1

我花了很多时间阅读CUDAfy的源代码，看看是否有解决这个问题的方法。

CUDAfy正试图让.NET开发人员过于简单，并使他们远离IntPtr和其他指针概念。但是，如果没有对此库的工作方式进行重大重构，抽象级别就很难想到这个问题的答案。

无法在结构中发送浮点数组是一个显示停止。我最终在CUDA运行时进行PInvoke而不使用CUDAfy。

Answer 2

这是.NET的限制，而不是CUDAfy。数据必须是blittable，而非固定大小的数组则不是。这是有效的，基于对codeplex的CUDAfy单元测试：

[Cudafy]
[StructLayout(LayoutKind.Sequential, Size=64, CharSet = CharSet.Unicode)]
public unsafe struct PrimitiveStruct
{
    public fixed sbyte Message[32];
    public fixed char MessageChars[16];
}

由于您可以在设备代码中使用Length属性，因此也没有理由明确存储数组长度。

在CUDAfy中传递结构中的数组

2 个答案: