Question

我试图在c＃中将WAV文件读入缓冲区数组，但是遇到了一些问题。我正在使用文件流来管理音频文件。这就是我的......

FileStream WAVFile = new FileStream(@"test.wav", FileMode.Open);

        //Buffer for the wave file...
        BinaryReader WAVreader = new BinaryReader(WAVFile);

        //Read information from the header.
        chunkID = WAVreader.ReadInt32();
        chunkSize = WAVreader.ReadInt32();
        RiffFormat = WAVreader.ReadInt32();

        ...

        channels = WAVreader.ReadInt16();
        samplerate = WAVreader.ReadInt32();
        byteRate = WAVreader.ReadInt32();
        blockAllign = WAVreader.ReadInt16();
        bitsPerSample = WAVreader.ReadInt16();

        dataID = WAVreader.ReadInt32();
        dataSize = WAVreader.ReadInt32();

以上是从WAV文件头读取数据。然后我有这个：

musicalData = WAVreader.ReadBytes(dataSize);

...读取实际的样本数据，但这对于60秒的音频只有26个字节。它是否正确？

我如何将byte []数组转换为double []？

Answer 1

自从触及WAVE文件处理以来，这已经过了10到15年，但不像第一印象大多数人认为波形文件是简单的固定大小标头，后跟PCM编码的音频数据，WAVE文件有点复杂RIFF格式文件。

我建议使用interop并调用处理RIFF文件格式的API，而不是重新设计RIFF文件处理和各种情况。

您可以在this example中看到如何打开和获取数据缓冲区（以及有关缓冲区的元信息）的示例。它使用的是C ++，但它显示了使用mmioOpen，mmioRead，mmioDescend，mmioAscend API，您需要使用这些API来获取正确的音频缓冲区。< / p>

Answer 2

这段代码可以解决问题。它将波形文件转换为规范化的双数组（-1到1），但是要使它成为一个int / short数组（删除/32768.0位并添加32768）应该是微不足道的。如果发现加载的wav文件是单声道，则右[]数组将设置为null。

我无法声称它完全是防弹（潜在的一个一个错误），但在创建65536样本数组并创建一个从-1到1的波后，没有一个样本似乎是通过＆＃39;天花板或地板。

// convert two bytes to one double in the range -1 to 1
static double bytesToDouble(byte firstByte, byte secondByte)
{
    // convert two bytes to one short (little endian)
    short s = (secondByte << 8) | firstByte;

    // convert to range from -1 to (just below) 1
    return s / 32768.0;
}

// Returns left and right double arrays. 'right' will be null if sound is mono.
public void openWav(string filename, out double[] left, out double[] right)
{
    byte[] wav = File.ReadAllBytes(filename);

    // Determine if mono or stereo
    int channels = wav[22];     // Forget byte 23 as 99.999% of WAVs are 1 or 2 channels

    // Get past all the other sub chunks to get to the data subchunk:
    int pos = 12;   // First Subchunk ID from 12 to 16

    // Keep iterating until we find the data chunk (i.e. 64 61 74 61 ...... (i.e. 100 97 116 97 in decimal))
    while(!(wav[pos]==100 && wav[pos+1]==97 && wav[pos+2]==116 && wav[pos+3]==97))
    {
        pos += 4;
        int chunkSize = wav[pos] + wav[pos + 1] * 256 + wav[pos + 2] * 65536 + wav[pos + 3] * 16777216;
        pos += 4 + chunkSize;
    }

    pos += 8;

    // Pos is now positioned to start of actual sound data.
    int samples = (wav.Length - pos)/2;     // 2 bytes per sample (16 bit sound mono)

    if (channels == 2)
    {
        samples /= 2;        // 4 bytes per sample (16 bit stereo)
    }

    // Allocate memory (right will be null if only mono sound)
    left = new double[samples];

    if (channels == 2)
    {
        right = new double[samples];
    }
    else
    {
        right = null;
    }

    // Write to double array/s:
    int i=0;
    while (pos < length)
    {
        left[i] = bytesToDouble(wav[pos], wav[pos + 1]);
        pos += 2;

        if (channels == 2)
        {
            right[i] = bytesToDouble(wav[pos], wav[pos + 1]);
            pos += 2;
        }

        i++;
    }
}

如果您想使用插件，假设您的WAV文件包含16位PCM（这是最常见的），您可以使用NAudio将其读取到字节数组中，然后将其复制到为方便起见，16位整数数组。如果是立体声，则样本将左右交错。

using (WaveFileReader reader = new WaveFileReader("myfile.wav"))
{
    Assert.AreEqual(16, reader.WaveFormat.BitsPerSample, "Only works with 16 bit audio");
    byte[] buffer = new byte[reader.Length];
    int read = reader.Read(buffer, 0, buffer.Length);
    short[] sampleBuffer = new short[read / 2];
    Buffer.BlockCopy(buffer, 0, sampleBuffer, 0, read);
}

我个人尽量避免使用第三方库。但是如果您希望代码看起来更好更容易处理，那么选项仍然存在。

如何在C＃中将音频文件读入数组

2 个答案: