Question

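I'm trying to write a program that reads audio from my computer's microphone, alters it in some way (for now just to test it), and then plays it back through the speakers. As it stands it works fine, but there is a very noticeable delay between the audio going into the microphone and the sound being heard, and I'm trying to find a way to reduce it. I know it is nearly impossible to remove the delay completely, but I'm looking for a way to at least make it almost inaudible.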

The code is as follows:

package com.funguscow;

import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioSystem;
import javax.sound.sampled.DataLine;
import javax.sound.sampled.SourceDataLine;
import javax.sound.sampled.TargetDataLine;

public class Listen {

    public static void main(String[] args){
        AudioFormat format = new AudioFormat(44100, 16, 2, true, true); //get the format for audio

        DataLine.Info targetInfo = new DataLine.Info(TargetDataLine.class, format); //input line
        DataLine.Info sourceInfo = new DataLine.Info(SourceDataLine.class, format); //output line

        try {
            TargetDataLine targetLine = (TargetDataLine) AudioSystem.getLine(targetInfo);
            targetLine.open(format);
            targetLine.start();

            SourceDataLine sourceLine = (SourceDataLine) AudioSystem.getLine(sourceInfo);
            sourceLine.open(format);
            sourceLine.start();

            int numBytesRead;
            byte[] targetData = new byte[sourceLine.getBufferSize()];

            while (true) {
                numBytesRead = targetLine.read(targetData, 0, targetData.length); //read into the buffer

                if (numBytesRead == -1) break;

                for(int i=0; i<numBytesRead/2; i++){ //apply hard distortion/clipping
                    int j = (((targetData[i * 2]) << 8) & 0xff00) | ((targetData[i * 2 + 1]) & 0xff);
                    j *= 2;
                    if(j > 65535) j = 65535;
                    if(j < 0) j = -0;
                    targetData[i * 2] = (byte)((j & 0xff00) >> 8);
                    targetData[i * 2 + 1] = (byte)(j & 0x00ff);
                }

                sourceLine.write(targetData, 0, numBytesRead); //play
            }
        }
        catch (Exception e) {
            System.err.println(e);
        }
    }

}

As it is, there is a delay of what seems to be about 1 second. Is there a way to fix this?

Answer 1

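How big is the buffer returned from sourceLine.getBufferSize()? If you're reading from the microphone at 44,100 samples per second and 2 bytes per sample (one channel), it takes 1 second to fill an 88,200-byte buffer. My guess is that the system-determined buffer is about that size. Try using a smaller buffer on the line byte[] targetData = new byte[sourceLine.getBufferSize()];. I'd suggest keeping it small enough that the audio delay is 10 ms or less (882 bytes per channel or less, which for the 16-bit stereo format in the question is 1,764 bytes) for the best user experience.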
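To make the arithmetic above concrete, here is a minimal sketch that converts a buffer size into the time it takes to fill. The class name BufferLatency and the helper bufferMillis are made up for illustration; only the AudioFormat calls come from javax.sound.sampled.

```java
import javax.sound.sampled.AudioFormat;

class BufferLatency {

    /** Milliseconds of audio that a buffer of bufferBytes holds in the given format. */
    static double bufferMillis(AudioFormat format, int bufferBytes) {
        // bytes per second = bytes per frame * frames per second
        double bytesPerSecond = format.getFrameSize() * format.getFrameRate();
        return 1000.0 * bufferBytes / bytesPerSecond;
    }

    public static void main(String[] args) {
        AudioFormat mono = new AudioFormat(44100, 16, 1, true, true);
        AudioFormat stereo = new AudioFormat(44100, 16, 2, true, true); // format from the question

        System.out.println(bufferMillis(mono, 88200));   // prints 1000.0
        System.out.println(bufferMillis(stereo, 88200)); // prints 500.0
    }
}
```

Passing sourceLine.getBufferSize() to a helper like this is a quick way to see how much of the delay comes from the buffer alone.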

Answer 2

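I would declare a final int for the buffer size. For a 10 ms delay, it is the number of bytes per frame times the number of frames per second, divided by 100. For 16-bit stereo (CD quality) at 44100 fps, that is (4 * 44100) / 100 = 1764 bytes.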
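The same calculation can be written against the AudioFormat so it follows the format if that changes. This is only a sketch; BufferSizing and bufferBytesFor are hypothetical names, not part of the Java Sound API.

```java
import javax.sound.sampled.AudioFormat;

class BufferSizing {

    /** Bytes needed to hold roughly `millis` of audio, always a whole number of frames. */
    static int bufferBytesFor(AudioFormat format, int millis) {
        // frames in the requested time span, rounded to the nearest whole frame
        int frames = Math.round(format.getFrameRate() * millis / 1000f);
        // never return less than one frame
        return Math.max(1, frames) * format.getFrameSize();
    }

    public static void main(String[] args) {
        AudioFormat format = new AudioFormat(44100, 16, 2, true, true);
        System.out.println(bufferBytesFor(format, 10)); // prints 1764
    }
}
```

Because the result is computed in frames first, it is always a multiple of the frame size.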

Then open both the TargetDataLine and the SourceDataLine with that buffer size, using the open(format, bufferSize) overload.

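Check that the lines actually use the size you specified (getBufferSize() reports what was allocated), and use the verified value in the new byte[] declaration for the buffer array.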

You can also use the constant in the reads and writes.

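You will probably need to play with the value to get it optimal, so it makes sense to define it only once and avoid multiple edits. The value must correspond to the number of bytes needed for a whole number of frames in each read or write. Too high a value adds latency; too low a value increases the chance of dropouts.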
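Putting those steps together, a pass-through loop might look roughly like the sketch below. It is an illustration under assumptions, not the exact code from this answer: the class name LowLatencyListen and the helper chunkBytes are invented here, and it needs a machine with a working microphone and speakers. The Java Sound calls used (AudioSystem.getTargetDataLine, getSourceDataLine, open(format, bufferSize), getBufferSize) are standard.

```java
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioSystem;
import javax.sound.sampled.LineUnavailableException;
import javax.sound.sampled.SourceDataLine;
import javax.sound.sampled.TargetDataLine;

class LowLatencyListen {

    // 10 ms of 16-bit stereo at 44100 frames per second: (4 * 44100) / 100
    static final int BUFFER_SIZE = 1764;

    /** Largest whole-frame chunk that fits the requested size and both line buffers. */
    static int chunkBytes(int requested, int targetActual, int sourceActual, int frameSize) {
        int smallest = Math.min(requested, Math.min(targetActual, sourceActual));
        return Math.max(frameSize, smallest - smallest % frameSize);
    }

    public static void main(String[] args) throws LineUnavailableException {
        AudioFormat format = new AudioFormat(44100, 16, 2, true, true);

        TargetDataLine targetLine = AudioSystem.getTargetDataLine(format); // microphone
        SourceDataLine sourceLine = AudioSystem.getSourceDataLine(format); // speakers

        // Request the small buffer; the mixer is free to allocate a different size.
        targetLine.open(format, BUFFER_SIZE);
        sourceLine.open(format, BUFFER_SIZE);

        // Verify what was actually allocated and size the array from that.
        System.out.println("target buffer: " + targetLine.getBufferSize());
        System.out.println("source buffer: " + sourceLine.getBufferSize());
        byte[] data = new byte[chunkBytes(BUFFER_SIZE, targetLine.getBufferSize(),
                sourceLine.getBufferSize(), format.getFrameSize())];

        targetLine.start();
        sourceLine.start();

        while (true) {
            int numBytesRead = targetLine.read(data, 0, data.length);
            if (numBytesRead <= 0) break; // line was stopped or closed
            // any processing of data[0..numBytesRead) would go here
            sourceLine.write(data, 0, numBytesRead);
        }
    }
}
```

If the printed sizes are much larger than BUFFER_SIZE, the mixer ignored the request, and the remaining delay comes from the line buffers rather than from the array.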

10 ms performs pretty well, especially if you aren't dealing with overly percussive sounds.

Reducing delay when playing microphone audio in Java
