使用Python SDK转录实时音频?

时间:2020-10-24 02:22:23

标签: python amazon-web-services real-time

我正在尝试使用AWS转录麦克风中的音频。我有一个Python代码,可以记录来自麦克风的音频,在这里使用了示例代码: https://github.com/awslabs/amazon-transcribe-streaming-sdk 使用Python AWS SDK转录录制的音频。我试图实时转录音频,而不是创建录音然后转录。有人可以指出我正确的方向吗?

import pyaudio
import wave
 
FORMAT = pyaudio.paInt16
CHANNELS = 1
RATE = 44100
CHUNK = 1024
RECORD_SECONDS = 5
WAVE_OUTPUT_FILENAME = "file.wav"
 
audio = pyaudio.PyAudio()
 
# start Recording
stream = audio.open(format=FORMAT, channels=CHANNELS,
                rate=RATE, input=True,
                frames_per_buffer=CHUNK)
print("recording...")
frames = []
 
for i in range(0, int(RATE / CHUNK * RECORD_SECONDS)):
    data = stream.read(CHUNK)
    frames.append(data)
print("finished recording")
 
 
# stop Recording
stream.stop_stream()
stream.close()
audio.terminate()
 
waveFile = wave.open(WAVE_OUTPUT_FILENAME, 'wb')
waveFile.setnchannels(CHANNELS)
waveFile.setsampwidth(audio.get_sample_size(FORMAT))
waveFile.setframerate(RATE)
waveFile.writeframes(b''.join(frames))
waveFile.close()

0 个答案:

没有答案