Question

我是一个初学者，我正在将音频文件转换为mfccs，我已经为一个文件完成了操作，但是不知道如何遍历所有数据集。我在Training文件夹中有多个文件夹，其中一个是001（0），从中转换了一个wav文件。我要转换Training文件夹中存在的所有文件夹的wav文件

import os
import numpy as np
import matplotlib.pyplot as plt
from glob import glob
import scipy.io.wavfile as wav
from python_speech_features import mfcc, logfbank

# Read the input audio file
(rate,sig) = wav.read('Downloads/DataVoices/Training/001(0)/001000.wav')


# Take the first 10,000 samples for analysis
sig = sig[:10000]
features_mfcc = mfcc(sig,rate)

# Print the parameters for MFCC
print('\nMFCC:\nNumber of windows =', features_mfcc.shape[0])
print('Length of each feature =', features_mfcc.shape[1])

# Plot the features
features_mfcc = features_mfcc.T
plt.matshow(features_mfcc)
plt.title('MFCC')

# Extract the Filter Bank features
features_fb = logfbank(sig, rate)

# Print the parameters for Filter Bank 
print('\nFilter bank:\nNumber of windows =', features_fb.shape[0])
print('Length of each feature =', features_fb.shape[1])

# Plot the features
features_fb = features_fb.T
plt.matshow(features_fb)
plt.title('Filter bank')

plt.show()

Answer 1

您可以对通配符使用glob来查找所有的wav文件。

for f in glob.glob(r'Downloads/DataVoices/Training/**/*.wav', recursive=True):
     (rate,sig) = wav.read(f)
     # Rest of your code

转换为MFCCC时如何遍历音频文件

1 个答案: