我想将一些flac声音文件输入keras模型。使用wavfiles我可以做(设计示例,一次使用一个音频文件)
import scipy.io.wavfile
import numpy as np
import keras
from keras.models import Sequential
from keras.layers import Dense, Dropout, Activation
from keras.optimizers import SGD
path = 'path/to/file.wav'
_, audio = scipy.io.wavfile.read(path)
dataset = [audio, audio]
x_train = np.array(dataset)
y_train = keras.utils.to_categorical([0, 1], num_classes=2)
model = Sequential()
model.add(Dense(32, activation='relu', input_shape=x_train[0].shape))
model.add(Dense(2, activation='softmax'))
model.compile(optimizer='rmsprop', loss='categorical_crossentropy', metrics=['accuracy'])
model.fit(x_train, y_train, epochs=10, batch_size=32)
如何使用flac文件执行此操作?
答案 0 :(得分:0)
soundfile包可以以numpy数组兼容格式加载flac文件
import numpy as np
import soundfile as sf
import keras
from keras.models import Sequential
from keras.layers import Dense, Dropout, Activation
from keras.optimizers import SGD
path = 'path/to/file.flac'
data, samplerate = sf.read(path)
dataset = [data, data]
x_train = np.array(dataset)
y_train = keras.utils.to_categorical([0, 1], num_classes=2)
model = Sequential()
model.add(Dense(32, activation='relu', input_shape=x_train[0].shape))
model.add(Dense(2, activation='softmax'))
model.compile(optimizer='rmsprop', loss='categorical_crossentropy', metrics=['accuracy'])
model.fit(x_train, y_train, epochs=10, batch_size=32)
forkable sscce https://www.kaggle.com/morenoh149/flac-keras-hello-world