DL

时间:2018-10-22 07:57:21

标签: python pandas tensorflow keras

我收到此错误,选择了纪元后:ValueError:输入数组的样本数应与目标数组相同。找到5516个输入样本和12870个目标样本。欢迎任何建议。预先感谢

我正在使用包含大量分类变量的数据集,并且在创建了虚拟变量之后,它们的总和为95,直到我选择时期数完美无缺地运行为止,然后出现此错误的代码是什么,此错误的原因是什么,它很重要,一个,我将来可以使用,第二个,我无法继续:)

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

dataset = pd.read_csv('zrpl_data.csv')
X = dataset.iloc[:, 0:6].values
y = dataset.iloc[:, 6].values

from sklearn.preprocessing import LabelEncoder, OneHotEncoder
labelencoder_X_1 = LabelEncoder()
X[:, 0] = labelencoder_X_1.fit_transform(X[:, 0])

labelencoder_X_2 = LabelEncoder()
X[:, 1] = labelencoder_X_2.fit_transform(X[:, 1])
labelencoder_X_3 = LabelEncoder()
X[:, 2] = labelencoder_X_1.fit_transform(X[:, 2])

labelencoder_X_4 = LabelEncoder()
X[:, 3] = labelencoder_X_1.fit_transform(X[:, 3])

labelencoder_X_5 = LabelEncoder()
X[:, 4] = labelencoder_X_1.fit_transform(X[:, 4])

onehotencoder = OneHotEncoder(categorical_features = [0,1,2,3,4])
X = onehotencoder.fit_transform(X).toarray()
X = X[:, 1:]

from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=0.3, 
random_state = 0)

from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

import keras
from keras.models import Sequential
from keras.layers import Dense

classifier = Sequential()

classifier.add(Dense(output_dim=47, 
init='uniform',activation='relu',input_dim=95)) 

classifier.add(Dense(output_dim=47, init='uniform',activation='relu'))
classifier.add(Dense(output_dim=1, 
init='uniform',activation='sigmoid'))

classifier.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

classifier.fit (X_train, y_train, batch_size=10,nb_epoch=100)

1 个答案:

答案 0 :(得分:0)

在训练模型之前,您有5516个特征样本和12870个目标样本(应该相等),