AttributeError: 'DataFrame' object has no attribute 'data'

Time: 2019-12-02 01:31:54

Tags: python pandas scikit-learn

When I try to run the code below, I keep getting the error *AttributeError: 'DataFrame' object has no attribute 'data'*. I tried checking with data_sets.head() and got the error *AttributeError: 'dict' object has no attribute 'head'*.

dataDir = '/content/drive/My Drive/Colab Notebooks/Final/dataQ2/' # Directory with input files
trainFile = 'q2train.csv' # Training examples
labelFile = 'q2label.csv' # Test label
validFile = 'q2valid.csv' # Valid Files

data_sets = {
      'train' : train,
      'label' : label,
      'valid' : valid}

def get_data(data_set_name, test_prop=0.2, seed=2019):
  """returns data for training, testing, and data characteristics"""
  data = data_sets[data_set_name]
  X, y = data.data, data.target
  X_train, X_test, y_train, y_test = train_test_split(X, y, 
                                                      test_size=test_prop, 
                                                      random_state=seed)
  nF = X.shape[1] # number of features
  nC = len(np.unique(y)) # number of classes
  nTrain, nTest = len(y_train), len(y_test)
  return X_train, X_test, y_train, y_test, nF, nC, nTrain, nTest

for name in data_sets:
  X_train, X_test, y_train, y_test, nF, nC, nTrain, nTest = get_data(name)

Any help is greatly appreciated, and apologies in advance if this is a silly question.

1 Answer:

Answer 0: (score: 0)

I ran into the same problem. I resolved this error by updating pandas from 1.0.1 to 1.0.5.
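
Separately from the pandas version, both messages in the question can be explained by the types involved. `.data` and `.target` are attributes of the Bunch objects returned by scikit-learn's built-in loaders (for example `load_iris()`), not of a pandas DataFrame, and `data_sets` itself is a plain dict, so `.head()` has to be called on one of its values. Below is a minimal sketch of one way to adapt `get_data`. It assumes that `q2train.csv` holds the features and `q2label.csv` holds one label per training row; the question does not show the files, so the small DataFrames and column names here are hypothetical stand-ins.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical stand-ins for pd.read_csv(dataDir + trainFile) and
# pd.read_csv(dataDir + labelFile); the real column names are unknown.
train = pd.DataFrame({
    "f1": [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0],
    "f2": [1, 0, 1, 0, 1, 0, 1, 0, 1, 0],
})
label = pd.DataFrame({"y": [0, 1, 0, 1, 0, 1, 0, 1, 0, 1]})

data_sets = {"train": train, "label": label}

# head() is a DataFrame method, so call it on a value of the dict,
# not on the dict itself.
preview = data_sets["train"].head()


def get_data(features, labels, test_prop=0.2, seed=2019):
    """Return train/test splits and basic characteristics of the data."""
    X = features.to_numpy()        # 2-D array of feature values
    y = labels.to_numpy().ravel()  # flatten the single label column to 1-D
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_prop, random_state=seed
    )
    nF = X.shape[1]                # number of features
    nC = len(np.unique(y))         # number of classes
    nTrain, nTest = len(y_train), len(y_test)
    return X_train, X_test, y_train, y_test, nF, nC, nTrain, nTest


X_train, X_test, y_train, y_test, nF, nC, nTrain, nTest = get_data(
    data_sets["train"], data_sets["label"]
)
```

With this layout the loop over every key of `data_sets` is no longer needed, because the features and labels are passed in together. If the labels are actually a column inside the training file, that column could be selected from the DataFrame instead.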