Python scikit-learn error: ValueError: 'Found input variables with inconsistent numbers of samples: [4, 10]'

Time: 2017-11-01 02:10:21

Tags: python scikit-learn

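I am trying to run a Python program that uses scikit-learn to do binary classification of 'win' or 'loss' for hockey games, based on 2 arrays: 1 array of stats for each team that played in the game (such as currentInjuries and Goals Against). I am used to debugging in Java, but in Python the stack trace is much longer than what I am used to.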

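Should I look at the whole stack trace, or just pick out the most relevant error and focus on solving that? That is, the bottom-most error, which is: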

File "C:\Users\Thomas\Anaconda3\lib\site-packages\sklearn\utils\validation.py", line 181, in check_consistent_length
    " samples: %r" % [int(l) for l in lengths])
ValueError: Found input variables with inconsistent numbers of samples: [4, 10]

Here is the whole stack trace:

pydev debugger: process 1084 is connecting

C:\Users\Thomas\Anaconda3\lib\site-packages\sklearn\cross_validation.py:44: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20.
  "This module will be removed in 0.20.", DeprecationWarning)
C:\Users\Thomas\Anaconda3\lib\site-packages\sklearn\lda.py:6: DeprecationWarning: lda.LDA has been moved to discriminant_analysis.LinearDiscriminantAnalysis in 0.17 and will be removed in 0.19
  "in 0.17 and will be removed in 0.19", DeprecationWarning)
canucks_stats: [3, 11, 2, 2]
opposing_stats: [1, 4, 4, 15]
Backend Qt5Agg is interactive backend. Turning interactive mode on.
Traceback (most recent call last):
  File "C:\Program Files (x86)\JetBrains\PyCharm Community Edition 2016.3.2\helpers\pydev\pydevd.py", line 1596, in <module>
    globals = debugger.run(setup['file'], None, None, is_module)
  File "C:\Program Files (x86)\JetBrains\PyCharm Community Edition 2016.3.2\helpers\pydev\pydevd.py", line 974, in run
    pydev_imports.execfile(file, globals, locals)  # execute the script
  File "C:\Program Files (x86)\JetBrains\PyCharm Community Edition 2016.3.2\helpers\pydev\_pydev_imps\_pydev_execfile.py", line 18, in execfile
    exec(compile(contents+"\n", file, 'exec'), glob, loc)
  File "C:/Users/Thomas/Desktop/!UFV/CIS480/project/NHL-Predictor.py", line 196, in <module>
    for name, (X, y) in [('moon', toms_make_moons(noise=0.3, random_state=0))#,
  File "C:/Users/Thomas/Desktop/!UFV/CIS480/project/NHL-Predictor.py", line 113, in toms_make_moons
    X, y = util_shuffle(X, y, random_state=generator)
  File "C:\Users\Thomas\Anaconda3\lib\site-packages\sklearn\utils\__init__.py", line 285, in shuffle
    return resample(*arrays, **options)
  File "C:\Users\Thomas\Anaconda3\lib\site-packages\sklearn\utils\__init__.py", line 204, in resample
    check_consistent_length(*arrays)
  File "C:\Users\Thomas\Anaconda3\lib\site-packages\sklearn\utils\validation.py", line 181, in check_consistent_length
    " samples: %r" % [int(l) for l in lengths])
ValueError: Found input variables with inconsistent numbers of samples: [4, 10]

Process finished with exit code 1

1 Answer:

Answer 0 (score: -2)

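The bottom-most error tells you what went wrong. The rest of the stack trace helps you find where it went wrong: read upward from the bottom until you reach a frame in your own file. In this trace that is NHL-Predictor.py, line 113, in toms_make_moons, where util_shuffle(X, y, random_state=generator) is called.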
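As a rough illustration of reading a trace this way, the sketch below uses the standard-library traceback module to separate the "what" (the exception message) from the "where" (the deepest frame that is not library code). The helper names user_frames and check_lengths are hypothetical, check_lengths is only a stand-in for a library check, and the path markers used for filtering are an assumption based on the paths in the trace above.

```python
import traceback


def user_frames(tb, skip_markers=("site-packages", "pydev")):
    """Return traceback frames whose file path contains none of skip_markers.

    The markers are an assumption: they match the library and debugger
    paths that appear in the trace from the question.
    """
    return [
        frame
        for frame in traceback.extract_tb(tb)
        if not any(marker in frame.filename for marker in skip_markers)
    ]


def check_lengths(a, b):
    # Hypothetical stand-in for a library-side consistency check.
    if len(a) != len(b):
        raise ValueError(
            "inconsistent numbers of samples: %r" % [len(a), len(b)]
        )


try:
    check_lengths([0] * 4, [0] * 10)
except ValueError as exc:
    frames = user_frames(exc.__traceback__)
    last = frames[-1]    # deepest frame that survived the filter
    message = str(exc)   # the "what": same text as the last trace line
    print("what: ", message)
    print("where:", last.name, "line", last.lineno)
```

Here the stand-in check lives in the same file, so it is reported as the deepest frame. With a real installed library, its frames would be filtered out and the last remaining frame would be the call site in your own script.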

When you post a question, always post the full stack trace (so readers can see the line where the error was raised). See how to ask.

Regarding the error ValueError: Found input variables with inconsistent numbers of samples: [4, 10]

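X should be a 2-D array of shape [number_of_samples, number_of_features], and y must have the same number of samples as X. The message says the two inputs have 4 and 10 samples respectively, so check the shapes of the arrays you pass in.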
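A minimal sketch of such a shape check, using NumPy. The two stat lists are taken from the output in the question; everything else is an assumption for illustration: the 10-element y is a made-up placeholder chosen to match the "10" in the error message, sample_counts is a hypothetical helper, and how X and y are really built inside toms_make_moons is not shown in the question.

```python
import numpy as np

# Values printed by the script in the question.
canucks_stats = [3, 11, 2, 2]
opposing_stats = [1, 4, 4, 15]

# Hypothetical labels: 10 entries, only to reproduce the mismatch.
y = np.zeros(10, dtype=int)

# One row per sample, one column per feature -> shape (4, 2).
X = np.column_stack([canucks_stats, opposing_stats])


def sample_counts(*arrays):
    """Return the first-axis length (number of samples) of each array."""
    return [np.asarray(a).shape[0] for a in arrays]


counts = sample_counts(X, y)
print("X.shape =", X.shape)   # (4, 2)
print("y.shape =", y.shape)   # (10,)
print("samples =", counts)    # [4, 10]

if len(set(counts)) > 1:
    print("Mismatch: X and y must have the same number of rows.")
```

Printing the shapes right before the failing call (line 113 in the trace) should show which of the two arrays has the unexpected length. The fix is then to build X and y from the same set of games, so that both have one entry per game.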

Here are some references:

ref1: ValueError: Found arrays with inconsistent numbers of samples [ 6 1786]

ref2: Loading jpg of different sizes into numpy.array - ValueError: Found input variables with inconsistent numbers of samples

ref3: Number of features of the model must match the input

ref4: Found input variables with inconsistent numbers of samples