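My code doesn't work, and I don't know how to get the CSV file read in as an array of integers. Any help would be greatly appreciated. I'm still a beginner.
The error occurs on line 17, "fit = bestfeatures.fit(X,y)".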
import pandas as pd
from matplotlib import pyplot
import seaborn as sns
from sklearn.feature_selection import SelectKBest
from sklearn.feature_selection import chi2
from sklearn.datasets import load_digits
from pandas import read_csv
from pandas.plotting import scatter_matrix
filename = '/Users/rahulparmeshwar/Documents/Algo Bots/Data/Live Data/Tester.csv'
data = read_csv(filename)
bestfeatures = SelectKBest(k=all)
y = data['Vol']
X = data.drop('Open',axis=1)
fit = bestfeatures.fit(X,y)
dfscores = pd.DataFrame(fit.scores_)
dfcolumns = pd.DataFrame(X.columns)
featurescores = pd.concat([dfscores,dfcolumns],axis=1)
correlation = data.corr()
print(featurescores.nlargest(2,'Score'))
pd.set_option('display.width',100)
data.head(1)
print(data)
scatter_matrix(data)
pyplot.show()
Answer 0 (score: 0)
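SelectKBest(k=all) must be SelectKBest(k="all"). Without the quotes, all is Python's built-in function all(); with the quotes, "all" is a string, which is the value the k parameter accepts when you want to keep every feature.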
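As a rough illustration of the fix, here is a minimal sketch that runs without the original CSV. The DataFrame below is synthetic stand-in data, and its column names (other than Open and Vol, which appear in the question) are assumptions. The sketch also makes two choices that go beyond the fix itself and may not match what the asker intended: it drops Vol from X so the target is not scored against itself, and it passes f_regression as the scoring function on the assumption that Vol is a continuous value.

```python
import numpy as np
import pandas as pd
from sklearn.feature_selection import SelectKBest, f_regression

# Synthetic stand-in for Tester.csv; the values and most column names are assumptions.
rng = np.random.default_rng(0)
data = pd.DataFrame(
    rng.integers(1, 100, size=(50, 5)),
    columns=["Open", "High", "Low", "Close", "Vol"],
)

y = data["Vol"]
X = data.drop("Vol", axis=1)  # assumption: the target should not also be a feature

# k="all" (a string) keeps every feature; k=all would pass the built-in function.
selector = SelectKBest(score_func=f_regression, k="all")
fit = selector.fit(X, y)

# Name the columns explicitly so that nlargest(2, "Score") can find "Score".
featurescores = pd.DataFrame({"Feature": X.columns, "Score": fit.scores_})
print(featurescores.nlargest(2, "Score"))
```

Building the score table with named columns is one way to make the later nlargest(2, 'Score') call work, since a DataFrame built from fit.scores_ alone has no column called Score.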