import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats
dataFileName='RFInput.xlsx'
sheetName='Rawdata'
sheetNamePara='paraList'
# Note: the keyword is sheet_name; the old sheetname spelling was removed from pandas.
dataRaw=pd.read_excel(dataFileName, sheet_name = sheetName)
datapara=pd.read_excel(dataFileName, sheet_name = sheetNamePara)
noData=len(dataRaw)
# sklearn.cross_validation was removed; these helpers now live in sklearn.model_selection.
from sklearn.model_selection import train_test_split
from sklearn.model_selection import cross_val_score
from sklearn.preprocessing import StandardScaler
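# Select columns by name: indexing dataRaw with the whole datapara DataFrame does not pick columns.
labels = datapara.columns.tolist()
x = dataRaw[labels]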
y = dataRaw['classVariable']
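In RFInput.xlsx, the sheet named "paraList" holds the list of parameters whose data I need to extract from Rawdata. In paraList, the first row contains the variable names, and in the second row I have marked each variable's category as Y or N. I want to read the data for the Y-category variables into x_Y and the data for the N-category variables into x_N.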
classVariable Category Group Category.pare Status.dist
N N Y N
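
One possible approach, as a minimal sketch rather than a definitive answer: since `pd.read_excel` treats the first row of paraList as the header, the Y/N flags should end up in the first data row of `datapara`, so the column names can be filtered by that row. The small in-memory frames and the names `varA` to `varD` below are hypothetical stand-ins for the two sheets, used only so the snippet runs without the Excel file.

```python
import pandas as pd

# Hypothetical stand-in for pd.read_excel('RFInput.xlsx', sheet_name='paraList'):
# header row = variable names, first data row = Y/N flags.
datapara = pd.DataFrame(
    [["N", "N", "Y", "N"]],
    columns=["varA", "varB", "varC", "varD"],
)

# Hypothetical stand-in for the Rawdata sheet.
dataRaw = pd.DataFrame({
    "varA": [1, 2, 3],
    "varB": [4, 5, 6],
    "varC": [7, 8, 9],
    "varD": [10, 11, 12],
})

# Take the flag row as a Series indexed by variable name and normalize it
# (strip stray spaces, upper-case) so ' y' and 'Y' are treated alike.
flags = datapara.iloc[0].astype(str).str.strip().str.upper()

# Collect the variable names for each category.
y_cols = flags[flags == "Y"].index.tolist()
n_cols = flags[flags == "N"].index.tolist()

# Pull the matching columns out of the raw data.
x_Y = dataRaw[y_cols]
x_N = dataRaw[n_cols]
```

With the real workbook, `datapara` and `dataRaw` would come from the two `read_excel` calls in the question; if a name listed in paraList is missing from Rawdata, the column selection raises a `KeyError`, which is a useful check on the sheet contents.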