如何按类别读取数据以分隔varibales

时间:2018-05-08 14:07:10

标签: python-3.x pandas dataframe

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats
dataFileName='RFInput.xlsx'
sheetName='Rawdata'
sheetNamePara='paraList'
dataRaw=pd.read_excel(dataFileName, sheetname = sheetName)
datapara=pd.read_excel(dataFileName, sheetname = sheetNamePara)

noData=len(dataRaw)
import matplotlib.pylab as plt
from sklearn.cross_validation import train_test_split
from sklearn.cross_validation import cross_val_score
from sklearn.preprocessing import StandardScaler


labels = datapara
x = dataRaw[labels]
y = dataRaw['classVariable']

在RFInput.xlsx中,sheetname =“paraList”,我有一些参数列表,我需要从Rawdata中提取数据。在paraList中,第一行是变量的名称,在第二行中,我将每个变量的类别标记为Y或N.我想将Y类变量数据读入x_Y,将N类变量数据读入x_N。 / p>

classVariable   Category    Group Category.pare Status.dist
N                 N                Y               N

0 个答案:

没有答案