使用熊猫分类数据

时间:2018-10-20 10:48:55

标签: python pandas machine-learning categorical-data chi-squared

enter image description here我正在尝试对数据集进行卡方检验,为此,我需要使用pd.cut()在数据集中制定类别。但是,我收到此错误

  

ufunc'subtract'不包含签名匹配类型的循环   dtype('

我的代码:

import pandas as pd
import numpy as np
import scipy as sp
import math

data_main = pd.read_csv("sample_survey.csv")
data = data_main.iloc[:, [1,2]]

data["wrkstat"] = data["wrkstat"].astype(str)
data["marital"] = data["marital"].astype(str)
cols = ['wrkstat', 'marital']

cut_points = ['Divorced', 'Married', 'Never Married', 'Seperated','Widowed']
label_names = ['Divorced1', 'Married', 'Never Married', 
'Seperated','Widowed']
data["Marital_Categories"] = pd.cut(data["marital"], cut_points)

marital_by_wrkstat = data[['wrkstat', 'marital_categories']]
marital_by_wrkstat.head()

0 个答案:

没有答案