python ValueError:

时间:2019-05-20 14:05:37

标签: pandas dummy-variable

我正尝试使用以下代码创建伪变量

    cat_vars=['Gender', 'Recidence', 'CivilStatus', 'HighestEducationQualification', 'ExtraCurricularActivities', 'ApparelRelatedVocationalQualification', 'PreviouseJob', 'ExperienceSection', 'RelativesInApparel', 'SpousesOccupation', 'FamilyOppinionAboutTheJob', 'ExpectationOfDoingTheJob', 'AvailabilityOfTransportNearTheResidence', 'ReasonForChooseApparel', 'PreviousWorkPlace', 'ContributionToTheFamilyIncome', 'PersonalImpression', 'AccommodationFee', 'RetentionCategory', 'SelectedDepartment', 'ChildrenLessThan5Years', 'NumberOfChilren', 'ApparelExperience', 'ReasonForLeaving', 'MedicalTest', 'ExpectedSalary', 'ExperienceDurationMonths', 'IQTestScore', 'LastBasicSalary', 'InterviewedBy', 'FollowingExternalCourses', 'Is_Rejoined_Employee__c']
for var in cat_vars:
    cat_list='var'+'_'+var
    cat_list = pd.get_dummies(df[var], prefix=var)
    data1=df.join(cat_list)

但是它总是抛出该错误

  

列重叠但未指定后缀:Index(['Gender_Female',   'Gender_Male'],dtype ='object')

0 个答案:

没有答案