How to convert Python categorical variables to R factor variables? (reticulate)

Time: 2019-01-16 00:15:47

Tags: python r categorical-data reticulate

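I am using reticulate to convert a .pkl file saved from Python into an R data frame. The pandas categorical type is carried over into the R data frame unchanged. I guess R doesn't know how to handle Python categorical variables. How can I convert them to factor variables in R?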

#Looking at the class of some of the variables in the R dataframe, I get

class(df$variable)

$variable
[1] "pandas.core.arrays.categorical.Categorical" "pandas.core.arrays.base.ExtensionArray"
[3] "pandas.core.base.PandasObject"              "pandas.core.base.StringMixin"
[5] "pandas.core.accessor.DirNamesMixin"         "python.builtin.object"

#I get the right levels, but this doesn't seem to be R-friendly 

> table(levels(factor(as.character(df$variable))))

[level 1, level 2, level 3, level 4, level 5, ..., NaN, NaN, NaN, NaN, 
NaN]\nLength: 464121\nCategories (6, object): [level 1 < level 2 < level 3 
< level 4 < level 5 < level 6] 
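
One possible workaround is to strip the Categorical wrapper on the Python side before the data frame reaches R. This is only a minimal sketch: it assumes the pickle holds a pandas DataFrame, and `split_categoricals` is a hypothetical helper name, not part of pandas or reticulate. It turns each categorical column into a plain object column and returns the category labels and the ordered flag separately, so the factor can be rebuilt later.

```python
import pandas as pd
from pandas.api.types import CategoricalDtype


def split_categoricals(df):
    """Hypothetical helper: return (plain_df, levels).

    plain_df is a copy of df whose categorical columns are plain object
    columns (missing values stay missing). levels maps each such column
    name to its category labels and its ordered flag.
    """
    plain = df.copy()
    levels = {}
    for name in df.columns:
        col = df[name]
        if isinstance(col.dtype, CategoricalDtype):
            levels[name] = {
                "levels": [str(c) for c in col.cat.categories],
                "ordered": bool(col.cat.ordered),
            }
            # astype(object) drops the Categorical wrapper but keeps NaN
            plain[name] = col.astype(object)
    return plain, levels


# Made-up example data that mimics the column in the question
example = pd.DataFrame({
    "variable": pd.Categorical(
        ["level 1", None, "level 3"],
        categories=["level 1", "level 2", "level 3"],
        ordered=True,
    ),
    "x": [1, 2, 3],
})
plain_example, level_info = split_categoricals(example)
```

On the R side, a column converted this way should arrive as an ordinary vector, which could then be passed to `factor()` with the saved labels as `levels` and the saved flag as `ordered`. I have not verified this against every reticulate version.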

0 answers:

No answers