使用rxFactors重新标记级别

时间:2019-01-31 12:05:02

标签: r bigdata revoscaler

我有一个XDF文件(nyc_xdf_lab)。在文件中,我有一个名为“ RatecodeID”的整数列和一个名为“ payment_type”的因子列(四个级别)。

我已经使用以下代码将“ RatecodeID”从整数更改为factor,并为级别创建了新标签。这很好。当我尝试重新标记“ payment_type”的级别时,出现以下错误:“您正尝试将单个因素级别重新编码为多个级别。有问题的级别为:“ NA”。

感谢您的任何建议。

ratecodeID_labels <- c("Standard rate", "JFK", "Newark", "Nassau or Westchester", "Negotiated fare", "Group ride")

paymenttype_labels <- c("Credit card", "Cash") #there are four levels under payment_type, I want any levels not specified here to be 'missing'. 

mynewdata <- rxFactors(inData = nyc_xdf_lab, 
                   outFile = nyc_xdf_lab, 
                   overwrite=TRUE,
                   factorInfo = list(
                     Ratecode_type_desc = list(varName = "RatecodeID", levels = 1:6, newLevels = ratecodeID_labels), 
                     payment_type_desc = list(varName = "payment_type", newLevels = paymenttype_labels)))

0 个答案:

没有答案