将所有列(因子转换为数字)而不影响rownames / colnames

时间:2016-03-09 01:06:12

标签: r numeric

对于我的示例数据集,我使用以下代码将数据从因子转换为数字:

sample = as.data.frame(lapply(sample, function(x) as.numeric(as.character(x))))

然后使用以下代码将所有NA值替换为0:

sample[is.na(sample)] = 0

但是,当我从因子转换为数字时,列名称会更改,rownames会消失。为什么会发生这种情况?如何在将所有列转换为数字时阻止它发生?

dput(sample)
structure(list(`2015-10-08 00:05:00` = structure(c(NA, NA, NA, 
NA, 2L, NA), .Names = c("72", "79", "82", "83", "116", "120"), .Label = c(" 1", 
" 2", " 3", " 5", "2015-10-08 00:05:00"), class = "factor"), 
    `2015-10-08 00:12:00` = structure(c(NA, 1L, NA, NA, NA, NA
    ), .Names = c("72", "79", "82", "83", "116", "120"), .Label = c(" 1", 
    " 2", " 3", "2015-10-08 00:12:00"), class = "factor"), `2015-10-08 00:34:00` = structure(c(NA, 
    NA, NA, NA, 1L, NA), .Names = c("72", "79", "82", "83", "116", 
    "120"), .Label = c(" 1", " 2", " 3", " 4", "2015-10-08 00:34:00"
    ), class = "factor"), `2015-10-08 00:40:00` = structure(c(NA_integer_, 
    NA_integer_, NA_integer_, NA_integer_, NA_integer_, NA_integer_
    ), .Names = c("72", "79", "82", "83", "116", "120"), .Label = c(" 1", 
    " 2", "2015-10-08 00:40:00"), class = "factor"), `2015-10-08 01:32:00` = structure(c(NA, 
    NA, 1L, 1L, 3L, NA), .Names = c("72", "79", "82", "83", "116", 
    "120"), .Label = c(" 1", " 2", " 3", " 4", " 6", " 8", "2015-10-08 01:32:00"
    ), class = "factor"), `2015-10-08 01:52:00` = structure(c(1L, 
    NA, NA, NA, NA, NA), .Names = c("72", "79", "82", "83", "116", 
    "120"), .Label = c(" 1", " 2", " 3", "2015-10-08 01:52:00"
    ), class = "factor")), .Names = c("2015-10-08 00:05:00", 
"2015-10-08 00:12:00", "2015-10-08 00:34:00", "2015-10-08 00:40:00", 
"2015-10-08 01:32:00", "2015-10-08 01:52:00"), row.names = c("72", 
"79", "82", "83", "116", "120"), class = "data.frame")

1 个答案:

答案 0 :(得分:4)

您可以使用data.frame代替as.data.framecheck.names=F告诉函数保留列名。使用row.names继承行名称。

顺便说一句,尽量不要将sample用作R中的变量名,因为它是R的保留字。

d1 = data.frame(lapply(d1, function(x) as.numeric(as.character(x))),
                   check.names=F, row.names = rownames(d1))
d1[is.na(d1)] = 0