R:将列乘以数据表中的常量

时间:2013-11-06 10:07:50

标签: r data.table

我正在尝试更正我的数据表,因此我的列具有相同的单位。这是我的一个例子。

hh:mm   A   V   W   kA  V   kW  A   kV  kW
11:00   13.84   470.16  6509.88 14.89   467.85  6964.38 15.74   464.01  7303.13
11:05   12.54   475.17  5959.22 13.40   474.52  6358.89 13.34   473.13  6311.80
11:10   9.73    476.20  4632.14 10.36   473.38  4905.86 10.38   472.73  4907.14
11:15   9.20    479.30  4410.89 9.65    482.79  4659.67 9.73    479.09  4659.33
11:20   11.28   482.22  5437.78 12.03   484.95  5835.33 12.24   476.36  5829.44
11:25   11.66   481.64  5614.56 12.76   479.95  6124.56 12.88   476.86  6139.33
11:30   10.38   475.13  4934.00 11.99   480.96  5760.44 11.50   478.77  5515.13

如您所见,某些列位于“A”,“V”或“W”,而其他列位于“kA”,“kV”或“kW”。我想要做的是通过将“kA”,“kV”和“kW”列乘以1000来将所有这些更改为相同的单位。这是我想要的

hh:mm   A   V   W   A   V   W   A   V   W
11:00   13.84   470.16  6509.88 14890   467.85  6964380 15.74   464010  7303130
11:05   12.54   475.17  5959.22 13400   474.52  6358890 13.34   473130  6311800
11:10   9.73    476.20  4632.14 10360   473.38  4905860 10.38   472730  4907140
11:15   9.20    479.30  4410.89 9650    482.79  4659670 9.73    479090  4659330
11:20   11.28   482.22  5437.78 12030   484.95  5835330 12.24   476360  5829440
11:25   11.66   481.64  5614.56 12760   479.95  6124560 12.88   476860  6139330
11:30   10.38   475.13  4934.00 11990   480.96  5760440 11.50   478770  5515130

我试过这样做:

units<-which(names(dt)=="kA") # Gives me a vector with the positions needed.
dt[,units:=units*1000] #Multiplies the vector by 1000
names(dt) <- gsub("kA", "A", names(dt)) # Changes "kA" to "A"

units<-which(names(dt)=="kV") # Gives me a vector with the positions needed.
dt[,units:=units*1000] #Multiplies the vector by 1000
names(dt) <- gsub("kV", "V", names(dt)) # Changes "kV" to "V"

units<-which(names(dt)=="kW") # Gives me a vector with the positions needed.
dt[,units:=units*1000] #Multiplies the vector by 1000
names(dt) <- gsub("kW", "W", names(dt)) # Changes "kW" to "W"

在我的第二行,我收到此错误:

Warning message:
In `[.data.table`(x2, , `:=`(units, units * 1000)) :
  Supplied 48 items to be assigned to 286 items of column 'units' (recycled leaving remainder of 46 items).

有人可以帮我解决我的语法吗?

PS:这是我输入的输入,但看起来有点好笑。

> dput(c)
structure(list(`hh:mm` = c("11:00", "11:05", "11:10", "11:15", 
"11:20", "11:25", "11:30"), A = c(13.84, 12.54, 9.73, 9.2, 11.28, 
11.66, 10.38), V = c(470.16, 475.17, 476.2, 479.3, 482.22, 481.64, 
475.13), W = c(6509.88, 5959.22, 4632.14, 4410.89, 5437.78, 5614.56, 
4934), kA = c(14.89, 13.4, 10.36, 9.65, 12.03, 12.76, 11.99), 
    V = c(467.85, 474.52, 473.38, 482.79, 484.95, 479.95, 480.96
    ), kW = c(6964.38, 6358.89, 4905.86, 4659.67, 5835.33, 6124.56, 
    5760.44), A = c(15.74, 13.34, 10.38, 9.73, 12.24, 12.88, 
    11.5), kV = c(464.01, 473.13, 472.73, 479.09, 476.36, 476.86, 
    478.77), kW = c(7303.13, 6311.8, 4907.14, 4659.33, 5829.44, 
    6139.33, 5515.13)), .Names = c("hh:mm", "A", "V", "W", "kA", 
"V", "kW", "A", "kV", "kW"), row.names = c(NA, -7L), class = c("data.table", 
"data.frame"), .internal.selfref = <pointer: 0x00000000003b0788>)

1 个答案:

答案 0 :(得分:5)

您并未正确使用data.table。这是一种可以使用的方法:

DT[, c("A2", "W2", "V2") := lapply(.SD, function(x) x*1000), .SDcols=c("kA", "kW", "kV")]
DT[, c("kA", "kW", "kV") := NULL]
DT
#    hh.mm     A      V       W    V.1   A.1    kW.1    A2      W2     V2
# 1: 11:00 13.84 470.16 6509.88 467.85 15.74 7303.13 14890 6964380 464010
# 2: 11:05 12.54 475.17 5959.22 474.52 13.34 6311.80 13400 6358890 473130
# 3: 11:10  9.73 476.20 4632.14 473.38 10.38 4907.14 10360 4905860 472730
# 4: 11:15  9.20 479.30 4410.89 482.79  9.73 4659.33  9650 4659670 479090
# 5: 11:20 11.28 482.22 5437.78 484.95 12.24 5829.44 12030 5835330 476360
# 6: 11:25 11.66 481.64 5614.56 479.95 12.88 6139.33 12760 6124560 476860
# 7: 11:30 10.38 475.13 4934.00 480.96 11.50 5515.13 11990 5760440 478770

有几点需要注意:

  • 您当前的解决方案是否创建了具有重复名称的data.table。不好的做法。我犯了一个错误,就是不在这里转换你的一个专栏,但我会留下更新答案的练习。
  • 如果您要使用data.table,请熟悉set*套功能。

或者使用set

cols <- grep("^k", names(dt))
for (j in cols) {
    set(DT, i=NULL, j=j, value=DT[[j]]*1000)
}
# change names with `setnames` now.
setnames(DT, cols, gsub("^k", "", names(dt)[cols])

虽然我同意@Ananda不使用重复名称。