尝试计算两个列之间的时间差异,但是'difftime'中的时间2在下一个列中。
sample data:
structure(list(code = c(10888, 10888, 10888, 10888, 10888, 10888,
10889, 10889, 10889, 10889, 10889, 10889, 10890, 10890, 10890
), station = c("F1", "F3", "F4", "F5", "L5", "L7", "F1", "F3",
"F4", "L5", "L6", "L7", "F1", "F3", "F5"), a = structure(c(1365895151,
1365969188, 1366105495, 1367433149, 1368005216, 1368011698, 1366244224,
1366414926, 1367513240, 1367790556, 1367946420, 1367923973, 1365896546,
1365907968, 1366144207), class = c("POSIXct", "POSIXt"), tzone = ""),
b = structure(c(1365895316, 1365976904, 1366105495, 1367436539,
1368005233, 1368033855, 1366244224, 1366415643, 1367513840,
1367915506, 1367946597, 1367954061, 1365897164, 1365907968,
1366157867), class = c("POSIXct", "POSIXt"), tzone = "")), .Names = c("code",
"station", "a", "b"), row.names = 2:16, class = "data.frame")
我想计算列b中第一行和第二列中第二行之间的时间。
Difftime很容易:
difftime(test$a, test$b)
然而,我正在努力争取下一行,尝试过:
difftime(test$a, c(test, b[seq_len(.N+1)])[])
和这个主题的变化,但无济于事。
最后,我将通过=代码进行计算,因此difftime仅在具有相同代码的值之间计算。可能使用ddply?或者,= =代码。
答案 0 :(得分:2)
我认为你正在寻找类似的东西:
difftime(head(test$a, -1), tail(test$b, -1))
然后,如果您想将这个想法应用于每个code
,并按照您的建议使用plyr
:
ddply(test, .(code), transform, diff = c(difftime(head(a, -1), tail(b, -1)), NA))
# code station a b diff
# 1 10888 F1 2013-04-13 19:19:11 2013-04-13 19:21:56 -22.709167
# 2 10888 F3 2013-04-14 15:53:08 2013-04-14 18:01:44 -37.863056
# 3 10888 F4 2013-04-16 05:44:55 2013-04-16 05:44:55 -369.734444
# 4 10888 F5 2013-05-01 14:32:29 2013-05-01 15:28:59 -158.912222
# 5 10888 L5 2013-05-08 05:26:56 2013-05-08 05:27:13 -7.955278
# 6 10888 L7 2013-05-08 07:14:58 2013-05-08 13:24:15 NA
# 7 10889 F1 2013-04-17 20:17:04 2013-04-17 20:17:04 -47.616389
# 8 10889 F3 2013-04-19 19:42:06 2013-04-19 19:54:03 -305.253889
# 9 10889 F4 2013-05-02 12:47:20 2013-05-02 12:57:20 -111.740556
# 10 10889 L5 2013-05-05 17:49:16 2013-05-07 04:31:46 -43.344722
# 11 10889 L6 2013-05-07 13:07:00 2013-05-07 13:09:57 -2.122500
# 12 10889 L7 2013-05-07 06:52:53 2013-05-07 15:14:21 NA
# 13 10890 F1 2013-04-13 19:42:26 2013-04-13 19:52:44 -3.172778
# 14 10890 F3 2013-04-13 22:52:48 2013-04-13 22:52:48 -69.416389
# 15 10890 F5 2013-04-16 16:30:07 2013-04-16 20:17:47 NA
答案 1 :(得分:0)
看起来你来自SAS-land,因为.N
有意义......但在R-land中没有任何意义。如果你想获得偏移向量的difftime
,你需要在开始和结束时缩短索引向量:
test$b[-nrow(test$b)] - test$a[-1],
# Since my session of R is locked up with a long computation
# I am posting s simple example.
d$a
# 1 2 3 4 5 6 7
d$a[-1]
# 2 3 4 5 6 7
d$b
# 10 9 8 7 6 5 4
d$b[-nrow(d$b)]
# 10 9 8 7 6 5
还可以使用:
d$b - c( NA, d$a[-1] ) # first element will be NA
> test$b - c( NA, test$a[-1] )
[1] NA "1969-12-31 18:08:36 PST"
[3] "1969-12-31 16:00:00 PST" "1969-12-31 16:56:30 PST"
[5] "1969-12-31 16:00:17 PST" "1969-12-31 22:09:17 PST"
[7] "1969-12-31 16:00:00 PST" "1969-12-31 16:11:57 PST"
[9] "1969-12-31 16:10:00 PST" "1970-01-02 02:42:30 PST"
[11] "1969-12-31 16:02:57 PST" "1970-01-01 00:21:28 PST"
[13] "1969-12-31 16:10:18 PST" "1969-12-31 16:00:00 PST"
[15] "1969-12-31 19:47:40 PST"