如何使用带有参数化日期的tibbletime创建时间序列?

时间:2017-10-12 09:20:09

标签: r tidyverse

我想为特定日期创建一个带有tibbletime的时间序列。 我有:

Data_Start<-"2015-09-07 01:55:00 UTC"
Data_End<-"2015-09-10 01:59:00 UTC"

我想创建一个时间序列,带有分钟样本,例如:

create_series(2015-09-07 + 01:55:00 ~ 2015-09-10 + 01:59:00,1~M)

参数应该是time_formula,如第17页所述: https://cran.r-project.org/web/packages/tibbletime/tibbletime.pdf

这样可行,但我无法传递如下参数:

create_series(Data_Start~Data_End,1~M)

尝试转换字符串已经有了不同的东西,但到目前为止没有任何工作:(

2 个答案:

答案 0 :(得分:3)

此处tibbletime的作者。 GitHub上最近提出了一个问题。解决方案是使用rlang::new_formula()预构建公式。如果使用POSIXct日期,我们还需要一个特殊的辅助函数来处理在公式中添加+

这是帮手:

# Time formula creator
# Can pass character, Date, POSIXct
create_time_formula <- function(lhs, rhs) {

  if(!inherits(lhs, c("character", "Date", "POSIXct"))) {
    stop("LHS must be a character or date")
  }
  if(!inherits(rhs, c("character", "Date", "POSIXct"))) {
    stop("RHS must be a character or date")
  }

  if(inherits(lhs, "Date")) {
    lhs <- as.character(lhs)
  } else if (inherits(lhs, "POSIXct")) {
    lhs <- gsub(" ", " + ", lhs)
  }

  if(inherits(rhs, "Date")) {
    rhs <- as.character(rhs)
  } else if (inherits(rhs, "POSIXct")) {
    rhs <- gsub(" ", " + ", rhs)
  }

  rlang::new_formula(lhs, rhs)
}

使用辅助功能以及开始日期和结束日期的日期版本

Data_Start<- as.POSIXct("2015-09-07 01:55:00")
Data_End  <- as.POSIXct("2015-09-10 01:59:00")

time_formula <- create_time_formula(Data_Start, Data_End)

create_series(time_formula, 1~M, tz = "UTC")

产地:

# A time tibble: 4,325 x 1
# Index: date
                  date
                <dttm>
 1 2015-09-07 01:55:00
 2 2015-09-07 01:56:00
 3 2015-09-07 01:57:00
 4 2015-09-07 01:58:00
 5 2015-09-07 01:59:00
 6 2015-09-07 02:00:00
 7 2015-09-07 02:01:00
 8 2015-09-07 02:02:00
 9 2015-09-07 02:03:00
10 2015-09-07 02:04:00
# ... with 4,315 more rows

tibbletime的未来版本中,我可能会为此案例添加更强大的create_time_formula()辅助函数版本。


更新: tibbletime 0.1.0已经发布,更强大的实现允许直接使用公式中的变量。此外,公式的每一方必须是与现在索引相同的类的字符或对象(即2013 ~ 2014应为"2013" ~ "2014")。

library(tibbletime)

Data_Start<- as.POSIXct("2015-09-07 01:55:00")
Data_End  <- as.POSIXct("2015-09-10 01:59:00")

create_series(Data_Start ~ Data_End, "1 min")
#> # A time tibble: 4,325 x 1
#> # Index: date
#>    date               
#>    <dttm>             
#>  1 2015-09-07 01:55:00
#>  2 2015-09-07 01:56:00
#>  3 2015-09-07 01:57:00
#>  4 2015-09-07 01:58:00
#>  5 2015-09-07 01:59:00
#>  6 2015-09-07 02:00:00
#>  7 2015-09-07 02:01:00
#>  8 2015-09-07 02:02:00
#>  9 2015-09-07 02:03:00
#> 10 2015-09-07 02:04:00
#> # ... with 4,315 more rows

答案 1 :(得分:0)

我创建了具有多个季节性的时间序列,使用forecast()包在上述时间和分钟之间作为频率。季节性时期因您的要求和数据长度而异

library(forecast)
Data_Start<-as.POSIXct("2015-09-07 01:55:00 UTC")
Data_End<-as.POSIXct("2015-09-10 01:59:00 UTC")

df = data.frame(tt = seq.POSIXt(Data_Start,Data_End,"min"),
                val = sample(1:40,4325,replace = T),stringsAsFactors = F)

# Seasonality Hourly, Daily
mts = msts(df$val,seasonal.periods = c(60,1440),start = Data_Start)
# Seasonality Hourly, Daily, Weekly
mts = msts(df$val,seasonal.periods = c(60,1440,10080),start = Data_Start)