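Mapping ISO 8601 week-of-the-year numbers to month-of-the-year numbers on Windows with a German locale

Time: 2017-01-12 15:03:18

Tags: r date iso8601 week-number

This is directly related to my question "POSIX date from dates in weekly time format".

In this question, however, I want to ask specifically how to map ISO 8601 week numbers to month-of-the-year numbers.

To me this seems either impossible or to require some non-intuitive hacks (and even those do not work reliably), so IMO it should be regarded as something that needs fixing in base R. Please correct me if I'm wrong, though.

Edit: The problem seems to be closely tied to running on Windows and/or to the locale you are in (standard German, in my case).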

posix <- as.POSIXct(c("2015-12-24", "2015-12-31", "2016-01-01", "2016-01-08"))

ISO 8601

(yw <- format(posix, "%Y-%V"))
# [1] "2015-52" "2015-53" "2016-53" "2016-01"
ywd <- sprintf("%s-1", yw)
(as.POSIXct(ywd, format = "%Y-%V-%u"))
# [1] "2015-01-12 CET" "2015-01-12 CET" "2016-01-12 CET" "2016-01-12 CET"
# -> utterly wrong!!!

ywd <- sprintf("%s-4", yw)
(as.POSIXct(ywd, format = "%Y-%V-%u"))
# -> still wrong -> the day of the week is not the reason

# -> no way to use ISO 8601 convention to map week of the year to month of the year

For due diligence: it does not work with the US or UK conventions either:

US convention

(yw <- format(posix, "%Y-%U"))
# [1] "2015-51" "2015-52" "2016-00" "2016-01"
ywd <- sprintf("%s-1", yw)
(as.POSIXct(ywd, format = "%Y-%U-%u"))
# [1] "2015-12-21 CET" "2015-12-28 CET" NA               "2016-01-04 CET"
# -> NA problem for week 00

ywd <- sprintf("%s-4", yw)
# -> does not work for week 00
(as.POSIXct(ywd, format = "%Y-%U-%u"))
# The day of the week is not the reason

# -> no way to use this convention to reliably map week of the year to month of the year

UK convention

(yw <- format(posix, "%Y-%W"))
# [1] "2015-51" "2015-52" "2016-00" "2016-01"
ywd <- sprintf("%s-1", yw)
(as.POSIXct(ywd, format = "%Y-%W-%u"))
# [1] "2015-12-21 CET" "2015-12-28 CET" NA               "2016-01-04 CET"
# -> NA problem for week 00

ywd <- sprintf("%s-4", yw)
# -> does not work for week 00
(as.POSIXct(ywd, format = "%Y-%W-%u"))
# The day of the week is not the reason

# -> no way to use this convention to reliably map week of the year to month of the year

Session info

> sessionInfo()
R version 3.3.2 (2016-10-31)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows >= 8 x64 (build 9200)

locale:
[1] LC_COLLATE=German_Germany.1252     LC_CTYPE=German_Germany.1252       LC_MONETARY=German_Germany.1252   
[4] LC_NUMERIC=C                       LC_TIME=English_United States.1252

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] fva_0.1.0       digest_0.6.10   readxl_0.1.1    dplyr_0.5.0     plyr_1.8.4      magrittr_1.5   
 [7] memoise_1.0.0   testthat_1.0.2  roxygen2_5.0.1  devtools_1.12.0

loaded via a namespace (and not attached):
 [1] Rcpp_0.12.8     lubridate_1.6.0 assertthat_0.1  packrat_0.4.8-1 crayon_1.3.2    withr_1.0.2    
 [7] R6_2.2.0        DBI_0.5-1       stringi_1.1.2   rstudioapi_0.6  tools_3.3.2     stringr_1.1.0  
[13] tibble_1.2     

> devtools::session_info()
Session info -----------------------------------------------------------------------------------------------
 setting  value                       
 version  R version 3.3.2 (2016-10-31)
 system   x86_64, mingw32             
 ui       RStudio (1.0.136)           
 language en                          
 collate  German_Germany.1252         
 tz       Europe/Berlin               
 date     2017-01-12                  

Packages ---------------------------------------------------------------------------------------------------
 package    * version date       source        
 assertthat   0.1     2013-12-06 CRAN (R 3.3.2)
 crayon       1.3.2   2016-06-28 CRAN (R 3.3.2)
 DBI          0.5-1   2016-09-10 CRAN (R 3.3.2)
 devtools   * 1.12.0  2016-06-24 CRAN (R 3.3.2)
 digest     * 0.6.10  2016-08-02 CRAN (R 3.3.2)
 dplyr      * 0.5.0   2016-06-24 CRAN (R 3.3.2)
 fva        * 0.1.0   <NA>       local         
 lubridate    1.6.0   2016-09-13 CRAN (R 3.3.2)
 magrittr   * 1.5     2014-11-22 CRAN (R 3.3.2)
 memoise    * 1.0.0   2016-01-29 CRAN (R 3.3.2)
 packrat      0.4.8-1 2016-09-07 CRAN (R 3.3.2)
 plyr       * 1.8.4   2016-06-08 CRAN (R 3.3.2)
 R6           2.2.0   2016-10-05 CRAN (R 3.3.2)
 Rcpp         0.12.8  2016-11-17 CRAN (R 3.3.2)
 readxl     * 0.1.1   2016-03-28 CRAN (R 3.3.2)
 roxygen2   * 5.0.1   2015-11-11 CRAN (R 3.3.2)
 stringi      1.1.2   2016-10-01 CRAN (R 3.3.2)
 stringr      1.1.0   2016-08-19 CRAN (R 3.3.2)
 testthat   * 1.0.2   2016-04-23 CRAN (R 3.3.2)
 tibble       1.2     2016-08-26 CRAN (R 3.3.2)
 withr        1.0.2   2016-06-20 CRAN (R 3.3.2)

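3 answers:

Answer 0: (score: 1)

Disclosure: As mentioned in this answer, I created the ISOweek package to deal with ISO 8601 week-based dates.

The question contains several flaws:

  1. The ISO 8601 week-based year is different from the calendar year.
  2. Converting a year-week to a year-month is ambiguous unless a day of the week is specified.

Week-based year vs. calendar year

The OP created the sample data using
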
    posix <- as.POSIXct(c("2015-12-24", "2015-12-31", "2016-01-01", "2016-01-08"))
    (yw <- format(posix, "%Y-%V"))
    
    [1] "2015-52" "2015-53" "2016-53" "2016-01"
    

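The format specification %Y returns the calendar year, so the third element is obviously wrong: 2016-01-01 belongs to week 53 of the week-based year 2015, not of 2016.

With the correct format specification %G, we get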

    (yw <- format(posix, "%G-%V"))
    
    [1] "2015-52" "2015-53" "2015-53" "2016-01"
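
To make the difference easier to see, here is a minimal side-by-side sketch using only base R. It assumes a platform where format() supports %G and %V on output (which the output above suggests); the variable names are just for illustration.

```r
# Sample dates from the question, as Date objects to avoid time zone effects.
dates <- as.Date(c("2015-12-24", "2015-12-31", "2016-01-01", "2016-01-08"))

# %Y is the calendar year, %G is the ISO 8601 week-based year.
calendar_year   <- format(dates, "%Y-%V")
week_based_year <- format(dates, "%G-%V")

# The two columns differ only for 2016-01-01, which lies in week 53 of 2015.
print(data.frame(date = dates, calendar_year, week_based_year))
```

Only the combination %G with %V gives a label that identifies one specific week.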
    

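Converting week of the year to month of the year

Giving only the ISO week-based year and the week number, without the day of the week, yields ambiguous results.

This can be demonstrated with the (corrected) sample data, which now contain three consecutive weeks in the OP's own (non-standard) year-week format: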

    (yw <- unique(yw))  # drop the duplicated "2015-53"
    
    [1] "2015-52" "2015-53" "2016-01"
    

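With the help of the ISOweek2date() function from the ISOweek package, the data are converted to calendar dates. Note that ISOweek2date() requires a complete ISO 8601 week date in the format yyyy-Www-d, including the day of the week. If we pick the first day of the week (Monday), we get: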

    library(ISOweek)
    library(magrittr)
    yw %>% 
      # insert "W" to conform with ISO 8601 format
      sub("-", "-W", .) %>% 
      # append day of week
      paste0("-1") %>%
      # convert to class Date and print as yyyy-mm 
      ISOweek2date() %>% 
      format("%Y-%m")
    
    [1] "2015-12" "2015-12" "2016-01"
    

Now we repeat this using the last day of the week (Sunday):

    yw %>% 
      sub("-", "-W", .) %>% 
      paste0("-7") %>% 
      ISOweek2date() %>% 
      format("%Y-%m")
    
    [1] "2015-12" "2016-01" "2016-01"
    

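Note that the second element now refers to January 2016 instead of December 2015, because the Sunday of week 53 falls in January while the Monday of the same week is still in December.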
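
The ambiguity can also be seen without any package by looking at all seven days of that week. This is only an illustrative sketch in base R; it takes 2015-12-28 as the Monday of week 53 and again assumes that format() supports %G and %V on output.

```r
# The seven days of ISO week 2015-W53, Monday through Sunday.
week53 <- as.Date("2015-12-28") + 0:6

# Every day carries the same ISO year-week label ...
iso_label <- unique(format(week53, "%G-W%V"))

# ... but the days are split across two calendar months.
days_per_month <- table(format(week53, "%Y-%m"))

print(iso_label)
print(days_per_month)
```

Whichever weekday is picked, the choice has to be fixed up front and applied to every week. One possible convention, offered only as a suggestion, is Thursday, which mirrors the ISO 8601 rule that a week belongs to the year containing its Thursday.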

Answer 1: (score: 0)

Pretty sure something other than base R needs to change (see the note at the end):

some_dates <- as.POSIXct(c("2015-12-24", "2015-12-31", "2016-01-01", "2016-01-08"))

(year_week <- format(some_dates, "%Y %U"))
## [1] "2015 51" "2015 52" "2016 00" "2016 01"

(year_week_day <- sprintf("%s 1", year_week))
## [1] "2015 51 1" "2015 52 1" "2016 00 1" "2016 01 1"

(as.POSIXct(year_week_day, format = "%Y %U %u"))
## [1] "2015-12-21 EST" "2015-12-28 EST" "2016-01-04 EST" "2016-01-04 EST"

It also works with dashes:

(year_week <- format(some_dates, "%Y-%U"))
## [1] "2015-51" "2015-52" "2016-00" "2016-01"

(year_week_day <- sprintf("%s-1", year_week))
## [1] "2015-51-1" "2015-52-1" "2016-00-1" "2016-01-1"

(as.POSIXct(year_week_day, format = "%Y-%U-%u"))
## [1] "2015-12-21 EST" "2015-12-28 EST" "2016-01-04 EST" "2016-01-04 EST"

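And although dashes are a valid ISO form, they can confuse readers when the values are not > 12 or < 0, since a string such as "2016-01" can then be read as a year-month.

Note

Since the comment thread indicates this is the behavior on Windows: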

(year_week <- format(some_dates, "%Y-%U"))
## [1] "2015-51" "2015-52" "2016-00" "2016-01"

(year_week_day <- sprintf("%s-1", year_week))
## [1] "2015-51-1" "2015-52-1" "2016-00-1" "2016-01-1"

(as.POSIXct(year_week_day, format = "%Y-%U-%u"))
## [1] "2015-12-21 PST" "2015-12-28 PST" NA               "2016-01-04 PST"

(Windows 10 64bit,R 3.3.2 for me / this example)

Answer 2: (score: 0)

The documentation for R's date-time format specifications, ?strptime, says that "%V" is accepted but ignored on input.
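
Since strptime() cannot parse the week number, one possible workaround is to do the date arithmetic by hand. The sketch below is not part of base R: iso_week_to_date() is a hypothetical helper that relies on the ISO 8601 rule that 4 January always lies in week 1. It does not check whether the requested week actually exists in the given year.

```r
# Hypothetical helper: convert ISO week-based year, week number and ISO
# weekday (1 = Monday, ..., 7 = Sunday) to a Date, using base R only.
iso_week_to_date <- function(year, week, weekday = 1L) {
  # 4 January is always in ISO week 1 of its week-based year.
  jan4 <- as.Date(sprintf("%04d-01-04", as.integer(year)))
  # POSIXlt$wday is 0 for Sunday; convert to days elapsed since Monday.
  days_since_monday <- (as.POSIXlt(jan4)$wday + 6L) %% 7L
  week1_monday <- jan4 - days_since_monday
  week1_monday + 7L * (as.integer(week) - 1L) + (as.integer(weekday) - 1L)
}

# Year-week strings in the "%G-%V" layout used in the question.
yw <- c("2015-52", "2015-53", "2016-01")
parts <- do.call(rbind, strsplit(yw, "-", fixed = TRUE))
iso_year <- as.integer(parts[, 1])
iso_week <- as.integer(parts[, 2])

mondays <- iso_week_to_date(iso_year, iso_week, 1L)
sundays <- iso_week_to_date(iso_year, iso_week, 7L)

format(mondays, "%Y-%m")  # "2015-12" "2015-12" "2016-01"
format(sundays, "%Y-%m")  # "2015-12" "2016-01" "2016-01"
```

The weekday is derived from POSIXlt$wday rather than from a format string, so the helper does not depend on which conversion specifications the platform's date formatting supports.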