R将周数(UK)转换为POSIXct问题

时间:2013-02-01 12:33:01

标签: r posixct

我有包含英国周数字的字符串(strptime docs中的%W),我可以将包含它的字符串转换为POSIXct

# create dummy data in June
x1 <- as.POSIXct('2012-06-01 01:00', format='%Y-%m-%d %H:%M', tz='UT')
(x2 <- format(x1, '%Y %W %a %H %M'))
[1] "2012 22 Fri 01 00"
as.POSIXct(x2, format='%Y %W %a %H %M', tz='UT')
[1] "2012-06-01 01:00:00"

所以这很好......但是如果我想要2012年1月的第一个它不起作用 - 我只是得到一个NA

x1 <- as.POSIXct('2012-01-01 01:00', format='%Y-%m-%d %H:%M', tz='UT')
(x2 <- format(x1, '%Y %W %a %H %M'))
[1] "2012 00 Sun 01 00"
as.POSIXct(x2, format='%Y %W %a %H %M', tz='UT')
[1] NA

如何解决这个问题,以便我可以在年初将这些日期转换为POSIXct?

更新

以下C代码在libc 2.15(ubuntu 12.04 LTS)上为我的机器演示了真正的问题是底层的libc

#define _XOPEN_SOURCE
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>
  int
main (void)
{
  struct tm tm;
  char buf[255];

  memset (&tm, 0, sizeof (struct tm));
  // ok what week number is the 1st of January 2012?
  // according to this it is week 01 in %U format 
  // and 00 in %W format...
  strptime ("2012-01-01 01:00", "%Y-%m-$d %H:%M", &tm);
  strftime (buf, sizeof (buf), "%Y-%m-%d %U %W %a %b %H:%M", &tm);
  puts("To demonstrate the different week numbers in %U %W");
  puts("Using format %Y-%m-%d %U %W %a %b %H:%M");
  puts (buf);
  // to demonstrate it works
  strptime ("2012 02 Sun 01 00", "%Y %W %a %H %M", &tm);
  strftime (buf, sizeof (buf), "%Y-%m-%d %a %b %H %M", &tm);
  puts("\nUsing format %Y-%m-%d %a %b %H%M for 2012 02 Sun 01 00");
  puts (buf);
  // but then the potential bug...
  strptime ("2012 01 Sun 01 00", "%Y %W %a %H %M", &tm);
  strftime (buf, sizeof (buf), "%Y-%m-%d %a %b %H %M", &tm);
  puts("\nUsing format %Y-%m-%d %a %b %H%M for 2012 01 Sun 01 00");
  puts("and this is wrong...");
  puts (buf);
  strptime ("2012 00 Sun 01 00", "%Y %W %a %H %M", &tm);
  strftime (buf, sizeof (buf), "%Y-%m-%d %a %b %H %M", &tm);
  puts("\nUsing format %Y-%m-%d %a %b %H %M for 2012 00 Sun 01 00");
  puts("and this is VERY wrong...");
  puts (buf);
  exit (EXIT_SUCCESS);
}

它提供以下输出

To demonstrate the different week numbers in %U %W
Using format %Y-%m-%d %U %W %a %b %H:%M
2012-01-00 01 00 Sun Jan 00:00

Using format %Y-%m-%d %a %b %H%M for 2012 02 Sun 01 00
2012-01-08 Sun Jan 01 00

Using format %Y-%m-%d %a %b %H%M for 2012 01 Sun 01 00
and this is wrong...
2012-01-01 Sun Jan 01 00

Using format %Y-%m-%d %a %b %H %M for 2012 00 Sun 01 00
and this is VERY wrong...
2012-00--371 Sun Saturday 01 00

1 个答案:

答案 0 :(得分:1)

看起来像00就会产生问题。

x3 <- gsub(' 00 ' , ' 01 ',x2)             ## dirty workaround 00 -> 01
> as.POSIXct(x3, format='%Y %W %a %H %M')
[1] "2012-01-01 01:00:00 CET"
> x1
[1] "2012-01-01 01:00:00 CET"

编辑

使用%U代替%W我得到了这个:

x1 <- as.POSIXct('2012-01-01 01:00', format='%Y-%m-%d %H:%M')
> (x2 <- format(x1, '%Y %U %a %H %M'))
[1] "2012 01 dim. 01 00"
> as.POSIXct(x2, format='%Y %U %a %H %M')
[1] "2012-01-01 01:00:00 CET"

x1 <- as.POSIXct('2012-01-08 01:00', format='%Y-%m-%d %H:%M')
> (x2 <- format(x1, '%Y %U %a %H %M'))
[1] "2012 02 dim. 01 00"
> as.POSIXct(x2, format='%Y %U %a %H %M')
[1] "2012-01-08 01:00:00 CET"