我有包含英国周数字的字符串(strptime docs中的%W),我可以将包含它的字符串转换为POSIXct
# create dummy data in June
x1 <- as.POSIXct('2012-06-01 01:00', format='%Y-%m-%d %H:%M', tz='UT')
(x2 <- format(x1, '%Y %W %a %H %M'))
[1] "2012 22 Fri 01 00"
as.POSIXct(x2, format='%Y %W %a %H %M', tz='UT')
[1] "2012-06-01 01:00:00"
所以这很好......但是如果我想要2012年1月的第一个它不起作用 - 我只是得到一个NA
x1 <- as.POSIXct('2012-01-01 01:00', format='%Y-%m-%d %H:%M', tz='UT')
(x2 <- format(x1, '%Y %W %a %H %M'))
[1] "2012 00 Sun 01 00"
as.POSIXct(x2, format='%Y %W %a %H %M', tz='UT')
[1] NA
如何解决这个问题,以便我可以在年初将这些日期转换为POSIXct?
更新
以下C代码在libc 2.15(ubuntu 12.04 LTS)上为我的机器演示了真正的问题是底层的libc
#define _XOPEN_SOURCE
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>
int
main (void)
{
struct tm tm;
char buf[255];
memset (&tm, 0, sizeof (struct tm));
// ok what week number is the 1st of January 2012?
// according to this it is week 01 in %U format
// and 00 in %W format...
strptime ("2012-01-01 01:00", "%Y-%m-$d %H:%M", &tm);
strftime (buf, sizeof (buf), "%Y-%m-%d %U %W %a %b %H:%M", &tm);
puts("To demonstrate the different week numbers in %U %W");
puts("Using format %Y-%m-%d %U %W %a %b %H:%M");
puts (buf);
// to demonstrate it works
strptime ("2012 02 Sun 01 00", "%Y %W %a %H %M", &tm);
strftime (buf, sizeof (buf), "%Y-%m-%d %a %b %H %M", &tm);
puts("\nUsing format %Y-%m-%d %a %b %H%M for 2012 02 Sun 01 00");
puts (buf);
// but then the potential bug...
strptime ("2012 01 Sun 01 00", "%Y %W %a %H %M", &tm);
strftime (buf, sizeof (buf), "%Y-%m-%d %a %b %H %M", &tm);
puts("\nUsing format %Y-%m-%d %a %b %H%M for 2012 01 Sun 01 00");
puts("and this is wrong...");
puts (buf);
strptime ("2012 00 Sun 01 00", "%Y %W %a %H %M", &tm);
strftime (buf, sizeof (buf), "%Y-%m-%d %a %b %H %M", &tm);
puts("\nUsing format %Y-%m-%d %a %b %H %M for 2012 00 Sun 01 00");
puts("and this is VERY wrong...");
puts (buf);
exit (EXIT_SUCCESS);
}
它提供以下输出
To demonstrate the different week numbers in %U %W
Using format %Y-%m-%d %U %W %a %b %H:%M
2012-01-00 01 00 Sun Jan 00:00
Using format %Y-%m-%d %a %b %H%M for 2012 02 Sun 01 00
2012-01-08 Sun Jan 01 00
Using format %Y-%m-%d %a %b %H%M for 2012 01 Sun 01 00
and this is wrong...
2012-01-01 Sun Jan 01 00
Using format %Y-%m-%d %a %b %H %M for 2012 00 Sun 01 00
and this is VERY wrong...
2012-00--371 Sun Saturday 01 00
答案 0 :(得分:1)
看起来像00就会产生问题。
x3 <- gsub(' 00 ' , ' 01 ',x2) ## dirty workaround 00 -> 01
> as.POSIXct(x3, format='%Y %W %a %H %M')
[1] "2012-01-01 01:00:00 CET"
> x1
[1] "2012-01-01 01:00:00 CET"
编辑
使用%U代替%W我得到了这个:
x1 <- as.POSIXct('2012-01-01 01:00', format='%Y-%m-%d %H:%M')
> (x2 <- format(x1, '%Y %U %a %H %M'))
[1] "2012 01 dim. 01 00"
> as.POSIXct(x2, format='%Y %U %a %H %M')
[1] "2012-01-01 01:00:00 CET"
x1 <- as.POSIXct('2012-01-08 01:00', format='%Y-%m-%d %H:%M')
> (x2 <- format(x1, '%Y %U %a %H %M'))
[1] "2012 02 dim. 01 00"
> as.POSIXct(x2, format='%Y %U %a %H %M')
[1] "2012-01-08 01:00:00 CET"