我从mysql获取数据并使用panda DataFrame我将数据分离到列
data = pd.DataFrame(data)
print(data.ix[:,3])
0 2006-04-01
1 2006-08-01
2 2006-12-01
3 2006-02-01
4 2006-01-01
5 2006-07-01
6 2006-06-01
7 2006-03-01
8 2006-05-01
9 2006-11-01
10 2006-10-01
11 2006-09-01
12 2007-04-01
13 2007-08-01
14 2007-12-01
15 2007-02-01
16 2007-01-01
17 2007-07-01
18 2007-06-01
19 2007-03-01
20 2007-05-01
21 2007-11-01
22 2007-10-01
23 2007-09-01
24 2009-04-01
25 2009-08-01
当我将这些数据用于线性回归预测时,它给出的dtype是不同的。如何将此日期字段转换为int并将其放入线性预测
答案 0 :(得分:2)
您可以通过以下方式将日期转换为序数:
data.loc[:, 3].apply(lambda x: x.toordinal())
假设此列的类型为dtype('<M8[ns]')
看起来像是:
2006-04-01 732402
2006-08-01 732524
2006-12-01 732646
2006-02-01 732343
2006-01-01 732312
2006-07-01 732493
2006-06-01 732463
2006-03-01 732371
2006-05-01 732432
2006-11-01 732616
2006-10-01 732585
2006-09-01 732555
2007-04-01 732767
2007-08-01 732889
2007-12-01 733011
2007-02-01 732708
2007-01-01 732677
2007-07-01 732858
2007-06-01 732828
2007-03-01 732736
2007-05-01 732797
2007-11-01 732981
2007-10-01 732950
2007-09-01 732920
2009-04-01 733498
2009-08-01 733620
Name: 1, dtype: int64