“ ValueError:日期超出了月份范围”,因为数据中的所有月份中的所有月份共有31天

时间:2019-07-24 13:20:27

标签: python jupyter-notebook

我正在处理3小时一次的卫星降水数据。但是,所有月份都有31天。 附加数据时间时出现错误。请帮忙!

sat=pd.read_csv(r"C:\Users\Amod\Documents\Dissertation\Data\MSWEP_INDIA_CITIES_FILTERED\data_11.375_75.875", sep=" ", header=None)


sat.columns = ["year", "month", "day", "satellite"]

years = list(sat.year)
months = list(sat.month)
days = list(sat.day)
rain = list(sat.satellite)

h = 0
datetimes = []
for i in range(len(years)):
    if i ==0:
        h = 0
    else:
        if days[i]==days[i-1]:
            h +=3 #h = h+1 is the same!
        else:
            h = 0
    datetimes.append(datetime.datetime(years[i], months[i], days[i], h))

datetimes

ValueError                                Traceback (most recent call last)
<ipython-input-6-53553b0773c9> in <module>
     10         else:
     11             h = 0
---> 12     datetimes.append(datetime.datetime(years[i], months[i], days[i], h))
     13 
     14 datetimes

ValueError: day is out of range for month

1 个答案:

答案 0 :(得分:0)

我认为您应该考虑使用pandas并过滤掉非日期,但这可以通过try / except处理,如下所示:

sat=pd.read_csv(r"C:\Users\Amod\Documents\Dissertation\Data\MSWEP_INDIA_CITIES_FILTERED\data_11.375_75.875", sep=" ", header=None)



sat.columns = ["year", "month", "day", "satellite"]

years = list(sat.year)
months = list(sat.month)
days = list(sat.day)
rain = list(sat.satellite)

h = 0
datetimes = []
for i in range(len(years)):
    if i ==0:
        h = 0
    else:
        if days[i]==days[i-1]:
            h +=3 #h = h+1 is the same!
        else:
            h = 0
    try:        
        datetimes.append(datetime.datetime(years[i], months[i], days[i], h))
    except: continue

ETA: 要处理熊猫中日期时间的错误,请查看“无效数据”部分here

df['Date'] = pd.to_datetime(df['Date'], errors = 'coerce')
df = df[df['Date'] != 'NaT']