我正在尝试导入tsv文件。其中一条线很糟糕,我不知道哪一条。我想要大熊猫忽略那条线。这不起作用:
bob_date_parser2 = lambda x: datetime.datetime.strptime(x, "%m/%d/%Y") #"3/7/2013"
df = pd.read_csv(fname, sep="\t", parse_dates=['country_release'], date_parser=bob_date_parser2, encoding='utf8',
warn_bad_lines=False, error_bad_lines=False)
我知道我的文件处理是正确的,因为我只能head
第一行并导入而没有任何问题。但是,当我尝试使用
read_csv
失败了
Traceback (most recent call last):
File "interpret_data_parse/wrangle_survey_data.py", line 57, in <module>
main()
File "interpret_data_parse/wrangle_survey_data.py", line 51, in main
print load_master_interpret_boxoffice()
File "interpret_data_parse/wrangle_survey_data.py", line 35, in load_master_interpret_boxoffice
warn_bad_lines=True, error_bad_lines=False)
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 420, in parser_f
return _read(filepath_or_buffer, kwds)
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 225, in _read
return parser.read()
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 626, in read
ret = self._engine.read(nrows)
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1122, in read
names, data = self._do_date_conversions(names, data)
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 946, in _do_date_conversions
self.index_names, names, keep_date_col=self.keep_date_col)
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1948, in _process_date_conversion
data_dict[colspec] = converter(data_dict[colspec])
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1915, in converter
return generic_parser(date_parser, *date_cols)
File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/date_converters.py", line 38, in generic_parser
results[i] = parse_func(*args)
File "interpret_data_parse/wrangle_survey_data.py", line 33, in <lambda>
bob_date_parser2 = lambda x: datetime.datetime.strptime(x, "%m/%d/%Y") #"3/7/2013" wtf
TypeError: must be string, not float
我在熊猫上13.1