Pandas warn_bad_lines = False,error_bad_lines = False仍在尝试解析坏线

时间:2014-02-25 21:06:31

标签: python pandas

我正在尝试导入tsv文件。其中一条线很糟糕,我不知道哪一条。我想要大熊猫忽略那条线。这不起作用:

bob_date_parser2 = lambda x: datetime.datetime.strptime(x, "%m/%d/%Y")  #"3/7/2013"
df = pd.read_csv(fname, sep="\t", parse_dates=['country_release'], date_parser=bob_date_parser2, encoding='utf8',
                 warn_bad_lines=False, error_bad_lines=False)

我知道我的文件处理是正确的,因为我只能head第一行并导入而没有任何问题。但是,当我尝试使用

时,我的整个文件read_csv失败了
Traceback (most recent call last):
  File "interpret_data_parse/wrangle_survey_data.py", line 57, in <module>
    main()
  File "interpret_data_parse/wrangle_survey_data.py", line 51, in main
    print load_master_interpret_boxoffice()
  File "interpret_data_parse/wrangle_survey_data.py", line 35, in load_master_interpret_boxoffice
    warn_bad_lines=True, error_bad_lines=False)
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 420, in parser_f
    return _read(filepath_or_buffer, kwds)
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 225, in _read
    return parser.read()
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 626, in read
    ret = self._engine.read(nrows)
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1122, in read
    names, data = self._do_date_conversions(names, data)
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 946, in _do_date_conversions
    self.index_names, names, keep_date_col=self.keep_date_col)
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1948, in _process_date_conversion
    data_dict[colspec] = converter(data_dict[colspec])
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1915, in converter
    return generic_parser(date_parser, *date_cols)
  File "/home/isl/rfcompton/.local/lib/python2.7/site-packages/pandas/io/date_converters.py", line 38, in generic_parser
    results[i] = parse_func(*args)
  File "interpret_data_parse/wrangle_survey_data.py", line 33, in <lambda>
    bob_date_parser2 = lambda x: datetime.datetime.strptime(x, "%m/%d/%Y")  #"3/7/2013" wtf
TypeError: must be string, not float

我在熊猫上13.1

0 个答案:

没有答案