熊猫read_csv解析错误问题。只有8个田地,但大熊猫却看了9个

时间:2020-02-13 01:20:19

标签: python pandas

我想用熊猫解析csv文件(分隔符为'\ t')

只有8个字段,但是熊猫说有些行有9个字段。

要检查,我检查每行中的'\ t'数。

没有特殊的界线,但熊猫无法解析此界线。

我的验证码如下。

for line in open("/root/target_file_path"):
    if len(line.split("\t")) != 8:
        print (line)

import pandas as pd

sheet = pd.read_csv("/root/target_file_path",sep='\t')
print (sheet.keys())

打印结果如下。

Traceback (most recent call last):
File "/root/Workspace/test2.py", line 9, in <module>
sheet = pd.read_csv("/root/namu.kma.refine.SDItrForm",sep='\t')
File "/root/.local/lib/python3.6/site-packages/pandas/io/parsers.py", line 685, in parser_f
return _read(filepath_or_buffer, kwds)
File "/root/.local/lib/python3.6/site-packages/pandas/io/parsers.py", line 463, in _read
data = parser.read(nrows)
File "/root/.local/lib/python3.6/site-packages/pandas/io/parsers.py", line 1154, in read
ret = self._engine.read(nrows)
File "/root/.local/lib/python3.6/site-packages/pandas/io/parsers.py", line 2059, in read
data = self._reader.read(nrows)
File "pandas/_libs/parsers.pyx", line 881, in pandas._libs.parsers.TextReader.read
File "pandas/_libs/parsers.pyx", line 896, in 
pandas._libs.parsers.TextReader._read_low_memory
File "pandas/_libs/parsers.pyx", line 950, in pandas._libs.parsers.TextReader._read_rows
File "pandas/_libs/parsers.pyx", line 937, in pandas._libs.parsers.TextReader._tokenize_rows
File "pandas/_libs/parsers.pyx", line 2132, in pandas._libs.parsers.raise_parser_error
pandas.errors.ParserError: Error tokenizing data. C error: Expected 8 fields in line 13017, saw 9

0 个答案:

没有答案