Problem reading a product CSV file with pandas in Python

Time: 2020-06-18 12:32:42

Tags: python python-3.x pandas dataframe parsing

I have a product CSV file and I am trying to read it with pandas in Python, but I get the error below.

My code:

import pandas as pd
df = pd.read_csv('D:\\work\\amazon\\move_in_links\\final.csv')

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "C:\Users\Compu City\AppData\Local\Programs\Python\Python38\lib\site-packages\pandas\io\parsers.py", line 676, in parser_f
    return _read(filepath_or_buffer, kwds)
  File "C:\Users\Compu City\AppData\Local\Programs\Python\Python38\lib\site-packages\pandas\io\parsers.py", line 454, in _read
    data = parser.read(nrows)
  File "C:\Users\Compu City\AppData\Local\Programs\Python\Python38\lib\site-packages\pandas\io\parsers.py", line 1133, in read
    ret = self._engine.read(nrows)
  File "C:\Users\Compu City\AppData\Local\Programs\Python\Python38\lib\site-packages\pandas\io\parsers.py", line 2037, in read
    data = self._reader.read(nrows)
  File "pandas\_libs\parsers.pyx", line 860, in pandas._libs.parsers.TextReader.read
  File "pandas\_libs\parsers.pyx", line 875, in pandas._libs.parsers.TextReader._read_low_memory
  File "pandas\_libs\parsers.pyx", line 929, in pandas._libs.parsers.TextReader._read_rows
  File "pandas\_libs\parsers.pyx", line 916, in pandas._libs.parsers.TextReader._tokenize_rows
  File "pandas\_libs\parsers.pyx", line 2071, in pandas._libs.parsers.raise_parser_error
pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 7549, saw 8

This is the link to the file.


One more thing: when I delete most of the rows and keep only 4, the file is read without error.

2 answers:

Answer 0: (score: 1)

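By default, pandas assumes that your CSV is delimited by commas (','). If the file uses a different delimiter, pass it to the read_csv call via the sep argument.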

import pandas as pd
df = pd.read_csv('D:\\work\\amazon\\move_in_links\\final.csv', sep=';')

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html
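
To illustrate why the default separator leads to this kind of message, here is a minimal sketch using made-up sample data (the column names and values are hypothetical and are not taken from final.csv). With sep=',' the semicolon-delimited header is seen as a single field, so a later row that happens to contain a comma appears to have too many fields. That would also be consistent with a short excerpt of the file reading without error, assuming none of the remaining rows contains a comma.

```python
import io

import pandas as pd

# Hypothetical stand-in for final.csv: semicolon-delimited, with a comma
# inside one value on the last row.
sample = (
    "name;price\n"
    "Widget;10\n"
    "Sprocket;15\n"
    "Gadget, large;20\n"
)

# Default sep=',': every line without a comma is one field, so the row
# containing a comma has more fields than expected.
error = None
try:
    pd.read_csv(io.StringIO(sample))
except pd.errors.ParserError as exc:
    error = str(exc)
    print(error)

# Passing the actual delimiter parses the same data into two columns.
df = pd.read_csv(io.StringIO(sample), sep=";")
print(df)
```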

Answer 1: (score: 1)

The file appears to be delimited by ';'. Try:

import pandas as pd
df = pd.read_csv('D:\\work\\amazon\\move_in_links\\final.csv',sep=";")
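
If the delimiter is not known in advance, one possible alternative is to let pandas guess it: read_csv accepts sep=None together with engine="python", in which case the separator is detected from the first row using Python's csv.Sniffer. The sketch below uses hypothetical sample data rather than the real file, and the detection is a heuristic, so checking the resulting columns is still advisable.

```python
import io

import pandas as pd

# Hypothetical semicolon-delimited sample standing in for the real file.
sample = "name;price\nWidget;10\nGadget;20\n"

# sep=None requires the Python engine; the delimiter is sniffed from the
# first row instead of being assumed to be a comma.
df = pd.read_csv(io.StringIO(sample), sep=None, engine="python")

# Inspect the detected columns to confirm the guess was sensible.
print(list(df.columns))
```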