Question

我正在将一个excel文件读入大熊猫，但我得到以下内容：

Out[8]:
0        \tFLOOD LIGHTS\t
1        \tFLOOD LIGHTS\t
2        \tPAR 38 LIGHT\t
3                \tMILO\t
4    \tQ-12251-DO1 MILO\t

我不想在我的数据中使用“\ t”。这是我的pandas read命令：

import pandas as pd
data = pd.read_ex('/home/Desktop/sample.xlsx')

Answer 1

您的数据中似乎有尾随标签。

因此需要strip删除它：

data['col'] = data['col'].str.strip()

如果所有列：

data = data.apply(lambda x: x.str.strip())

#then convert possible numeric columns
data['num_col'] = data['num_col'].astype(int)

如果需要删除\t字符串，请使用replace ^作为字符串的开头，使用$作为结尾：

data = data['col'].replace(['^\t', '\t$'], '', regex=True)