我是Python初学者,目前正在尝试删除csv中的某些列-效果很好!但是:当我使用熊猫时,它将自动在文件开头添加另一列并添加行号。我该如何避免呢?
输入数据如下(json):
[
{
"source": "twitter",
"cashtag": "$FB",
"sentiment score": "0.366",
"id": "719659409228451840",
"spans": [
"watching for bounce tomorrow"
]
}, ... ]
转换为csv效果很好。
我这样做的代码:
import pandas as pd
# Convert son to csv
pd.read_json("test.json").to_csv("test.csv")
# Delete cashtag, id, source column
data = pd.read_csv("test.csv")
data = data.drop(["cashtag", "id", "source"], axis=1)
data.to_csv("test_cleaned.csv")
data.head()
输出:
Unnamed: 0 sentiment score spans
0 0 0.366 ['watching for bounce tomorrow']
1 1 0.638 ['record number of passengers served in 2015']
2 2 -0.494 ['out $NFLX -.35']
3 3 0.460 ['Looking for a strong bounce', 'Lunchtime
4 4 0.403 ['Very intrigued with the technology and
我想要拥有的东西:
sentiment score spans
0.366 ['watching for bounce tomorrow']
0.638 ['record number of passengers served in 2015']
-0.494 ['out $NFLX -.35']
0.460 ['Looking for a strong bounce', 'Lunchtime
0.403 ['Very intrigued with the technology and
因此,转换和删除效果很好,但是对于使用熊猫的每个操作,它将在文件的开头添加另一列。在此示例中,转换为csv后的1列和删除列后的1列。如何避免这种情况?
答案 0 :(得分:0)
这就是索引,您可以通过使用以下代码来防止被写入
df.to_csv(‘FileMaker.csv’, index=False)
答案 1 :(得分:0)
您要引用的列是索引。保存csv时尝试这样做:
data.to_csv("test_cleaned.csv", index=False)
除非明确执行,否则Pandas会为您初始化的每个数据框自动创建一个索引。我强烈建议您阅读panda's documentation以获取更多信息。