Why does pandas add numbers at the beginning of the csv file after conversion?

Time: 2019-06-09 14:25:57

Tags: python pandas

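I'm a Python beginner and am currently trying to delete some columns from a csv, which works fine. However, when I use pandas, it automatically adds another column at the beginning of the file containing the row numbers. How can I avoid this?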

The input data looks like this (json):

    [
      {
        "source": "twitter",
        "cashtag": "$FB",
        "sentiment score": "0.366",
        "id": "719659409228451840",
        "spans": [
          "watching for bounce tomorrow"
        ]
      }, ... ]

Converting it to csv works fine.

My code for this:

import pandas as pd

# Convert JSON to CSV
pd.read_json("test.json").to_csv("test.csv")

# Delete cashtag, id, source column
data = pd.read_csv("test.csv")
data = data.drop(["cashtag", "id", "source"], axis=1)
data.to_csv("test_cleaned.csv")
data.head()

Output:

Unnamed: 0  sentiment score spans
0   0   0.366   ['watching for bounce tomorrow']
1   1   0.638   ['record number of passengers served in 2015']
2   2   -0.494  ['out $NFLX -.35']
3   3   0.460   ['Looking for a strong bounce', 'Lunchtime 
4   4   0.403   ['Very intrigued with the technology and 

What I would like to have:

sentiment score spans
    0.366   ['watching for bounce tomorrow']
    0.638   ['record number of passengers served in 2015']
    -0.494  ['out $NFLX -.35']
    0.460   ['Looking for a strong bounce', 'Lunchtime 
    0.403   ['Very intrigued with the technology and 

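So the conversion and the deletion both work, but every operation I perform with pandas adds another column at the beginning of the file. In this example, that is one column after converting to csv and one more after dropping the columns. How can I avoid this?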

2 Answers:

Answer 0: (score: 0)

That is the index. You can prevent it from being written with the following code:

df.to_csv('FileMaker.csv', index=False)
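
Applied to the pipeline in the question, this could look roughly like the sketch below. It assumes two made-up records shaped like the question's input (the second record's `source`, `cashtag` and `id` values are invented for illustration) and writes to a temporary directory rather than the question's actual files. `index=False` is passed on both `to_csv` calls, since each one would otherwise write the index as an extra column.

```python
import json
import os
import tempfile

import pandas as pd

# Hypothetical sample records shaped like the question's input
# (the second record's source, cashtag and id are invented for illustration).
records = [
    {"source": "twitter", "cashtag": "$FB", "sentiment score": "0.366",
     "id": "719659409228451840", "spans": ["watching for bounce tomorrow"]},
    {"source": "twitter", "cashtag": "$NFLX", "sentiment score": "-0.494",
     "id": "719659409228451841", "spans": ["out $NFLX -.35"]},
]

# Work in a temporary directory instead of the question's real files.
workdir = tempfile.mkdtemp()
json_path = os.path.join(workdir, "test.json")
csv_path = os.path.join(workdir, "test.csv")
cleaned_path = os.path.join(workdir, "test_cleaned.csv")

with open(json_path, "w") as f:
    json.dump(records, f)

# Convert JSON to CSV without writing the index column.
pd.read_json(json_path).to_csv(csv_path, index=False)

# Drop the unwanted columns and again skip the index when saving.
data = pd.read_csv(csv_path)
data = data.drop(["cashtag", "id", "source"], axis=1)
data.to_csv(cleaned_path, index=False)

# The header of the cleaned file now holds only the two remaining columns.
with open(cleaned_path) as f:
    header = f.readline().strip()
print(header)
```

With both calls changed, no `Unnamed: 0` column should appear when the cleaned file is read back.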

Answer 1: (score: 0)

The column you are referring to is the index. Try this when saving the csv:

data.to_csv("test_cleaned.csv", index=False)

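Unless you specify one explicitly, pandas automatically creates an index for every DataFrame you initialize. I strongly recommend reading the pandas documentation for more information.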
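
To see where the extra column comes from, here is a small sketch under the assumption of a toy one-column DataFrame (not the question's data). It uses in-memory buffers instead of files and shows the default index, how `to_csv` writes it unless told otherwise, and how it comes back as `Unnamed: 0` on the next read.

```python
import io

import pandas as pd

# Toy DataFrame for illustration only; no index is given explicitly.
df = pd.DataFrame({"sentiment score": [0.366, 0.638]})

# pandas assigns a default integer index (a RangeIndex) when none is given.
print(df.index)

# By default to_csv writes the index as an unnamed first column.
with_index = io.StringIO()
df.to_csv(with_index)

# With index=False only the data columns are written.
without_index = io.StringIO()
df.to_csv(without_index, index=False)

# Reading the first buffer back turns the unnamed index column
# into a regular column called "Unnamed: 0".
reread = pd.read_csv(io.StringIO(with_index.getvalue()))
print(list(reread.columns))
```

This is why each save-and-reload round trip in the question adds one more leading column.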