Pandas散列表给出了关键错误:0

时间:2017-01-06 09:36:05

标签: python pandas twitter bigdata

我正在尝试获取两个pandas数据表的相同元素,索引数据并合并它。我用它来获取大量数据(数百万)。 frist表(df)是constatn,第二个(d2)在每个循环中都在变化,新元素将与第一个表合并。

这是我的代码:

df = pd.read_csv("inputfile.csv",header=None)
d1 = pd.DataFrame(df).set_index(0) 

for i in range(0, len(df)):
    try:
            follower_id=twitter.get_followers_ids(user_id=df.iloc[i][0],cursor=next_cursor) 


            f=follower_id['ids']
            json.dumps(f)
            d2 = pd.DataFrame(f).set_index(0) 
            match_result = pd.merge(d1,d2,left_index=True,right_index=True) 
            fk=[df.iloc[i][0] for number in range(len(match_result))] 
            DF = pd.DataFrame(fk)

            DF.to_csv(r'output1.csv',header=None,sep=' ',index=None) 
            match_result.to_csv(r'output2.csv', header=None,  sep=' ')

我有经验,这段代码运行良好一段时间,但在那之后 - 可能它与第二个数据库大小相关,它改变每个循环 - 程序给我以下错误信息,并停止运行:< / p>

Traceback (most recent call last):
 File "halozat3.py", line 39, in <module>
d2 = pd.DataFrame(f).set_index(0) #1Trump koveto kovetolistaja
File "/usr/lib/python2.7/dist-packages/pandas/core/frame.py", line 2372, in set_index
level = frame[col].values
File "/usr/lib/python2.7/dist-packages/pandas/core/frame.py", line 1678, in __getitem__
return self._getitem_column(key)
File "/usr/lib/python2.7/dist-packages/pandas/core/frame.py", line 1685, in _getitem_column
return self._get_item_cache(key)
File "/usr/lib/python2.7/dist-packages/pandas/core/generic.py", line 1052, in _get_item_cache
values = self._data.get(item)
File "/usr/lib/python2.7/dist-packages/pandas/core/internals.py", line 2565, in get
loc = self.items.get_loc(item)
File "/usr/lib/python2.7/dist-packages/pandas/core/index.py", line 1181, in get_loc
return self._engine.get_loc(_values_from_object(key))
File "index.pyx", line 129, in pandas.index.IndexEngine.get_loc (pandas/index.c:3656)
File "index.pyx", line 149, in pandas.index.IndexEngine.get_loc (pandas/index.c:3534)
File "hashtable.pyx", line 381, in pandas.hashtable.Int64HashTable.get_item (pandas/hashtable.c:7035)
File "hashtable.pyx", line 387, in          pandas.hashtable.Int64HashTable.get_item (pandas/hashtable.c:6976)
KeyError: 0

可能是什么问题?

1 个答案:

答案 0 :(得分:0)

您的数据框中只有一行吗?

您必须根据需要编写任意数量的行 Look