使用tweepy

时间:2016-08-04 19:37:36

标签: python csv twitter tweepy

用于收集推文并将其发送到csv文件的Python代码

不断返回错误

tweepy.error.RateLimitError:[{'code':88,'message':'超出限价'}

尝试获取最新的时间轴并将所有这些推文发送到csv文件

感谢您的帮助

def get_all_tweets(screen_name):

    #authorize twitter, initialize tweepy
    auth = tweepy.OAuthHandler(consumer_token, consumer_secret)
    auth.set_access_token(access_token, access_secret)
    api = tweepy.API(auth)

    #initialize a list to hold all the tweepy Tweets
    alltweets = []    


    new_tweets = api.home_timeline (screen_name = screen_name,count=20)

    #save most recent tweets
    alltweets.extend(new_tweets)

    #save the id of the oldest tweet less one
    oldest = alltweets[-1].id - 1

    while len(new_tweets) > 0:
        print ("getting tweets before %s" % (oldest))

        #all subsiquent requests use the max_id param to prevent duplicates
            new_tweets = api.home_timeline(screen_name =      screen_name,count=20,max_id=oldest)

        #save most recent tweets
        alltweets.extend(new_tweets)

        #update the id of the oldest tweet less one
        oldest = alltweets[-1].id - 1

        print ("...%s tweets downloaded so far" % (len(alltweets)))

    #transform the tweepy tweets into a 2D array that will populate the csv    
    outtweets = [[tweet.id_str, tweet.created_at, tweet.text.encode("utf-8")] for tweet in alltweets]

    #write the csv    
    with open('%s_tweetsBQ.csv' % screen_name, 'w') as f:
        writer = csv.writer(f)
        writer.writerow(["id","created_at","text"])
        writer.writerows(outtweets)

    pass


    if __name__ == '__main__':
    #pass in the username of the account you want to download
    get_all_tweets("BQ")

2 个答案:

答案 0 :(得分:6)

您的代码没问题,您刚刚达到Twitter Streaming API限制。让你再次提取推文大约需要一个小时。

初始化tweetpy时应添加wait_on_rate_limit=True选项:

api = tweepy.API(auth, wait_on_rate_limit=True)

我强烈建议您查看:https://dev.twitter.com/rest/public/rate-limiting了解更多详情。

答案 1 :(得分:0)

您是否尝试在tweepy初始化时添加此选项,因此当达到速率限制时,它将等待而不是失败:

api = tweepy.API(auth, wait_on_rate_limit=True)