我正在使用nirgs forked version of Tweepy,我需要使用它来获取2017-01-31
和2017-02-01
之间的推文。我的代码有效,由于Twitters速率限制,我必须在多个auth处理程序之间切换才能使用推文,这些推文的日期与之前提到的相同。我正在使用this instruction,但在达到速率限制并尝试使用api.auth_idx += 1
切换到下一个身份验证处理程序后,我收到以下错误:
File "build/bdist.macosx-10.11-intel/egg/tweepy/api.py", line 45, in auth_idx
IndexError: Index out of bounds
主要代码如下所示:
oauth_keys = [["consumer_key_1", "consumer_secret_1", "access_token_1", "access_token_secret_1"], ["consumer_key_2", "consumer_secret_2", "access_token_2", "access_token_secret_2"]]
auths = []
for consumer_key, consumer_secret, access_key, access_secret in oauth_keys:
auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_key, access_secret)
auths.append(auth)
api = tweepy.API(auths, monitor_rate_limit=True)
while True:
try:
for tweet in tweepy.Cursor(api.search, q='bitcoin', since=datetime.date(2017, 1, 31), lang='en').items(3000):
...
except tweepy.TweepError as e:
api.auth_idx += 1
continue
我不知道我做错了什么。任何帮助都很受欢迎!
答案 0 :(得分:2)
这是解决方案。
错误是索引对象本身。我必须创建一个变量作为索引:
switch += 1
api.auth_idx = switch
以下是完整的代码,以防任何人试图构建相同的应用程序:
oauth_keys = [
["consumer_key_1", "consumer_secret_1", "access_token_1", "access_token_secret_1"],
["consumer_key_2", "consumer_secret_2", "access_token_2", "access_token_secret_2"]
]
auths = []
for consumer_key, consumer_secret, access_key, access_secret in oauth_keys:
auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_key, access_secret)
auths.append(auth)
# api
api = tweepy.API(auths)
currentCursor = tweepy.Cursor(api.search, q = 'bitcoin', since='2017-02-01', until='2017-02-02', lang='en')
c = currentCursor.items()
switch = 0
api.auth_idx = switch
tweets = []
early_dt_val = datetime.datetime.strptime('2017-01-31 22:34:48', '%Y-%m-%d %H:%M:%S')
later_dt_val = datetime.datetime.strptime('2017-02-01 01:25:30', '%Y-%m-%d %H:%M:%S')
maxId = 0
while True:
try:
#for tweet in tweepy.Cursor(api.search, q='bitcoin', since=datetime.date(2017, 1, 31), lang='en').items(3000)
for tweet in c:
tweets.append(tweet)
maxId = tweet.id
for tweet in reversed(tweets):
tweetTime = tweet.created_at
if tweetTime < later_dt_val and tweetTime > early_dt_val:
print('Date: ' + str(tweet.created_at))
print('Tweet: ' + tweet.text)
except tweepy.TweepError as e:
print e
print "---------------------------Rate limit exceeded.---------------------------"
print "switching keys..."
switch += 1
if switch > 4:
print "Limit reached"
break
else:
api.auth_idx = switch # naechster Key
currentCursor = tweepy.Cursor(api.search, q='bitcoin', since='2017-02-01', until='2017-02-02', lang='en', max_id=maxId)