I'm trying to exclude retweets and replies from a search with Twython.
Here is my code:
from twython import Twython, TwythonError
app_key = "xxxx"
app_secret = "xxxx"
oauth_token = "xxxx"
oauth_token_secret = "xxxx"
naughty_words = [" -RT"]
good_words = ["search phrase", "another search phrase"]
filter = " OR ".join(good_words)
blacklist = " -".join(naughty_words)
keywords = filter + blacklist
twitter = Twython(app_key, app_secret, oauth_token, oauth_token_secret)
search_results = twitter.search(q=keywords, count=100)
The problem is that the -RT exclusion doesn't really work.
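Edit
I tried @forge's suggestion. On its own it correctly prints whether each tweet is a retweet or a reply, but when I incorporate it into the code below, the bot still finds tweets, retweets, quotes and replies.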
twitter = Twython(app_key, app_secret, oauth_token, oauth_token_secret)
query = 'beer OR wine AND -filter:retweets AND -filter:replies'
response = twitter.search(q=query, count=100)
statuses = response['statuses']
try:
    for tweet in statuses:
        try:
            # Retweet each status returned by the search
            twitter.retweet(id=tweet["id_str"])
        except TwythonError as e:
            print(e)
except TwythonError as e:
    print(e)
Any ideas? Is there a filter:quotes?
Answer 0 (score: 8)
The correct syntax is -filter:retweets.
If you want to search for the terms "search phrase" or "another search phrase" and exclude retweets, the query should be:
query = "search_phrase OR another_search_phrase -filter:retweets"
To exclude replies as well, add -filter:replies like this:
query = "search_phrase OR another_search_phrase -filter:retweets AND -filter:replies"
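A query like the ones above can also be assembled programmatically. The following is a minimal sketch, not part of Twython: build_query is a hypothetical helper that wraps multi-word phrases in quotes, joins them with OR, and appends the negated filter operators separated by spaces, which the search API treats as an implicit AND.

```python
def build_query(phrases, excluded_filters=("retweets", "replies")):
    """Join search phrases with OR and append negated filter operators."""
    # Wrap multi-word phrases in double quotes so they match as exact phrases.
    terms = ['"{}"'.format(p) if " " in p else p for p in phrases]
    # Negate each filter operator with a leading minus sign.
    exclusions = ["-filter:{}".format(name) for name in excluded_filters]
    return " ".join([" OR ".join(terms)] + exclusions)


query = build_query(["wine", "craft beer"])
print(query)  # wine OR "craft beer" -filter:retweets -filter:replies
```

The resulting string can be passed as the q argument of twitter.search in the same way as the hand-written queries.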
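This should work. You can verify it by checking the status fields in_reply_to_status_id and retweeted_status:
If in_reply_to_status_id is empty, the status is not a reply.
If the status has no retweeted_status field, it is not a retweet.
With Twython: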
import twython
twitter = twython.Twython(APP_KEY, APP_SECRET, OAUTH_TOKEN, OAUTH_TOKEN_SECRET)
query = 'wine OR beer -filter:retweets AND -filter:replies'
response = twitter.search(q=query, count=100)
statuses = response['statuses']
for status in statuses:
    # Print the reply target (None if not a reply) and whether the status is a retweet
    print(status['in_reply_to_status_id'], 'retweeted_status' in status)
# Every status should print: None False
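If the search operators still let unwanted tweets through, the same fields can be used to filter the results on the client side before retweeting. This is only a sketch under stated assumptions: is_original is a hypothetical helper, and the is_quote_status check assumes the response carries that boolean field from the v1.1 tweet object, which marks quote tweets.

```python
def is_original(status):
    """Return True if the status is not a retweet, a reply, or a quote tweet."""
    # Retweets carry the original tweet under 'retweeted_status'.
    if "retweeted_status" in status:
        return False
    # Replies have a non-null 'in_reply_to_status_id'.
    if status.get("in_reply_to_status_id") is not None:
        return False
    # Assumption: quote tweets are flagged by 'is_quote_status'.
    if status.get("is_quote_status", False):
        return False
    return True


# Hand-written sample statuses for illustration, not real API output.
sample = [
    {"id_str": "1", "in_reply_to_status_id": None, "is_quote_status": False},
    {"id_str": "2", "in_reply_to_status_id": None, "retweeted_status": {}},
    {"id_str": "3", "in_reply_to_status_id": 42},
    {"id_str": "4", "in_reply_to_status_id": None, "is_quote_status": True},
]
originals = [s for s in sample if is_original(s)]
print([s["id_str"] for s in originals])  # ['1']
```

With a real response, [s for s in response['statuses'] if is_original(s)] would give the list of statuses to retweet.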