Twitter流式格式化JSON输出

时间:2016-07-10 19:46:44

标签: python json python-3.x twitter

也许你可以帮助我。以下python代码检索Twitter Streaming数据,并在获得1000个tweet数据时停止。它可以工作,但返回字段" created_at,screen_name和text"由制表符分隔。相反,我想以JSON格式获取数据。如何设置代码以获取以JSON格式化的数据?

# Import the necessary package to process data in JSON format
try:
    import json
except ImportError:
    import simplejson as json

# Import the necessary methods from "twitter" library
from twitter import Twitter, OAuth, TwitterHTTPError, TwitterStream

# Variables that contains the user credentials to access Twitter API


CONSUMER_KEY = '7pWHWtYlXM9ayJfUKv2F8v84B'
CONSUMER_SECRET = 'Dfcx10Px77Ggn0qGbCHc4TZC7M2IHsXpqk9CaGiCLzcr9VMX5n'
ACCESS_TOKEN = '245080367-zuLrIbxblOnocashgku9dsmDKgy3R7uU0VCTIRDx'
ACCESS_SECRET = 'wCx5ufD9Zft46hVjieLdv0af7p9DxUTsPgge9Zm2qelR9'

oauth = OAuth(ACCESS_TOKEN, ACCESS_SECRET, CONSUMER_KEY, CONSUMER_SECRET)

# Initiate the connection to Twitter Streaming API
twitter_stream = TwitterStream(auth=oauth)

# Get a sample of the public data following through Twitter
#iterator = twitter_stream.statuses.sample()

iterator = twitter_stream.statuses.filter(track="Euro2016", language="fr") 

tweet_count = 1000 
for tweet in iterator:
    tweet_count -= 1


    print (tweet['created_at'],"\t",tweet['user']['screen_name'],"\t",tweet['geo'], "\t",tweet['text'])


    if tweet_count <= 0:
        break

1 个答案:

答案 0 :(得分:1)

您可以导入tweepy(您需要先使用pip安装它)并覆盖listener class以便能够以json格式输出数据。这是一个例子:

from tweepy import Stream
from tweepy.streaming import StreamListener

#Listener Class Override
class listener(StreamListener):
    def on_data(self, data):
        try: 
            tweet = json.loads(data)
            with open('your_data.json', 'a') as my_file:
                json.dump(tweet, my_file)
        except BaseException:
            print('Error')
            pass

    def on_error(self, status):
        print(statuses)

my_listener=listener()
twitterStream = Stream(oauth, my_listener)  #Inizialize Stream object

您可以在此处详细了解tweepyhttp://docs.tweepy.org/en/v3.4.0/streaming_how_to.html