我正在尝试创建一个reddit scraper,它从reddit主页获取前100页并将它们存储到MongoDB中。这是我关于stackoverflow的第一篇文章,所以如果我的帖子格式不正确,我很抱歉。我一直收到错误:
TypeError: 'Collection' object is not callable. If you meant to call the 'insert_one' method on a 'Collection' object it is failing because no such method exists.
这是我的代码
import pymongo
import praw
import time
def main():
fpid = os.fork()
if fpid!=0:
# Running as daemon now. PID is fpid
sys.exit(0)
user_agent = ("Python Scraper by djames v0.1")
r = praw.Reddit(user_agent = user_agent) #Reddit API requires user agent
conn=pymongo.MongoClient()
db = conn.reddit
threads = db.threads
while 1==1: #Runs in an infinite loop, loop repeats every 30 seconds
frontpage_pull = r.get_front_page(limit=100) #get first 100 posts from reddit.com
for posts in frontpage_pull: #repeats for each of the 100 posts pulled
data = {}
data['title'] = posts.title
data['text'] = posts.selftext
threads.insert_one(data)
time.sleep(30)
if __name__ == "__main__":
main()
答案 0 :(得分:2)
insert_one()
才被添加到pymongo。如果您尝试在之前的版本上调用它,您将收到您看到的错误。
要检查你的pymongo版本,打开一个python解释器并输入:
import pymongo
pymongo.version
使用pymongo插入文档的传统方法只有Collection.insert()
。因此,在您的情况下,您可以将插入行更改为:
threads.insert(data)
有关详细信息,请参阅pymongo 2.8 documentation