我想从youtube视频中获取一些标签,例如标题,观看次数等。我使用BeautifulSoup,但我想让它更快。这是我的代码:
#for the title
from BeautifulSoup import BeautifulSoup
html = re.findall('content=.*>\n\n',urllib2.urlopen(link).read())
soup = BeautifulSoup(html)
print soup.prettify()
#for the number of views
soup0 = BeautifulSoup(urllib2.urlopen(link).read())
for items in soup0.findAll('strong'):
if re.match("^[0-9]*$", str(items).strip("<strong>").rstrip("</strong>")):
viewcount=str(strongs).strip("<strong>").rstrip("</strong>")
答案 0 :(得分:7)
他们的部分示例:
def PrintEntryDetails(entry):
print 'Video title: %s' % entry.media.title.text
print 'Video published on: %s ' % entry.published.text
print 'Video description: %s' % entry.media.description.text
print 'Video category: %s' % entry.media.category[0].text
print 'Video tags: %s' % entry.media.keywords.text
print 'Video watch page: %s' % entry.media.player.url
print 'Video flash player URL: %s' % entry.GetSwfUrl()
print 'Video duration: %s' % entry.media.duration.seconds