Question

Suppose I have the URLs of the images on a page:

for i in wiki.images:
    print(i)

Is there a simple way to get the image titles?

Answer 1

If you are iterating over all of the image URLs, you can try:

for i in wiki.images:
    # The file name is the last path segment of the URL, hence index -1.
    print(i.split('/')[-1])

The code above gives you the file name of each image.
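File names taken straight from a URL are usually percent-encoded and use underscores instead of spaces. Here is a minimal sketch of how you might clean them up with the standard library. The helper name image_name_from_url and the sample URL are made up for illustration, and I'm assuming each entry in wiki.images is a plain URL string.

```python
from urllib.parse import unquote, urlsplit

def image_name_from_url(url):
    """Return a readable file name from an image URL (hypothetical helper)."""
    # Keep only the path so any query string or fragment is ignored.
    path = urlsplit(url).path
    # The file name is the last path segment; decode %XX escapes.
    name = unquote(path.rsplit('/', 1)[-1])
    # Underscores in the file name stand in for spaces.
    return name.replace('_', ' ')

# Hypothetical URL, used only to show the transformation.
sample_url = "https://upload.wikimedia.org/wikipedia/commons/1/12/Mount_Fuji%2C_Japan.jpg"
print(image_name_from_url(sample_url))
```

Inside the loop you would call image_name_from_url(i) instead of splitting by hand.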

Hope this helps...

Answer 2

If what you want is the title attribute of the image tags (for example, from the HTML), you can do the following:

import wikipedia
from html.parser import HTMLParser

class WikipediaImageParser(HTMLParser):
    def handle_starttag(self, tag, attrs):
        if tag == 'img':
            # attrs is a list of (name, value) pairs; skip images without a title.
            title = dict(attrs).get('title')
            if title is not None:
                print(title)

page = wikipedia.page("History_of_Japan")
parser = WikipediaImageParser()
parser.feed(page.html())

This parses the HTML, builds a dictionary of each image's attributes, and then simply checks whether a title attribute is present.
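If you would rather collect the titles than print them, a variation along these lines might work. This is only a sketch: the class name ImageTitleCollector and the sample HTML string are made up, and falling back to the alt attribute when title is missing is my own assumption, not something the question asked for.

```python
from html.parser import HTMLParser

class ImageTitleCollector(HTMLParser):
    """Collect (src, title) pairs for img tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.titles = []

    def handle_starttag(self, tag, attrs):
        if tag != 'img':
            return
        attr_map = dict(attrs)
        # Assumption: use alt text when there is no title attribute.
        title = attr_map.get('title') or attr_map.get('alt')
        if title:
            self.titles.append((attr_map.get('src'), title))

# Made-up HTML standing in for the output of page.html().
sample_html = (
    '<p><img src="a.jpg" title="Mount Fuji">'
    '<img src="b.jpg" alt="Kinkaku-ji">'
    '<img src="c.jpg"></p>'
)
collector = ImageTitleCollector()
collector.feed(sample_html)
print(collector.titles)
```

With a real page you would feed it page.html() in place of sample_html and then read collector.titles.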

How to get image titles using the Wikipedia API
