Question

I want to find all audio files on a website using Python, such as .mp3, .wav, .ogg, .wma, etc. Here is my code:

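    import re
    import urllib.request
    from bs4 import BeautifulSoup

    url = urllib.request.urlopen(link)
    content = url.read()
    soup = BeautifulSoup(content, "html.parser")
    # Keep only anchors whose href looks like an http(s) link to an .mp3 file
    links = [a['href'] for a in soup.find_all('a', href=re.compile(r'http.*\.mp3'))]
    print(str(len(links)) + " Audios Found ")
    # print(links)
    print("\n".join(links))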

This only finds .mp3 links. I want the other audio links as well.

Answer 1

Since you are using a regular expression to select the links, change this line

    links = [a['href'] for a in soup.find_all('a', href=re.compile(r'http.*\.mp3'))]

to

    links = [a['href'] for a in soup.find_all('a', href=re.compile(r'http.*\.(mp3|wav|ogg|wma)'))]
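Note that the pattern above is not anchored, so it also accepts an href such as `http://site/page.mp3.html`, and it skips relative links and upper-case extensions. If that matters, a stricter filter might look like the sketch below. It is only an illustration using the standard library: the names `AUDIO_EXTENSIONS`, `is_audio_url`, and `audio_links` are made up for this example, and the extension list is an assumption you would adjust to your needs.

```python
import re
from urllib.parse import urljoin, urlparse

# Assumed list of extensions for this example; extend it as needed.
AUDIO_EXTENSIONS = ("mp3", "wav", "ogg", "wma")

# Matches a path that ends in one of the extensions, ignoring case.
AUDIO_PATH = re.compile(r"\.(?:%s)$" % "|".join(AUDIO_EXTENSIONS), re.IGNORECASE)


def is_audio_url(href):
    """Return True if the URL's path ends in a known audio extension."""
    # urlparse separates the path from any query string or fragment,
    # so "song.mp3?download=1" still counts and "page.mp3.html" does not.
    return bool(AUDIO_PATH.search(urlparse(href).path))


def audio_links(hrefs, base_url):
    """Resolve each href against base_url and keep only the audio ones."""
    absolute = (urljoin(base_url, h) for h in hrefs)
    return [url for url in absolute if is_audio_url(url)]
```

With BeautifulSoup, the hrefs could be collected with `[a['href'] for a in soup.find_all('a', href=True)]` and then passed to `audio_links` together with the page URL.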
