Question

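I have a list of websites in a text file, and I want to collect the metadata from each home page. How can I do this?

For example, for the website "", the metadata I want to retrieve is: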

meta name="keywords" content="Chamber of Mines, South Africa, Mining"

Answer 1

Something like this:

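# Download the home page as a single string
$web = New-Object Net.WebClient
$a = $web.DownloadString("http://www.bullion.org.za")
# Print every match of the keywords meta tag (note: .+ is greedy within a line)
$a | Select-String -Pattern '(meta name="keywords".+")' -AllMatches |
   Select-Object -ExpandProperty Matches | Select-Object -ExpandProperty Value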

This returns:

meta name="keywords" content="Chamber of Mines, South Africa, Mining"
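
The snippet above handles a single URL, while the question asks about a list stored in a text file. A minimal sketch of one way to extend it follows. It assumes a hypothetical file named sites.txt with one URL per line, and that the tag is written with name before content, as in the example output. The function names Get-MetaKeywords and Get-SiteKeywords are illustrative, not part of PowerShell, and [pscustomobject] needs PowerShell 3 or later.

```powershell
# Extract the content value of <meta name="keywords" content="..."> from an HTML string.
# Returns $null when the tag is absent. Assumes name appears before content.
function Get-MetaKeywords {
    param([string]$Html)
    $pattern = '<meta\s+name="keywords"\s+content="([^"]*)"'
    $m = [regex]::Match($Html, $pattern, 'IgnoreCase')
    if ($m.Success) { $m.Groups[1].Value } else { $null }
}

# Read URLs from a text file (hypothetical: one URL per line) and
# emit one object per site with its keywords or an error message.
function Get-SiteKeywords {
    param([string]$Path)
    $web = New-Object Net.WebClient
    # Skip blank lines
    $urls = Get-Content $Path | Where-Object { $_.Trim() }
    foreach ($url in $urls) {
        $url = $url.Trim()
        try {
            $html = $web.DownloadString($url)
            [pscustomobject]@{ Url = $url; Keywords = Get-MetaKeywords $html }
        } catch {
            # Report unreachable sites without stopping the whole run
            [pscustomobject]@{ Url = $url; Keywords = "ERROR: $($_.Exception.Message)" }
        }
    }
}
```

Calling it might look like `Get-SiteKeywords -Path .\sites.txt | Export-Csv keywords.csv -NoTypeInformation`, which writes one row per site. A regular expression is a fragile way to read HTML, so sites that order the attributes differently or use single quotes would need a looser pattern or a real HTML parser.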

PowerShell: extract metadata from a list of websites