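I have come across a web page that seems a bit tricky to scrape. When I go to "https://jobboerse2.arbeitsagentur.de/jobsuche/?s=1", I land on a page with a "suchen" option. Clicking "suchen" opens a new layout in the same tab and takes me to a page listing many job titles, yet the address stays the same: "https://jobboerse2.arbeitsagentur.de/jobsuche/?s=1".

I want to scrape the titles from that page, such as "Mitarbeiter für die Leerguttrennung (m/w)". Any help would be highly appreciated.

What I have written so far: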
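' Requires references to "Microsoft XML, v6.0" and "Microsoft HTML Object Library".
Sub WebData()
    Dim http As New MSXML2.XMLHTTP60
    Dim html As New HTMLDocument, source As Object, item As Object
    Dim x As Long   ' output row counter

    ' Fetch the page synchronously and load the raw markup into the HTML document.
    With http
        .Open "GET", "https://jobboerse2.arbeitsagentur.de/jobsuche/?s=1", False
        .send
        html.body.innerHTML = .responseText
    End With

    ' Write the text of every matching element into column A.
    Set source = html.getElementsByClassName("ng-binding ng-scope")
    For Each item In source
        x = x + 1
        Cells(x, 1) = item.innerText
    Next item

    Set html = Nothing: Set source = Nothing
End Sub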
In the developer tools, the XHR links increment as shown below, but I cannot work out what the offset of the last link will be.
"https://jobboerse2.arbeitsagentur.de/jobsuche/pc/v1/jobs"
"https://jobboerse2.arbeitsagentur.d...00&FCT.ANGEBOTSART=ARBEIT&FCT.BEHINDERUNG=AUS"
"https://jobboerse2.arbeitsagentur.d...EBOTSART=ARBEIT&FCT.BEHINDERUNG=AUS&offset=12"
"https://jobboerse2.arbeitsagentur.d...EBOTSART=ARBEIT&FCT.BEHINDERUNG=AUS&offset=24"
"https://jobboerse2.arbeitsagentur.d...EBOTSART=ARBEIT&FCT.BEHINDERUNG=AUS&offset=36"
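
Since the offsets above go up in steps of 12, one way to cope with not knowing the last one is to keep requesting pages until the server stops returning anything new. The following is only a rough sketch of that idea, not a tested solution for this site. `BASE_URL` is a hypothetical placeholder (the full query string is truncated above, so the complete XHR URL would have to be copied from the Network tab), `MAX_PAGES` is an arbitrary safety cap, and the stop condition is an assumption, because how the server answers past the last page is not known here.

```vba
Option Explicit

' Hypothetical placeholder: paste the complete XHR URL (without "&offset=...")
' copied from the browser's Network tab. The URLs in the post are truncated.
Private Const BASE_URL As String = "PASTE_FULL_XHR_URL_HERE"
Private Const PAGE_SIZE As Long = 12    ' offsets in the post are 12, 24, 36
Private Const MAX_PAGES As Long = 500   ' arbitrary safety cap (assumption)

' Builds the URL for one page; the first request in the post carries no offset.
Function PageUrl(ByVal baseUrl As String, ByVal offset As Long) As String
    If offset = 0 Then
        PageUrl = baseUrl
    Else
        PageUrl = baseUrl & "&offset=" & CStr(offset)
    End If
End Function

Sub FetchAllPages()
    Dim http As Object
    Dim page As Long, offset As Long, rowNum As Long
    Dim body As String, previousBody As String

    Set http = CreateObject("MSXML2.XMLHTTP.6.0")

    For page = 0 To MAX_PAGES - 1
        offset = page * PAGE_SIZE
        http.Open "GET", PageUrl(BASE_URL, offset), False
        http.send

        ' Assumption: a non-200 status, an empty body, or a body identical
        ' to the previous one means there are no further results.
        If http.Status <> 200 Then Exit For
        body = http.responseText
        If Len(body) = 0 Or body = previousBody Then Exit For

        ' Log the offset and the start of the raw response for inspection.
        rowNum = rowNum + 1
        Cells(rowNum, 1).Value = offset
        Cells(rowNum, 2).Value = Left$(body, 200)

        previousBody = body
    Next page
End Sub
```

The sketch only logs the raw responses; pulling the job titles out of them would need a parser matched to whatever format the endpoint actually returns. If the response turns out to include a total hit count, the last offset could be computed from that instead of probing page by page.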