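In my UWP app I need to get the title and link of the first video on a YouTube results page using HtmlAgilityPack, but the result looks like the following, and the values of title and href are not shown: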
id="video-title"
class="yt-simple-endpoint style-scope ytd-video-renderer"
aria-label$="[[data.title.accessibility.accessibilityData.label]]"
href$="[[computeHref_(data.navigationEndpoint)]]"
title$="[[getSimpleString(data.title)]]"
data="[[data.navigationEndpoint]]">
Here is the code:
var html = @"https://www.youtube.com/results?search_query=cado+dalle+nubi+trailer+ita";
HtmlWeb web = new HtmlWeb();
HtmlDocument htmlDoc = web.Load(html);
var node = htmlDoc.DocumentNode.SelectSingleNode("//div/div/div/div/h3/a");
string result = node.OuterHtml;
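Answer 0 (score: 2)
Use WebView instead of HtmlWeb to get the HTML. The [[...]] values in your output are unrendered template bindings, which suggests the page fills in title and href with JavaScript. HtmlWeb only downloads the raw markup, whereas WebView runs the page's scripts before you read the HTML.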
WebView webView = new WebView();
webView.Navigate(new Uri(@"https://www.youtube.com/results?search_query=cado+dalle+nubi+trailer+ita"));
private async void WebView_NavigationCompletedAsync(WebView sender, WebViewNavigationCompletedEventArgs args)
{
var siteHtML = await webView.InvokeScriptAsync("eval", new string[] { "document.documentElement.innerHTML;" });
}
The first video is at the second index, so use SelectNodes and pick the element at index 1:
var htmlDoc = new HtmlDocument();
htmlDoc.LoadHtml(siteHtML); // siteHtML is markup, not a URL, so use LoadHtml rather than web.Load
string result = htmlDoc.DocumentNode.SelectNodes("//div/div/div/div/h3/a")[1].OuterHtml;
Here is the final code:
WebView webView = new WebView();
public void GetFirstVideo(string UrlString)
{
// Subscribe before navigating so the completion event cannot be missed
webView.NavigationCompleted -= WebView_NavigationCompletedAsync; // avoid subscribing more than once
webView.NavigationCompleted += WebView_NavigationCompletedAsync;
webView.Navigate(new Uri(UrlString));
}
private async void WebView_NavigationCompletedAsync(WebView sender, WebViewNavigationCompletedEventArgs args)
{
webView.NavigationCompleted -= WebView_NavigationCompletedAsync; //To stop if there is any re-direct
var siteHtML = await webView.InvokeScriptAsync("eval", new string[] { "document.documentElement.innerHTML;" });
var htmlDocument = new HtmlDocument();
htmlDocument.LoadHtml(siteHtML);
var firstVideoTitle = htmlDocument.DocumentNode.SelectNodes("//div/div/div/div/h3/a")[1].OuterHtml;
}
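The final code stores the anchor's OuterHtml, while the question asks for the title and the link separately. A minimal sketch of reading those two attributes from the rendered HTML might look like the following. VideoLinkReader and TryReadLink are hypothetical names introduced here for illustration; the XPath and the index 1 are taken from the answer above and may well need adjusting if YouTube changes its markup.

```csharp
using System;
using HtmlAgilityPack;

public static class VideoLinkReader
{
    // Hypothetical helper (not part of the original answer): reads the title and
    // href attributes of the anchor at the given zero-based position.
    // Returns false when there is no anchor at that position.
    public static bool TryReadLink(string renderedHtml, int index, out string title, out string href)
    {
        title = null;
        href = null;

        var doc = new HtmlDocument();
        doc.LoadHtml(renderedHtml);

        // Same XPath as in the answer. SelectNodes returns null, not an empty
        // collection, when nothing matches, so check before indexing.
        HtmlNodeCollection anchors = doc.DocumentNode.SelectNodes("//div/div/div/div/h3/a");
        if (anchors == null || index < 0 || index >= anchors.Count)
        {
            return false;
        }

        HtmlNode anchor = anchors[index];
        // The second argument is the fallback used when the attribute is missing.
        title = anchor.GetAttributeValue("title", string.Empty);
        href = anchor.GetAttributeValue("href", string.Empty);
        return true;
    }
}
```

Inside WebView_NavigationCompletedAsync this could be called as VideoLinkReader.TryReadLink(siteHtML, 1, out var title, out var href). If the href turns out to be relative (for example /watch?v=...), new Uri(new Uri("https://www.youtube.com"), href) would turn it into an absolute link.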