Question

我试图从这个网站收集数据：http://services.runescape.com/m=hiscore_oldschool/index_lite.ws?player=f2pshrympy

using HtmlAgilityPack;
using System;

var webGet = new HtmlWeb();
var document = webGet.Load("http://services.runescape.com/m=hiscore_oldschool/index_lite.ws?player=f2pshrympy");
var bodyText = document.DocumentNode.SelectNodes("/html/body/text()");
Console.WriteLine(bodyText);
Console.ReadLine();

程序运行时，没有任何内容打印到控制台，也没有错误。

screenshot of the console

我猜测XPath＆＃34; / html / body / text（）＆＃34;没有找到任何内容，有什么想法可以解决这个问题吗？

Answer 1

您的网页是纯文字。因此，您不需要像 HtmlAgilityPack 这样的工具来解析它。只需下载并使用它即可。

using (var wc = new WebClient())
{
    var bodyText = wc.DownloadString("http://services.runescape.com/m=hiscore_oldschool/index_lite.ws?player=f2pshrympy");

}

从网站C＃的体内抓取InnerText

1 个答案: