r - Ignoring a div class with rvest - Thinbug

Ignoring a div class with rvest

Time: 2016-01-23 23:57:02

Tags: r xpath web-scraping rvest

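I am trying the following to extract part of a page while ignoring one of its nested fields, "rating". Neither of the XPath expressions below extracts what I need. Any suggestions?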

raw <- read_html(res[[1]])  %>%
  html_nodes(xpath = '(//div[@class="results" and not(div[contains(@class, "span4 mobile-span1 ratings")])]') 

raw <- read_html(res[[1]])  %>%
  html_nodes(xpath = '(//div[not(contains(@class, "span4 .mobile-span1 .ratings"))]//div[@class="results"])')
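For reference, two things stand out in the attempts above. The first expression opens a parenthesis that is never closed, and its `not(div[...])` predicate filters out whole "results" divs that have a ratings child rather than dropping the child itself. The second puts CSS-style dots inside the string passed to `contains()`, so it is compared against the literal text of the class attribute. A minimal sketch of two possible alternatives follows. The markup is hypothetical, since the real page behind `res[[1]]` is not shown, and the class names "title" and "span8" are assumptions made only for illustration.

```r
library(rvest)
library(xml2)

# Hypothetical markup: the structure of the real page is not shown in the question.
html <- '
<div class="results">
  <div class="span8 title">Widget A</div>
  <div class="span4 mobile-span1 ratings">4.5 stars</div>
</div>
<div class="results">
  <div class="span8 title">Widget B</div>
  <div class="span4 mobile-span1 ratings">3.0 stars</div>
</div>'

page <- read_html(html)

# Option 1: select the child elements of each results div,
# skipping any child whose class list includes the token "ratings".
kept <- html_nodes(
  page,
  xpath = '//div[@class="results"]/*[not(contains(concat(" ", normalize-space(@class), " "), " ratings "))]'
)
kept_text <- html_text(kept, trim = TRUE)

# Option 2: remove the ratings nodes from the parsed document in place,
# then extract the text of the results divs as usual.
ratings <- html_nodes(page, xpath = '//div[@class="results"]//div[contains(@class, "ratings")]')
xml_remove(ratings)
results_text <- html_text(html_nodes(page, xpath = '//div[@class="results"]'), trim = TRUE)
```

With this sample markup, both `kept_text` and `results_text` should contain only the two title strings. Option 2 modifies `page` in place, so it may be the more convenient route when the rest of each results div has to stay intact as a single node.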

0 answers:

No answers