Question

我正在尝试从网页上截取实时流数据表和数据我试过了：

library(XML)
webpage  <- "http://www.investing.com/indices/us-30"

tables <- readHTMLTable(webpage )
n.rows <- unlist(lapply(tables, function(t) dim(t)[1]))

tables
n.rows

但是我收到了错误。

感谢您的帮助。

Answer 1

我为自己的工作感到骄傲（我一直试图让这些东西长时间处理......）

library(rvest)
url4 <- "http://www.investing.com/indices/us-30-historical-data"

yourdata <- url4 %>% read_html() %>% 
html_nodes(xpath = '//*[@id="results_box"]/table[1]') %>% 
html_table()   

yourdata <- as.data.frame(yourdata)

webscraping来自网页的数据表和数据

1 个答案: