Quarterly Yahoo Finance data using R

Time: 2020-08-07 14:57:47

Tags: r rvest

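I'm trying to web-scrape Yahoo Finance data. I found a solution that works for some of the data, but I can't figure out how to make the leap from annual to quarterly data, and I wonder whether I'm going down the wrong path. This is the solution that worked for me: R: web scraping yahoo.finance after 2019 change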

1 answer:

Answer 0: (score: 0)

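One problem with scraping that page is that it defaults to annual data. The quarterly data is only loaded into the browser after the user clicks the "Quarterly" button. That is bad for scraping, but good for intercepting the API request. If you open the developer console in your browser, go to the "Network" tab, and then click the "Quarterly" button, you will see a request (the URL is very long, so I've put it at the bottom). That request returns JSON data.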

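Disclaimer: I don't know much about R. After some research, though, I found that R has several packages that let you read JSON data, so you can do something like the following: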

# using rjson (fromJSON accepts a file path or a URL via `file`)
url <- "<get from down below>"
data <- rjson::fromJSON(file = url)

# using jsonlite (fromJSON accepts a URL directly)
library(jsonlite)

url <- "<get from down below>"
data <- fromJSON(url)

Here is the URL:

https://query1.finance.yahoo.com/ws/fundamentals-timeseries/v1/finance/premium/timeseries/AAPL?lang=en-US&region=US&symbol=AAPL&padTimeSeries=true&type=annualEbitda%2CtrailingEbitda%2CannualDilutedAverageShares%2CtrailingDilutedAverageShares%2CannualBasicAverageShares%2CtrailingBasicAverageShares%2CannualDilutedEPS%2CtrailingDilutedEPS%2CannualBasicEPS%2CtrailingBasicEPS%2CannualNetIncomeCommonStockholders%2CtrailingNetIncomeCommonStockholders%2CannualNetIncome%2CtrailingNetIncome%2CannualNetIncomeContinuousOperations%2CtrailingNetIncomeContinuousOperations%2CannualTaxProvision%2CtrailingTaxProvision%2CannualPretaxIncome%2CtrailingPretaxIncome%2CannualOtherIncomeExpense%2CtrailingOtherIncomeExpense%2CannualInterestExpense%2CtrailingInterestExpense%2CannualOperatingIncome%2CtrailingOperatingIncome%2CannualOperatingExpense%2CtrailingOperatingExpense%2CannualSellingGeneralAndAdministration%2CtrailingSellingGeneralAndAdministration%2CannualResearchAndDevelopment%2CtrailingResearchAndDevelopment%2CannualGrossProfit%2CtrailingGrossProfit%2CannualCostOfRevenue%2CtrailingCostOfRevenue%2CannualTotalRevenue%2CtrailingTotalRevenue&merge=false&period1=493590046&period2=1596836602&corsDomain=finance.yahoo.com
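
Since that URL is mostly a list of query parameters, it can be assembled from its parts rather than pasted whole. The following is a minimal sketch in base R, not a documented Yahoo API: the helper name `build_timeseries_url` is hypothetical, the parameter names are simply copied from the URL above, and `period1`/`period2` appear to be Unix timestamps in seconds. The URL above only lists `annual...` and `trailing...` types; the `quarterly` prefix used in the usage line is an assumption that this answer does not confirm, so check it against the request shown in your browser's Network tab.

```r
# Hypothetical helper: assemble a fundamentals-timeseries URL from its parts.
# Parameter names mirror the URL captured from the browser's Network tab.
build_timeseries_url <- function(symbol, types,
                                 period1 = 493590046,
                                 period2 = as.integer(Sys.time())) {
  base <- paste0(
    "https://query1.finance.yahoo.com/ws/fundamentals-timeseries/",
    "v1/finance/premium/timeseries/", symbol
  )
  query <- c(
    lang = "en-US",
    region = "US",
    symbol = symbol,
    padTimeSeries = "true",
    # Types are comma-separated; the comma is percent-encoded as %2C.
    type = paste(types, collapse = "%2C"),
    merge = "false",
    # Unix timestamps in seconds, written without scientific notation.
    period1 = format(period1, scientific = FALSE),
    period2 = format(period2, scientific = FALSE),
    corsDomain = "finance.yahoo.com"
  )
  paste0(base, "?", paste0(names(query), "=", query, collapse = "&"))
}

# Usage: the "quarterly" prefix is an assumption, not confirmed by the answer.
metrics <- c("TotalRevenue", "NetIncome")
url <- build_timeseries_url("AAPL", paste0("quarterly", metrics))
```

The resulting `url` can then be passed to `fromJSON` exactly as in the snippets above. Keeping the metric names in a vector makes it easy to request only the fields you need.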

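There is another URL you can use to get quarterly income statement data, but it seems to behave somewhat erratically for companies outside the US: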

https://query2.finance.yahoo.com/v10/finance/quoteSummary/aapl?modules=incomeStatementHistoryQuarterly
