在通过POST请求发送服务器数据时,我无法弄清楚如何模仿浏览器的功能。以下是相关网址,下面有解释。
(1) http://kenpom.com/
(2) http://kenpom.com/register.php?frompage=1
<form id="login" method="POST" action="handlers/login_handler.php">
<label>E-mail </label><input type="text" name="email" />
<label>Password </label><input type="password" name="password" />
<input type="submit" name="submit" value="Login!" />
(3) http://kenpom.com/team.php?team=Rice
(1)主页(未登录时选择团队页面,重定向 - &gt;(2))
(2)登录页面(成功登录后重定向到团队特定页面)
(3)团队特定页面:例如稻
url <- ("http://kenpom.com/team.php?team=Rice")
login <- list(
email = "login",
password = "password"
)
teampage <- POST(url, body = login)
Response [http://kenpom.com/register.php?frompage=1]
Date: 2015-03-07 23:04
Status: 200
Content-Type: text/html
Size: 7.45 kB
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML>
<HEAD>
<LINK REL=stylesheet TYPE="text/css" HREF="css/rate.css?1414365416">
<TITLE>kenpom.com subscription</TITLE>
最终想要使用rvest软件包抓取一些信息,但最终结果是空的,因为它试图刮掉:http://kenpom.com/register.php?frompage=1
答案 0 :(得分:17)
尝试
library(httr)
login <- list(
email = "login",
password = "password"
submit = "Login!"
)
res <- POST("http://kenpom.com/handlers/login_handler.php", body = login, encode = "form", verbose())
team <- GET("http://kenpom.com/team.php?team=Rice", verbose())