使用httr R包发送POST请求

时间:2015-03-08 09:09:17

标签: r http post login httr

在通过POST请求发送服务器数据时,我无法弄清楚如何模仿浏览器的功能。以下是相关网址,下面有解释。

(1) http://kenpom.com/

(2) http://kenpom.com/register.php?frompage=1
<form id="login" method="POST" action="handlers/login_handler.php">
<label>E-mail </label><input type="text" name="email" />
<label>Password </label><input type="password" name="password" />
<input type="submit" name="submit" value="Login!" />

(3) http://kenpom.com/team.php?team=Rice

(1)主页(未登录时选择团队页面,重定向 - &gt;(2))

(2)登录页面(成功登录后重定向到团队特定页面)

(3)团队特定页面:例如稻

url <- ("http://kenpom.com/team.php?team=Rice")

login <- list(
        email = "login",
        password = "password"
)

teampage <- POST(url, body = login)

Response [http://kenpom.com/register.php?frompage=1]
  Date: 2015-03-07 23:04
  Status: 200
  Content-Type: text/html
  Size: 7.45 kB
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML>
<HEAD>
<LINK REL=stylesheet TYPE="text/css" HREF="css/rate.css?1414365416">
<TITLE>kenpom.com subscription</TITLE>

最终想要使用rvest软件包抓取一些信息,但最终结果是空的,因为它试图刮掉:http://kenpom.com/register.php?frompage=1

1 个答案:

答案 0 :(得分:17)

尝试

library(httr)
login <- list(
  email = "login",
  password = "password"
  submit = "Login!"
)
res <- POST("http://kenpom.com/handlers/login_handler.php", body = login, encode = "form", verbose())
team <- GET("http://kenpom.com/team.php?team=Rice", verbose())