Python 3.6.4: scraping a website that requires login

Date: 2018-02-01 01:25:00

Tags: python-3.x

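Login URL: https://joffice.jeunesseglobal.com/login.asp. The form takes two fields: a username and a password. After logging in, I want to use the session cookie to access https://joffice.jeunesseglobal.com/members/back_office.asp, but the login does not succeed.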

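# -*- coding: utf-8 -*-
# Import the submodules explicitly; a bare "import urllib" does not guarantee
# that urllib.parse and urllib.request are loaded.
import urllib.parse
import urllib.request
import http.cookiejar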

url = 'https://joffice.jeunesseglobal.com/members/back_office.asp'
login_url = "https://joffice.jeunesseglobal.com/login.asp"
login_username = "jianghong181818"
login_password = "Js@168168!"

login_data = {
    "Username" : login_username,
    "pw" : login_password,
}

post_data = urllib.parse.urlencode(login_data).encode('utf-8')


headers = {'User-agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.113 Safari/537.36'}

req = urllib.request.Request(login_url, headers = headers, data = post_data)

cookie = http.cookiejar.CookieJar()

opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(cookie))

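# Log in first; the opener stores any cookies the server sets in `cookie`.
resp = opener.open(req)
print(resp.read().decode('utf-8'))

# Reuse the same opener so those cookies are sent with the protected page request.
resp = opener.open(urllib.request.Request(url, headers = headers))
print(resp.read().decode('utf-8'))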
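
When a cookie-based login like the one above fails, it can help to check what the login POST actually returned and which cookies were stored. The following is only a diagnostic sketch using the standard library; the helper names `build_login_request`, `cookie_names`, and `login_and_report` are hypothetical, and the form field names passed in are assumed to match the site's real login form.

```python
import http.cookiejar
import urllib.parse
import urllib.request


def build_login_request(login_url, form_fields, user_agent="Mozilla/5.0"):
    """Build a POST request carrying URL-encoded form fields."""
    body = urllib.parse.urlencode(form_fields).encode("utf-8")
    # Supplying `data` makes urllib send the request as a POST.
    return urllib.request.Request(
        login_url, data=body, headers={"User-Agent": user_agent}
    )


def cookie_names(jar):
    """Return the sorted names of all cookies currently held in the jar."""
    return sorted(c.name for c in jar)


def login_and_report(login_url, form_fields):
    """Send the login request, print what came back, and return the opener."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(jar)
    )
    with opener.open(build_login_request(login_url, form_fields)) as resp:
        # A final URL that is still the login page often means the login failed.
        print("status:", resp.status, "final URL:", resp.geturl())
    print("cookies set:", cookie_names(jar))
    return opener
```

Calling `login_and_report(login_url, login_data)` returns an opener that can then be used for the protected page. If no session cookie appears, the form may expect different field names or extra hidden fields than the ones being sent.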

1 answer:

Answer 0 (score: 2)

Use the requests library.

The simple way:

>>> import requests
>>> page = requests.get("https://joffice.jeunesseglobal.com/login.asp", auth=('username', 'password'))
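
For context, passing a `(username, password)` tuple as `auth` makes requests use HTTP Basic Auth: the credentials travel base64-encoded in an `Authorization` header, not as form fields. The sketch below shows how such a header value is built with the standard library only; the helper name `basic_auth_header` is hypothetical, and UTF-8 encoding of the credentials is an assumption.

```python
import base64


def basic_auth_header(username, password):
    """Return the value of an HTTP Basic Auth `Authorization` header."""
    # Basic Auth joins the credentials with a colon and base64-encodes them.
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


print(basic_auth_header("user", "pass"))  # Basic dXNlcjpwYXNz
```

This only helps if the server actually checks that header; a site that logs users in through an HTML form will typically ignore it.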

Making the request with HTTP Basic Auth explicitly:

>>> from requests.auth import HTTPBasicAuth
>>> requests.get("https://joffice.jeunesseglobal.com/login.asp", auth=HTTPBasicAuth('user', 'pass'))
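
If the site logs users in through an HTML form instead of Basic Auth, as the question's `Username` and `pw` fields suggest, a `requests.Session` may be a closer fit, since it keeps cookies between requests. This is a minimal sketch, not a confirmed solution for this site: the helper names `login_form` and `login_and_fetch` are hypothetical, the field names are taken from the question's code, and the real form may require additional hidden fields.

```python
import requests


def login_form(username, password):
    """Build the form payload; field names are assumed from the question."""
    return {"Username": username, "pw": password}


def login_and_fetch(login_url, protected_url, username, password):
    """POST the login form, then fetch a protected page in the same session."""
    with requests.Session() as session:
        session.headers["User-Agent"] = "Mozilla/5.0"
        # The session stores any cookies set by the login response.
        login_resp = session.post(login_url, data=login_form(username, password))
        login_resp.raise_for_status()
        # The stored cookies are sent automatically with this request.
        page = session.get(protected_url)
        page.raise_for_status()
        return page.text
```

A call would look like `login_and_fetch(login_url, url, login_username, login_password)` using the variables from the question. A successful status code alone does not prove the login worked, so the returned HTML still needs to be checked for the expected content.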