Python Selenium is redirecting the URL to the sign-up page

Time: 2016-09-09 18:45:36

Tags: python session selenium

I have the following code:

> from selenium import webdriver
> browser = webdriver.Chrome(executable_path=r"C:\Users\ABC\AppData\Local\Programs\Python\Python35-32\Lib\site-packages\selenium\webdriver\common\chromedriver.exe")
> browser.get('http://www.linkedin.com/pub/dir/?first=jatin&last=wadhwa&trk=prof-samename-search-submit')
> print(browser.page_source)

What happens is that instead of opening -> http://www.linkedin.com/pub/dir/?first=jatin&last=wadhwa&trk=prof-samename-search-submit

it goes to ->

https://www.linkedin.com/start/join?session_redirect=http%3A%2F%2Fwww.linkedin.com%2Fpub%2Fdir%2F%3Ffirst%3Djatin%26last%3Dwadhwa%26trk%3Dprof-samename-search-submit&source=sentinel_org_block&trk=login_reg_redirect

Is there any solution so that the desired link opens instead of the redirected one?

2 answers:

Answer 0: (score: 0)

LinkedIn does not show the search page unless you are logged in. Log in to LinkedIn first, and then you can scrape the data.

from selenium.webdriver.common.action_chains import ActionChains  # needed for ActionChains below

browser.get('https://www.linkedin.com/')
elem = browser.find_element_by_name('session_key')
elem.clear()
elem.send_keys(email_id) # enter your email id or phone number

elem = browser.find_element_by_name('session_password')
elem.clear()
elem.send_keys(password) # enter your linkedin password

submit = browser.find_element_by_xpath('//*[@id="pagekey-uno-reg-guest-home"]/div[1]/div/form/input[6]')
actions = ActionChains(browser)
actions.click(submit)
actions.perform() # after this you will be logged in

# Now you can open the URL without being redirected

browser.get('http://www.linkedin.com/pub/dir/?first=jatin&last=wadhwa&trk=prof-samename-search-submit')
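To check whether the login actually took effect, one option is to look at browser.current_url after the final get. The following is only a minimal sketch: it assumes the sign-up redirect always lands under /start/join and carries the original target in the session_redirect query parameter, as in the URL from the question. The helper name redirect_target is hypothetical.

```python
from urllib.parse import urlparse, parse_qs


def redirect_target(current_url):
    """Return the originally requested URL if current_url is the sign-up
    redirect carrying one, otherwise None."""
    parts = urlparse(current_url)
    # Assumption: the sign-up redirect lives under /start/join, as in the question.
    if not parts.path.startswith("/start/join"):
        return None
    # parse_qs percent-decodes the parameter values.
    values = parse_qs(parts.query).get("session_redirect")
    return values[0] if values else None
```

With a live session, a non-None result from redirect_target(browser.current_url) would suggest that the login did not succeed and the page source is the sign-up page rather than the search results.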

Answer 1: (score: 0)

If you crawl too fast, LinkedIn will redirect you to the sign-up page.

I suggest you wait for a while between HTTP requests.

Deleting your cookies wouldn't hurt either.
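The two suggestions above could be sketched roughly as follows. The helper name fetch_slowly and the delay bounds are assumptions made for illustration, not values known to keep LinkedIn from redirecting. Note that clearing cookies only makes sense for anonymous crawling, since it would also end a logged-in session.

```python
import random
import time


def fetch_slowly(browser, urls, min_delay=5.0, max_delay=15.0, sleep=time.sleep):
    """Load each URL with a randomized pause, clearing cookies first.

    `browser` is expected to be a Selenium WebDriver. The delay bounds are
    arbitrary assumptions; tune them for your own case.
    """
    pages = {}
    for url in urls:
        # WebDriver method: drops the cookies visible to the current page.
        browser.delete_all_cookies()
        browser.get(url)
        pages[url] = browser.page_source
        # Wait a random amount of time before the next request.
        sleep(random.uniform(min_delay, max_delay))
    return pages
```

Called as fetch_slowly(browser, list_of_urls), it returns a dict mapping each URL to its page source. The sleep parameter exists only so the pause can be replaced in tests.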