I have the following code:
> from selenium import webdriver
> browser = webdriver.Chrome(executable_path=r"C:\Users\ABC\AppData\Local\Programs\Python\Python35-32\Lib\site-packages\selenium\webdriver\common\chromedriver.exe")
> browser.get('http://www.linkedin.com/pub/dir/?first=jatin&last=wadhwa&trk=prof-samename-search-submit')
> print(browser.page_source)
What happens: I want to open -> http://www.linkedin.com/pub/dir/?first=jatin&last=wadhwa&trk=prof-samename-search-submit
but the browser is redirected to ->
Is there any way to open the requested link instead of the redirected one?
Answer 0: (score: 0)
LinkedIn does not serve search pages to visitors who are not logged in. Log in to LinkedIn first, and then you can scrape the data.
from selenium.webdriver.common.action_chains import ActionChains
# email_id and password must be defined beforehand with your own credentials
browser.get('https://www.linkedin.com/')
elem = browser.find_element_by_name('session_key')
elem.clear()
elem.send_keys(email_id) # enter your email id or phone number
elem = browser.find_element_by_name('session_password')
elem.clear()
elem.send_keys(password) # enter your linkedin password
submit = browser.find_element_by_xpath('//*[@id="pagekey-uno-reg-guest-home"]/div[1]/div/form/input[6]')
actions = ActionChains(browser)
actions.click(submit)
actions.perform() # after this you will be logged in
# Now you can open the url without being redirected
browser.get('http://www.linkedin.com/pub/dir/?first=jatin&last=wadhwa&trk=prof-samename-search-submit')
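To check whether the final `browser.get` actually landed on the requested page, one option is to compare the requested URL with `browser.current_url`. The helper below is only a sketch: `was_redirected` is a hypothetical name, and it assumes that a change of host or path counts as a redirect while a change of scheme (http to https) or query string does not.

```python
from urllib.parse import urlsplit


def was_redirected(requested_url, current_url):
    """Return True if the browser ended up on a different host or path.

    Assumption: scheme and query-string differences are ignored, so an
    http -> https upgrade of the same page is not treated as a redirect.
    """
    requested = urlsplit(requested_url)
    current = urlsplit(current_url)
    same_host = requested.netloc.lower() == current.netloc.lower()
    same_path = requested.path.rstrip("/") == current.path.rstrip("/")
    return not (same_host and same_path)
```

After the `browser.get(url)` call you could write `if was_redirected(url, browser.current_url): ...` and retry the login or stop scraping.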
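Answer 1: (score: 0)
If you crawl too fast, LinkedIn will redirect you to the sign-up page.
I suggest waiting for a while between HTTP requests.
Deleting your cookies would not hurt either.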
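A minimal sketch of that advice, assuming a Selenium `browser` object like the one in the question. The function name `fetch_pages`, the 5 to 10 second delay range, and the `clear_cookies` switch are illustrative choices, not values known to satisfy LinkedIn. Note that clearing cookies also discards a logged-in session, so it is off by default.

```python
import random
import time


def fetch_pages(browser, urls, min_delay=5.0, max_delay=10.0,
                clear_cookies=False, sleep=time.sleep):
    """Load each URL in turn, pausing a random interval between requests.

    The delay bounds are assumptions; tune them for your own use.
    """
    pages = []
    for index, url in enumerate(urls):
        if index > 0:
            # Wait before every request except the first one.
            sleep(random.uniform(min_delay, max_delay))
        if clear_cookies:
            # Drops the cookies of the current session, including any login.
            browser.delete_all_cookies()
        browser.get(url)
        pages.append(browser.page_source)
    return pages
```

The `sleep` parameter exists only so the pause can be swapped out when trying the function without a real browser; in normal use, call `fetch_pages(browser, list_of_urls)`.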