如何从字符串列表中提取所需的数据?

时间:2019-06-17 11:04:42

标签: python web-scraping

我正在用Python编写刮板。 我想提取一些数据,例如公司名称,工作职位和公司位置。

我在Python的字符串理解的帮助下成功提取了数据并制作了一个数组。 现在,我想根据上述要求将数据标记化。 我想写一个刮板应该只刮擦自上次运行以来网站上的新信息。以下是应抓取的字段:

  • 正在招聘的公司名称
  • 公司所在地
  • 广告所针对的位置

这是链接https://news.ycombinator.com/jobs

我抓取的所有数据的列表:

 [
'Pachyderm Raised $10M and Is Looking for a Senior JavaScript Engineer',
'Jerry (YC S17) Is Hiring Software Developers in Toronto', 
'Streak (YC S11) Is Hiring in Vancouver', 
'Iris Automation Is Hiring a Computer Vision Expert – AI for Drones',
'Taplytics Is Hiring Full-Stack Engineers in Toronto', 
'Tesorio (YC S15) Is Hiring Engineering Managers, Senior Engineers',
'Pivot (YC W15) is hiring engineers – live home fitness using computer vision', 
'Shipamax (YC W17) is hiring a software engineer with ML experience (London)', 
'Flexport is hiring software engineers for our humanitarian relief logistics team', 
'Find your next job at the YC Work at a Startup Expo June 29', 
'Mux is hiring for video at scale: product, engineering, and developer advocates', 
'Human Interest (YC S15) Is Hiring a Senior Product Manager', 
'The Muse (YC W12) Is Hiring a Senior Marketing Analyst', 
'Lawn Love (YC S14) is hiring a Product Manager – Help change a $90B industry', 
'ReadMe (YC W15) is hiring – help make APIs easy to use',
'Scale AI is hiring engineers to accelerate the development of AI',
'Ribbon (YC S17) is hiring engineers – help simplify healthcare using data', 
'BuildZoom (YC W13) is hiring – help make remodeling cheaper', 
'Impraise (YC S14) Is Hiring an Sales Account Executive in NYC', 
'Hypcloud (YC S18) is hiring a senior front-end engineer in Berlin and/or remote', 
'ZeroCater (YC W11) Is Hiring a Principal Engineer in SF: Foodies Welcome', 
'CoinTracker (YC W18) is hiring – a profitable crypto company without an ICO', 
'SmartAsset (YC S12) Is Hiring SEM Manager', 
'Circle Medical (YC S15) Is Hiring Phlebotomists in San Francisco',
'Atomwise (YC W15) Is Hiring a Senior Machine Learning Research Scientist in SF', 
'Etleap (YC W13) Is Hiring a Marketing Manager in SF', 
'Demodesk (YC W19) Is Hiring Software Engineers in Munich, Germany',
'Newfront (YC W18) is hiring engineers to help transform the insurance industry', 
'OneSignal Is Hiring Full Stack Engineers in San Mateo', 
'Aptible (YC S14) Is Hiring a Senior Software Engineer – Remote/Onsite (NYC, SF)'
]

0 个答案:

没有答案