Question

我试图解析Beautiful Soup中的以下链接，我不确定这样做的最佳方法是什么。任何建议都将不胜感激。

由于

Answer 1

如果有人对此感兴趣，我想出了如何做到这一点：

 from bs4 import BeautifulSoup
xml = requests.get("http://www.realclearpolitics.com/epolls/2010/governor/2010_elections_governor_map.html").text
def find_governor_races(html):
    soup = BeautifulSoup(html, 'html.parser')
    pattern = "http://www.realclearpolitics.com/epolls/????/governor/??/*-*.html"

    links = []

    for option in soup.find_all('option'):
        links.append(option['value'])

    matched_links = []

    for link in links:
        if fnmatch(link, pattern):
            matched_links.append(link)

    return matched_links

获取Beautiful Soup中的链接

1 个答案: