我无法编辑我的代码,因此您可以删除此帖子
答案 0 :(得分:1)
看看这个lib http:HTML Agility Pack 它可以帮助您解析HTML。
答案 1 :(得分:1)
HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
doc.LoadHtml(html);
var teams = doc.DocumentNode.SelectNodes("//td[@width='313']")
.Select(td => new TeamClass
{
TeamName = td.Element("a").InnerText,
TeamId = HttpUtility.ParseQueryString(td.Element("a").Attributes["href"].Value)["ItemTypeID"]
})
.ToList();
答案 2 :(得分:0)
您可以使用Regular expression
String html; //your html string
String pattern = @"action=ViewItemDetails&ItemType[I|i]D=(\d*)"">(.*)</a>";
MatchCollection matches = Regex.Matches(html, pattern);
var list = new List<TeamClass>();
foreach (Match match in matches)
{
TeamClass team = new TeamClass();
team.TeamName = match.Groups[2].Value;
team.TeamId = Int32.Parse(match.Groups[1].Value);
list.Add(team);
}
答案 3 :(得分:0)
尝试Html Agility:
尝试类似(未经测试的代码):var TeamList = from lnks in document.DocumentNode.Descendants()
where lnks.Name == "a" &&
lnks.Attributes["href"] != null &&
lnks.InnerText.Trim().Length > 0
select new
{
TeamId= (lnks.Attributes["href"].Value).
Substring((lnks.Attributes["href"].Value).Length-1, 1),
TeamName= lnks.InnerText
};