Html Agility Pack: extra <a> tag

Time: 2013-02-02 15:14:55

Tags: html-agility-pack

The extra <A> in the markup below causes SelectNodes() to return too many elements. How can I get rid of the extra tag?

<DIV align=center><STRONG><A><A class=white 
      href="javascript: event_info = openWin('/events/search/index_results.cfm?action=plan&amp;event_number=2013292001&amp;cde_comp_group=CONF&amp;cde_comp_type=&amp;NEW_END_DATE1>=&amp;key_stkhldr_event=&amp;mixed_breed=N', 'eventinfo', 'width=800,height=600,toolbar=1,location=0>,directories=0,status=0,menuBar=0,scrollBars=1,resizable=1' ); event_info.focus()"><STRONG>Labrador 
      Retriever Club of the Piedmont</STRONG></A> </STRONG></DIV>

1 answer:

Answer 0 (score: 1)

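You can select only those <a> tags that have, for example, the href attribute set. The extra empty <A> has no href, so it is skipped: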

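using System.Linq;
using HtmlAgilityPack;

var doc = new HtmlDocument();
doc.LoadHtml(html);
// SelectNodes returns null when nothing matches, so fall back to an empty sequence.
var anchors = (doc.DocumentNode.SelectNodes("//a[@href]")
               ?? Enumerable.Empty<HtmlNode>())
    .ToList();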

foreach (var anchor in anchors)
{
    // Process each anchor here, e.g. read anchor.GetAttributeValue("href", "").
}