C# 正则表达式抓取

来源:百度知道 编辑:UC知道 时间:2024/06/15 03:14:47
<td><a href="pages/cicads.html" onmouseover="MM_swapImage('img2','','images/red_arrow_new.jpg',1)" onmouseout="MM_swapImgRestore()" class="mainLinks">Concise International Chemical Assessment Documents (CICADs)</a></td>
我要把pages/cicads.html和后面的mainLinks">Concise International Chemical Assessment Documents (CICADs)抓取出来

是用JS还是C#?

一:(?:\<a)(?:.*?)(?:\>)(?<url>.*?)(?=\</a\>)" ;
二:(?:\<a.*?href\s*=)(?:[\s""']*)(?!#|mailto|location.|javascript|.*css|.*this\.)(?<url>.*?)(?:[\s>""'])(?:.*?)(?=\</a\>)

string str = "<td><a href=\"pages/cicads.html\" onmouseover=\"MM_swapImage('img2','','images/red_arrow_new.jpg',1)\" onmouseout=\"MM_swapImgRestore()\" class=\"mainLinks\">Concise International Chemical Assessment Documents (CICADs)</a></td>";

string pattern = "<a\\s+href\\s*=\\s*\"(.+?)\".+?class\\s*=\\s*\"(.+?)</a>";
System.Text.RegularExpressions.Regex reg = new System.Text.RegularExpressions.Regex(pattern, System.Text