An Elementary HTML Parser 
    http://www.codeproject.com/csharp/htmlparser.asp

    WebWagon – an HTML Container Class 
    http://www.vsj.co.uk/dotnet/display.asp?id=389

    Regex 
    http://www.regexlib.com/Search.aspx

    Parsing html markup text using MSHTML 
    http://www.eggheadcafe.com/articles/parsinghtml.asp

    Weather Screen Scraping with C# 
    http://www.csharphelp.com/archives2/archive435.html

    Searcharoo Too : Populating the Search Catalog with a C# Spider 
    http://www.users.bigpond.com/conceptdevelopment/Search/SearcharooV2/

    SearchEngine-A .NET crawler, indexer and search engine software from Active Data Online 
    http://www.activedataonline.com.au/searchengine.html

    .NET Html Agility Pack: How to use malformed HTML just like it was well-formed XML... 
    http://blogs.msdn.com/smourier/archive/2003/06/04/8265.aspx

    Identifying, Structuring and Searching HTML Objects 
    http://www.nada.kth.se/utbildning/grukth/exjobb/rapportlistor/2005/rapporter05/embreus_ivar_05043.pdf

    Working with text files in Microsoft .NET 
    http://www.automatedqa.com/techpapers/textfiles_net.asp

    Parsing HTML in Microsoft C# 
    http://www.developer.com/net/csharp/article.php/10918_2230091_2

    C#/VB - Automated WebSpider / WebRobot 
    http://www.codeproject.com/csharp/DavWebSpider.asp

相关文章: