iSeeker logo

Abstract

Project Plan

Ideas

  • web-communities
  • live update
  • HTML parser
  • adaptable GUI
  • distributed SE

    Research

    Download

    Source Code

    Misc

    Contact


    Home

  • iSeeker: Project Ideas
    [HTML parser]

    After completing the first prototype, where the application just gets an HTML page from the internet, I had to come up with a way of extracting the information that I need from that HTML.
    One of the main goals of the application is to find keywords in the page and in which situation (eg. within a meta tag, header, link, 'ALT' for a picture, etc...)
    Therefore I need to build an HTML Parser class which the application calls every time it gets a new page. The class accepts raw HTML in the form of a CString and returns an array with each bit of text marked with whatever tags it was within.
    The application will then use this array to calculate a score for the page (perhaps making use of another class: evalHTML).

    The HTML Parser will be structured in the following way: