
Abstract
Project Plan
Ideas
web-communities
live update
HTML parser
adaptable GUI
distributed SE
Research
Download
Source Code
Misc
Contact
Home
|
iSeeker: Project Ideas
[HTML parser]
After completing the first prototype, where the application just gets an HTML page from the internet, I had to come up with a way of extracting the information that I need from that HTML.
One of the main goals of the application is to find keywords in the page and in which situation (eg. within a meta tag, header, link, 'ALT' for a picture, etc...)
Therefore I need to build an HTML Parser class which the application calls every time it gets a new page. The class accepts raw HTML in the form of a CString and returns an array with each bit of text marked with whatever tags it was within.
The application will then use this array to calculate a score for the page (perhaps making use of another class: evalHTML).
The HTML Parser will be structured in the following way:
|
|