|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
| Class Summary | |
|---|---|
| CrawlState | Class containing state information about a crawling process. |
| HTMLLinkInfo | Contains information about all links that were found in an HTML page. |
| HTMLParser | |
| HTTPFile | Class representing a file that can be retrieved using the HTTP. |
| URLNormalizer | Class that can be used to normalize/canonicalize URLs so that the can be compared. |
| WebCrawlReader | Reader implementation that provides an XML representation of the (filtered) contents of (a) website(s) providing one or seed URLs. |
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||