|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object com.armatiek.infofuze.stream.filesystem.webcrawl.CrawlState
public class CrawlState
Class containing state information about a crawling process.
Constructor Summary | |
---|---|
CrawlState(java.lang.String seedURI,
int maxDepth,
int wait,
boolean followImages,
boolean followScripts,
boolean followLinks)
|
Method Summary | |
---|---|
boolean |
getFollowImages()
Returns whether or not to follow links defined in the src attribute of img tags. |
boolean |
getFollowLinks()
Returns whether or not to follow links defined in the href attribute of link tags. |
boolean |
getFollowScripts()
Returns whether or not to follow links defined in the src attribute of script tags. |
int |
getMaxDepth()
Returns the maximum depth to crawl. |
java.lang.String |
getSeedURI()
´ Returns the seed uri of the current crawl. |
java.util.Set<java.lang.String> |
getVisitedURLSet()
Returns the set of visited URLs. |
int |
getWait()
Returns ths number of milliseconds to wait after each request. |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CrawlState(java.lang.String seedURI, int maxDepth, int wait, boolean followImages, boolean followScripts, boolean followLinks) throws org.apache.commons.httpclient.URIException, java.net.URISyntaxException
org.apache.commons.httpclient.URIException
java.net.URISyntaxException
Method Detail |
---|
public boolean getFollowImages()
public boolean getFollowScripts()
public boolean getFollowLinks()
public int getMaxDepth()
public java.lang.String getSeedURI()
public java.util.Set<java.lang.String> getVisitedURLSet()
public int getWait()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |