com.armatiek.infofuze.stream.filesystem.webcrawl
Class URLNormalizer

java.lang.Object
  extended by com.armatiek.infofuze.stream.filesystem.webcrawl.URLNormalizer

public class URLNormalizer
extends java.lang.Object

Class that can be used to normalize/canonicalize URLs so that the can be compared. For background see: http://en.wikipedia.org/wiki/URL_normalization

Author:
Maarten Kroon

Constructor Summary
URLNormalizer()
           
 
Method Summary
static java.lang.String normalize(java.lang.String url)
           
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

URLNormalizer

public URLNormalizer()
Method Detail

normalize

public static java.lang.String normalize(java.lang.String url)
                                  throws java.net.URISyntaxException,
                                         org.apache.commons.httpclient.URIException
Throws:
java.net.URISyntaxException
org.apache.commons.httpclient.URIException