com.armatiek.infofuze.source.extractor
Class BinaryFileExtractor

java.lang.Object
  extended by com.armatiek.infofuze.source.extractor.FileExtractor
      extended by com.armatiek.infofuze.source.extractor.FileFileExtractor
          extended by com.armatiek.infofuze.source.extractor.BinaryFileExtractor
All Implemented Interfaces:
IOFileFilter

public class BinaryFileExtractor
extends FileFileExtractor

Class that represents the definition of a filesystem based source within infofuze-config.xml. The class implements IOFileFilter and thus can be used to determine if a specific FileIf is accepted by this extractor definition.

Author:
Maarten Kroon

Field Summary
 
Fields inherited from class com.armatiek.infofuze.source.extractor.FileExtractor
logger
 
Constructor Summary
BinaryFileExtractor(org.w3c.dom.Element configElem)
           
 
Method Summary
 boolean getExtractMetadata()
          Returns whether to provide the metadata of files to the XML stream.
 boolean getExtractText()
          Returns whether to provide the plain text of files to the XML stream.
 boolean getPreParse()
          Returns whether to check the well-formedness of the XML that is produced by the Apache Tika parser of files before that XML is appended to the XML stream.
 
Methods inherited from class com.armatiek.infofuze.source.extractor.FileFileExtractor
getCacheStream, getIncludeBinary
 
Methods inherited from class com.armatiek.infofuze.source.extractor.FileExtractor
accept
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BinaryFileExtractor

public BinaryFileExtractor(org.w3c.dom.Element configElem)
                    throws java.lang.Exception
Throws:
java.lang.Exception
Method Detail

getExtractMetadata

public boolean getExtractMetadata()
Returns whether to provide the metadata of files to the XML stream.


getExtractText

public boolean getExtractText()
Returns whether to provide the plain text of files to the XML stream.


getPreParse

public boolean getPreParse()
Returns whether to check the well-formedness of the XML that is produced by the Apache Tika parser of files before that XML is appended to the XML stream.