com.armatiek.infofuze.source.extractor
Class BinaryFileExtractor
java.lang.Object
com.armatiek.infofuze.source.extractor.FileExtractor
com.armatiek.infofuze.source.extractor.FileFileExtractor
com.armatiek.infofuze.source.extractor.BinaryFileExtractor
- All Implemented Interfaces:
- IOFileFilter
public class BinaryFileExtractor
- extends FileFileExtractor
Class that represents the definition of a filesystem based
source within infofuze-config.xml. The class implements IOFileFilter and thus
can be used to determine if a specific FileIf is accepted by this
extractor definition.
- Author:
- Maarten Kroon
Method Summary |
boolean |
getExtractMetadata()
Returns whether to provide the metadata of files to the XML stream. |
boolean |
getExtractText()
Returns whether to provide the plain text of files to the XML stream. |
boolean |
getPreParse()
Returns whether to check the well-formedness of the XML that is produced by
the Apache Tika parser of files before that XML is appended to the XML
stream. |
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
BinaryFileExtractor
public BinaryFileExtractor(org.w3c.dom.Element configElem)
throws java.lang.Exception
- Throws:
java.lang.Exception
getExtractMetadata
public boolean getExtractMetadata()
- Returns whether to provide the metadata of files to the XML stream.
getExtractText
public boolean getExtractText()
- Returns whether to provide the plain text of files to the XML stream.
getPreParse
public boolean getPreParse()
- Returns whether to check the well-formedness of the XML that is produced by
the Apache Tika parser of files before that XML is appended to the XML
stream.