com.armatiek.infofuze.source.extractor
Class BinaryFileExtractor
java.lang.Object
  
com.armatiek.infofuze.source.extractor.FileExtractor
      
com.armatiek.infofuze.source.extractor.FileFileExtractor
          
com.armatiek.infofuze.source.extractor.BinaryFileExtractor
- All Implemented Interfaces: 
 - IOFileFilter
 
public class BinaryFileExtractor
- extends FileFileExtractor
 
Class that represents the  definition of a filesystem based
 source within infofuze-config.xml. The class implements IOFileFilter and thus
 can be used to determine if a specific FileIf is accepted by this
 extractor definition.
- Author:
 
  - Maarten Kroon
 
 
 
 
| 
Method Summary | 
 boolean | 
getExtractMetadata()
 
          Returns whether to provide the metadata of files to the XML stream. | 
 boolean | 
getExtractText()
 
          Returns whether to provide the plain text of files to the XML stream. | 
 boolean | 
getPreParse()
 
          Returns whether to check the well-formedness of the XML that is produced by
 the Apache Tika parser of files before that XML is appended to the XML
 stream. | 
 
 
 
| Methods inherited from class java.lang.Object | 
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait | 
 
BinaryFileExtractor
public BinaryFileExtractor(org.w3c.dom.Element configElem)
                    throws java.lang.Exception
- Throws:
 java.lang.Exception
getExtractMetadata
public boolean getExtractMetadata()
- Returns whether to provide the metadata of files to the XML stream.
 
 
getExtractText
public boolean getExtractText()
- Returns whether to provide the plain text of files to the XML stream.
 
 
getPreParse
public boolean getPreParse()
- Returns whether to check the well-formedness of the XML that is produced by
 the Apache Tika parser of files before that XML is appended to the XML
 stream.