Infofuze

Infofuze is a Java library and server application that can be used to transform and combine XML data from various sources into a specific XML or other text output format that can be stored or indexed.
These transformations can be fully configured and scheduled. Infofuze is based on the XML transformation interface of the Java API for XML Processing (JAXP), which is bundled with standard Java (J2SE).
Infofuze is written in 100% pure Java and runs on any system for which a Java 1.6 Virtual Machine is available. It is licensed under the LGPL v3.
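
To give an idea of the JAXP transformation interface that Infofuze builds on, here is a minimal sketch that uses only the standard javax.xml.transform classes. It does not use or represent the Infofuze API. The class name JaxpTransformSketch, the <record> input element and the stylesheet are made up for illustration; the output merely resembles a Solr-style document.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Plain JAXP example (assumption: this is NOT Infofuze code, only the
// underlying JDK interface that the text above refers to).
class JaxpTransformSketch {

    // Hypothetical stylesheet: maps a made-up <record> element
    // to a Solr-style <doc> with a single "title" field.
    static final String XSLT =
        "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:output method='xml' omit-xml-declaration='yes'/>"
      + "<xsl:template match='/record'>"
      + "<doc><field name='title'><xsl:value-of select='title'/></field></doc>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Applies the stylesheet to the given XML string and returns the result as text.
    static String transform(String xml) throws TransformerException {
        TransformerFactory factory = TransformerFactory.newInstance();
        Transformer transformer =
            factory.newTransformer(new StreamSource(new StringReader(XSLT)));
        StringWriter out = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xml)),
                              new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws TransformerException {
        // Prints the transformed document for a small sample input.
        System.out.println(transform("<record><title>Hello</title></record>"));
    }
}
```

The Source and Result abstractions are what make this interface a natural base for a tool like Infofuze: the same Transformer can read from and write to streams, DOM trees or SAX events, so the origin of the input and the destination of the output can vary independently of the stylesheet.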

Sources include:

  • Relational databases
  • Local filesystems
  • Common Internet File System (CIFS/SMB): Microsoft Windows Network and Unix based systems running Samba
  • Websites (crawled)
  • WebDAV enabled webservers
  • FTP servers
  • Webservices

Targets include:

  • Apache Solr
  • Relational databases
  • Local files
  • Webservices

For file-based sources, an XML representation of the following file formats can be transformed:
XML, (X)HTML, JSON, CSV (fully structured), all Microsoft Office (Word, Excel, PowerPoint) and OpenOffice formats, Visio, PDF, RTF and many more.

Read more in the Infofuze Technical Overview.

Releases

Infofuze 0.9 (requires JDK 1.6)

There are no binary releases of Infofuze 0.9 yet, but you can build it from the sources.

View the JavaDoc API documents

Support

The Infofuze mailing lists act as the main support forum. The user list is suitable for most library usage queries. The develop list is intended for development discussion.

Issues may be reported via the SourceForge Infofuze Issue Tracker. Please read the instructions carefully to submit a useful bug report or enhancement request.