Downloadliste

Projektbeschreibung

Texterize is a text and metadata extraction tool and library which can be used to quickly get the text content of a file. It currently supports file formats like PDF, Excel, Powerpoint, Word, RTF, WordPerfect, MP3, Ogg, and all OpenDocument file formats. The output of texterize is either text or XML. It is also designed to work with Unicode input and output, and the default output character set is UTF-8. Texterize also has a recursive mode so that whole directories (or whole filesystems) can be converted to text. This recursion also works through archive files and compressed files like zip, tar, and gz files.

Systemanforderungen

Die Systemvoraussetzungen sind nicht definiert
Information regarding Project Releases and Project Resources. Note that the information here is a quote from Freecode.com page, and the downloads themselves may not be hosted on OSDN.

2008-02-03 13:54 Zurück zur Release-Liste
0.1.2

Viele Abstürze durch Fuzzing gefunden wurden behoben. Einige wichtige PDF Bugs wurden gefixt (inkl. Font-Parser Fehler in 0.1.1 eingeführt). Das configure-Skript wurde verbessert (nicht mehr gezwungen CFLAGS).
Tags: Major bugfixes
Many crashes found through fuzzing were fixed.
Some major PDF bugs were fixed (including a font
parser bug introduced in 0.1.1). The configure
script was improved (no more forced CFLAGS).

Project Resources