|File Search||Catalog||Content Search|
Currently, libextractor supports the following formats: HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, MP3 (ID3v1 and ID3v2), OGG, WAV, EXIV2, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, REAL, RIFF (AVI), MPEG, QT and ASF.
Also, various additional MIME types are detected. It can also be used to compute hash functions (SHA-1, MD5, ripemd160).
This package contains the library and all plugins, except EXIV2, MPEG, OGG, OLE2 and thumbail - they are splittet out to libextractor-plugins.
libextractor ============ libextractor is a simple library for keyword extraction. libextractor does not support all formats but supports a simple plugging mechanism such that you can quickly add extractors for additional formats, even without recompiling libextractor. libextractor typically ships with a dozen helper-libraries that can be used to obtain keywords from common file-types. libex more»
Sat Nov 11 00:04:34 EET 2006 Added an NSF ( NES Sound Format ) plugin Tue Apr 18 14:44:37 PDT 2006 Added dictionaries for Finnish, French, Gaelic and Swedish (for printable extractors). Thu Mar 9 17:55:09 PST 2006 Word history extraction works (wordleaker). Thu Jul 14 22:30:45 CEST 2005 exiv2 works. Fri May 6 06:02:02 EST 2005 Added Python binding. Thu Mar 17 08:07:04 EST 2005 more»
libextractor (0.5.16-2) unstable; urgency=medium * Added -dev packages to depends of libextractor-dev (Closes: #400174). -- Daniel Baumann <email@example.com> Sat, 25 Nov 2006 17:25:00 +0100 libextractor (0.5.16-1) unstable; urgency=low * New upstream release. -- Daniel Baumann <firstname.lastname@example.org> Sun, 12 Nov 2006 09:42:00 +0100 libextractor (0.5.15-2) unstable; urgency=medium * F more»
Sat Nov 11 16:04:38 MST 2006 Fixed libltdl side-effect of loading libextractor; code now preserves more»
FIX: * check exiv2 memory consumption on very large files; also investigate 500kb (!) allocation/ more»
Core Team: Vidyut Samanta <email@example.com> Christian Grothoff <firstname.lastname@example.org> Formats: h more»
This package was first debianized by Glenn McGrath <email@example.com> on Wed, 5 Feb 2003 13:28:56 +1 more»