|File Search||Catalog||Content Search|
Currently, libextractor supports the following formats: HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, MP3 (ID3v1 and ID3v2), OGG, WAV, EXIV2, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, REAL, RIFF (AVI), MPEG, QT and ASF.
Also, various additional MIME types are detected. It can also be used to compute hash functions (SHA-1, MD5, ripemd160).
This package contains the library and all plugins, except EXIV2, MPEG, OGG, OLE2 and thumbail - they are splittet out to libextractor-plugins.
libextractor ============ libextractor is a simple library for keyword extraction. libextractor does not support all formats but supports a simple plugging mechanism such that you can quickly add extractors for additional formats, even without recompiling libextractor. libextractor typically ships with a dozen helper-libraries that can be used to obtain keywords from common file-types. libextr more»
Tue Aug 12 04:40:49 EEST 2008 Added an S3M (Scream Tracker 3 Module) plugin. Added an XM (eXtended Module) plugin. Added an IT (Impulse Tracker) plugin. Mon Jun 23 19:05:07 EET 2008 Fixed concurrency issues in plugin (un-)loading by adding locking around libltdl functions. Fri Jun 20 23:34:02 EET 2008 Added an FFmpeg-based thumbnail extractor plugin, initially supporting only bmp and png more»
libextractor (1:0.5.23+dfsg-7+b2) unstable; urgency=low * Binary-only non-maintainer upload for amd64; no source changes. * Rebuild against libexiv2-9 -- amd64 Build Daemon (brahms) <firstname.lastname@example.org> Fri, 09 Jul 2010 20:38:28 +0000 libextractor (1:0.5.23+dfsg-7) unstable; urgency=low * Adding patch from Ralph Siemsen <email@example.com> to fix proc parsing more»
Sat Jul 4 23:05:22 CEST 2009 Fixed code to work with RPM 4.7. Releasing libextractor 0.5.23. Sat more»
FIX: * check exiv2 memory consumption on very large files; also investigate 500kb (!) allocation/l more»
Core Team: Vidyut Samanta <firstname.lastname@example.org> Christian Grothoff <email@example.com> Formats: h more»
Upstream-Contact: Christian Grothoff <firstname.lastname@example.org> Upstream-Homepage: http://www.gnunet.o more»