|File Search||Catalog||Content Search|
Currently, libextractor supports the following formats: HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, MKV, MP3 (ID3v1 and ID3v2), OGG, WAV, EXIV2, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, REAL, RIFF (AVI), MPEG, QT and ASF.
Also, various additional MIME types are detected. It can also be used to compute hash functions (SHA-1, MD5, ripemd160).
This package contains the library and all plugins, except EXIV2, MPEG, OGG, OLE2 and thumbail - they are splittet out to libextractor-plugins.
libextractor ============ libextractor is a simple library for keyword extraction. libextractor does not support all formats but supports a simple plugging mechanism such that you can quickly add extractors for additional formats, even without recompiling libextractor. libextractor typically ships with a few dozen helper-libraries (plugins) that can be used to obtain keywords from common file-t more»
Mon Nov 28 12:17:42 CET 2011 Added MKV (Matroska) plugin. Tue Aug 12 04:40:49 EEST 2008 Added an S3M (Scream Tracker 3 Module) plugin. Added an XM (eXtended Module) plugin. Added an IT (Impulse Tracker) plugin. Mon Jun 23 19:05:07 EET 2008 Fixed concurrency issues in plugin (un-)loading by adding locking around libltdl functions. Fri Jun 20 23:34:02 EET 2008 Added an FFmpeg-based thu more»
libextractor (1:0.6.3-5) unstable; urgency=low * debian/control: add Vcs-Git and Vcs-browser fields. * Switch the libpoppler-dev build dependency to libpoppler-private-dev (Closes: #673304). * Include a patch to load plugins from path with underscore, thanks to Harun Trefry (Closes: #675063). -- Bertrand Marc <email@example.com> Sat, 23 Jun 2012 18:45:49 +0200 libextractor (1: more»
Mon Nov 28 12:17:42 CET 2011 Fixing compiler warnings, cleaning up ASF plugin. Finishing Matroska more»
* ffmpeg needs make 3.81: add configure check for it Core: * port test cases * support "hash" plugi more»
Core Team: Christian Grothoff <firstname.lastname@example.org> Nils Durner <email@example.com> Formats: htm more»
Files: * Copyright: (C) 2002-2011 Christian Grothoff <firstname.lastname@example.org> (C) 2002-2005 Vidyut more»