Filewatcher File Search File Search
Catalog
Content Search
» » » » »

HTML-ExtractMain-0.60.tar.gz

Homepage:-
Package version:-
Architecture:-
Distribution:Perl-CPAN
Filename:HTML-ExtractMain-0.60.tar.gz

/HTML-ExtractMain-0.60/README

HTML::ExtractMain

HTML::ExtractMain takes HTML content, and extracts the HTML section
representing the main body of the page, skipping headers, footers,
navigation, etc.

HTML::ExtractMain's Readability algorithm is ported from Arc90's
JavaScript-based Readability application, online at
http://lab.arc90.com/experiments/readability/

INSTALLATION

To install this module, run the following commands
more»

/HTML-ExtractMain-0.60/Changes

Revision history for HTML-ExtractMain

0.60    December 13, 2010
        Fixed a memory thanks to David Guthrie; documentation fixes

0.50    April 15, 2010
        Added tests of real content, details to documentation

0.10    August 1, 2009
        First version


Browse inside HTML-ExtractMain-0.60.tar.gz

         [DIR]HTML-ExtractMain-0.60/ (10)

Download HTML-ExtractMain-0.60.tar.gz

Results 1 - 1 of 1
Help - FTP Sites List - Software Dir.
Search over 15 billion files
© 1997-2016 FileWatcher.com