html2text reads HTML documents supplied on the command line (or from standard input), converts each of them into a stream of plain text characters, and writes the result to a file or the terminal.
The Debian version can also recognize the encoding of documents and recode input and output on the fly.
html2text was written because the author wasn't happy with the output of "lynx -dump" and wanted something better.
--------------------------------
Some notes about using html2text
--------------------------------

1. HTTP support

The original html2text doesn't support any complicated HTTP queries and answers. The Debian version of html2text doesn't provide HTTP support at all. However, you can easily work around this by using the curl or wget packages:

    curl -s http://www.server.org/aaa/bbb/ccc.html | html2text
    wge more»
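The curl pipeline from the notes above can be wrapped in a small shell function so that fetching and converting is one step. This is only a sketch: the function names `to_text` and `fetch_as_text` are hypothetical and not part of html2text, and the `sed` fallback is a crude tag stripper added for illustration, not an equivalent of html2text's rendering. It relies only on html2text reading HTML from standard input, as described above.

```shell
# to_text (hypothetical helper): convert HTML on stdin to plain text on stdout.
# Uses html2text when it is installed; otherwise falls back to a crude
# tag-stripping sed expression (an assumption for illustration only).
to_text() {
    if command -v html2text >/dev/null 2>&1; then
        html2text
    else
        sed -e 's/<[^>]*>//g'
    fi
}

# fetch_as_text (hypothetical helper): download a URL with curl and pipe the
# HTML into to_text, since the Debian html2text has no HTTP support itself.
# -s silences the progress meter, -L follows redirects.
fetch_as_text() {
    curl -sL "$1" | to_text
}
```

A call such as `fetch_as_text http://www.server.org/aaa/bbb/ccc.html` would then print the converted page, assuming curl is installed and the URL is reachable.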
## This is the README file for html2text Wed Jan 14 14:35:57 CET 2004 ##
===========================================================================

html2text is a command line utility, written in C++, that converts HTML documents into plain text. It was written up to version 1.2.2 for and is copyrighted by GMRS Software GmbH, Unterschleißheim.

html2text reads HTML documents from standa more»
html2text (1.3.2a-10) experimental; urgency=low

  Added support for output recoding, see README.Debian, section 'Output recoding'.
  Due to UTF-8 issues, backspaces in output are disabled now. See README.Debian, section 'Backspaces' for more info.

 -- Eugene V. Lyubimkin <email@example.com> Sat, 20 Sep 2008 00:52:42 +0300

html2text (1.3.2a-9) experimental; urgency=low

  Ad more»
html2text(1) html2text(1) NAME html2text - an a more»
html2textrc(5) html2textrc(5) NAME html2textrc - fo more»
html2text (1.3.2a-14build1) lucid; urgency=low * rebuild rest of main for armel armv7/thumb2 opti more»
## CHANGES - What has changed ? Wed Jan 14 14:47:26 CET 2004 ## ================== more»
## TODO - What has to be done ? Mon Jan 12 14:19:08 CET 2004 ## ================== more»