Trisha Shetty (Editor)

HTMLDOC

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Developer(s)
  
Michael R Sweet

Development status
  
Active

Preview release
  
NA / NA

Written in
  
C, C++

Stable release
  
1.8.29 / January 3, 2016; 13 months ago (2016-01-03)

Operating system
  
Windows 2000+, Mac OS X v10.4+, Linux 2.4+, Solaris 8+

HTMLDOC is a previously commercially developed open-source program that converts HTML web pages and files to indexed HTML, PostScript, and PDF files, complete with a table-of-contents and (in the current development snapshot) index. HTMLDOC can be used from the command-line, a simple GUI, or from a web server. Development originally occurred through the author's now-defunct business, Easy Software Products, and now continues on the author's personal web site.

Contents

Features and limitations

HTMLDOC 1.8.x supports most of HTML 3.2 with some elements of HTML 4.01, with no support for CSS. HTMLDOC 1.9.x will support most of HTML 4.01 and CSS1, with some support for selected CSS2.x properties.

HTMLDOC 1.8.x supports the following character sets: Windows-874, Windows-1250, Windows-1251, Windows-1252, Windows-1253, Windows-1255, Windows-1256, Windows-1257, Windows-1258, ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5, ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-14, ISO-8859-15, KOI8-R; you cannot mix characters from different code pages. There is no support for CJK and Arabic characters, and support for ISO-8859-13 is missing. HTMLDOC 1.9.x improves the character set support and includes limited support for UTF-8.

License and availability

Licensed under the terms of the GNU General Public License version 2. It is legal to compile the sources and distribute the program, and various versions can be found on the Internet. For example, HTMLDOC is included as part of the Debian operating systems.

References

HTMLDOC Wikipedia