.\" @(#)unhtml.1 1.16 90/02/15 SMI; from Linux 1.2.8 and up
.TH UNHTML 1 "3 February 1998"
.SH NAME
unhtml \- strip the HTML formatting from a document or the standard input
stream and display it to the standard output
.SH SYNOPSIS
.B unhtml
.B \-version
| [
.I filename
]
.LP
.SH DESCRIPTION
.LP
Parses text read from the standard input, or a file if a file name is
supplied, and removes any HTML formatting it finds. Prints the
resulting cleansed text to the standard output for easy redirection.
The version included with this man page has been improved to handle
comments and scripts.
.LP
.SH OPTIONS
.TP
.B \-version
Version.
.B unhtml
will display its version and exit.
.SH EXAMPLES
.LP
This example simply scans a file called "index.html" and prints the file to
the standard output with the HTML formatting removed. The standard output
is redirected to a file called "index.txt" which, after running, will
contain the plain text of the .html file.
.LP
.RS
.ft B
.nf
example% unhtml index.html > index.txt
.fi
.ft R
.LP
.SH BUGS
Currently, if the output is redirected to a file of the same name as the
input file, the result will be an empty file of the same name, but this is
really an idiosyncracy of the redirect operator, and cannot be corrected in
the program.
.SH DEVELOPMENT
This document is Copyright (C) 1998 by
Kevin Swan.