]>
Commit | Line | Data |
---|---|---|
997358a6 MW |
1 | .TH HTML2FOUR 1 "August 1999" |
2 | .\" RCSID $Id: html2four.1,v 1.1 2004/03/15 20:35:24 as Exp $ | |
3 | .SH NAME | |
4 | html2four - extract headers from HTML files into four-field lines | |
5 | .SH SYNOPSIS | |
6 | .B html2four | |
7 | [-digit] file* | |
8 | command [ argument ...] | |
9 | .SH DESCRIPTION | |
10 | .I html2four | |
11 | extracts information from HTML files and writes it out with four | |
12 | tab-separated fields: filename, last label (<a name=> tag) seen, | |
13 | header tag type (H[0-9]), and header text. This is an intermediate | |
14 | format convenient for generating a permuted index with four2perm(1) | |
15 | or a table of contents with a simple awkscript. | |
16 | ||
17 | The only option is a digit to limit the header levels extracted. | |
18 | For example, with -3 only h1, h2, h3 tags are taken. By default, | |
19 | it takes h[0-9], though HTML only defines levels 1 to 6. | |
20 | .SH SEE ALSO | |
21 | .hy 0 | |
22 | four2perm(1) | |
23 | .SH HISTORY | |
24 | Written for the Linux FreeS/WAN project | |
25 | <http://www.xs4all.nl/~freeswan/> | |
26 | by Sandy Harris. |