"Fossies" - the Fresh Open Source Software Archive
Member "html2sgml-0.3/doc/html2sgml.txt" (26 Aug 1997, 5732 Bytes) of package /linux/misc/old/html2sgml-0.3.tar.gz:
As a special service "Fossies" has tried to format the requested text file into HTML format (style: standard
) with prefixed line numbers.
Alternatively you can here view
the uninterpreted source code file.
1 html2sgml documentation
2 Peter Antman
3 Tue Aug 26 12:26:46 MET DST 1997
5 11.. RREEAADDMMEE
7 html2sgml is a program wich converts html to sgml accroding to
8 linuxdoc.sgml. With a file in linuxdoc.dtd format you can create nice
9 typset books, well structured html-documents and so forth.
10 linuxdoc.dtd is the format used in Linux HOWTOS, for example.
12 html2sgml i tuned to work well with Applix HTML, and will convert any
13 footnotes apearent in the applix word-file that was used to produce
14 the html.
16 To use html2sgml you need Perl. To use the image converting routines
17 you also need: giftopnm, ppmtopgm and pnmtops
19 To do something usefull with the resulting file you also need
20 linuxdoc-sgml or the follow up sgml-tools
23 11..11.. GGeettttiinngg hhttmmll22ssggmmll
25 The homepage of html2sgml is
26 http://www.abc.se/~m9339/prog/html2sgml.html It is possible to ftp it
27 from ftp://ftp.mc.hik.se/pub/users/mia95anp/html2sgml/. It has also
28 been upploded to ftp://ftp.redhat.com and
31 11..22.. IInnssttaallllaattiioonn
33 To install html2sgml unpack the tarfile and cd into the disrtibution.
36 make install
38 It will install the programs html2sgml and mkbook, som files in the
39 specified documentdirectory, including _e_x_t_r_a_s a couple of script that
40 shows some examples of how you can merg severall html-files into one
41 to use with html2sgml. A manual page will be installed too.
43 Edit the makefile to change where to install and where Perl is on your
44 system. The default is /usr/bin/perl and prefix = /usr/local
46 11..33.. UUssaaggee
48 See the manual page
50 22.. MMaannuuaall ppaaggee
52 22..11.. NNAAMMEE
54 html2sgml -- convert html to sgml according to linuxdoc.dtd
56 22..22.. SSYYNNOOPPSSIISS
58 html2sgml _f_i_l_e_._h_t_m_l
60 22..33.. DDEESSCCRRIIPPTTIIOONN
62 _h_t_m_l_2_s_g_m_l is a fileconverter that converts html-files to sgml-files
63 according to linuxdoc.dtd. It will ouput a file with the same name as
64 the specified file but with the ending html changed to sgml.
66 It will not work on every html-file because of the free format of
67 html. It is tuned to work well with html produced from _A_p_p_l_i_x _H_T_M_L_-
68 _e_d_i_t_o_r. If it finds a applix word file in the same directory and with
69 the same name as the specified file, it will include any _f_o_o_t_n_o_t_e_s
70 from the aw-file in the produced sgml-file.
72 _h_t_m_l_2_s_g_m_l will also try to convert all included images of type gif to
75 By default html2sgml produces a ducument of type _a_r_t_i_c_l_e. To change to
76 _b_o_o_k you can use the script _m_k_b_o_o_k. It also fills in a dummy name. If
77 there is a title tag in the html-file it will use that as a title for
78 the sgml-file. To change this you have to hand edit the sgml-file.
80 If there are more than one _H_1 tag these are used as the toplevel
81 section. Everything marked H1 will become a _s_e_c_t in sgml, and _H_2 will
82 become sect1 and so forth. If there is only one ore no H1, H2 will be
83 used instead. If there is no H* tags, then the document i broken by
84 design :-)
86 The resulting sgml-file can then be used by _s_g_m_l_-_t_o_o_l_s _(_w_a_s _l_i_n_u_x_d_o_c_-
87 _s_g_m_l_) to produce various new fileformats, eg latex, info, rtf.
89 22..44.. TTIIPPSS
91 _h_t_m_l_2_s_g_m_l should work fine with straight html, that is, when no
92 special layout formating has been done. For example: it can handle
93 html table tags, but it can not handle them well if they are used to
94 produce layout.
96 The best working thing is to use it with _A_p_p_l_i_x _h_t_m_l. You can both
97 write directly in Applix Word or import document to applix word. Try
98 to use predefined styles for your document. You can create heading1,
99 heading2, pre, quote and so forth. Open Applix HTML and use
100 _F_i_l_e_-_>_I_m_p_o_r_t _w_o_r_d_s _d_o_c_u_m_e_n_t. You will then get the chance to tell
101 Applix wich html-tags your defined styles should match, eg heading1 ->
102 html_h1. Then use _F_o_r_m_a_t _-_> _H_T_M_L _d_o_c_u_m_e_n_t _s_e_t_t_i_n_g where you can fill
103 in the title; here you can also fill in the alternative to export
104 Applix images as gif files. This is good to do because html2sgml can
105 convert the gif files to ps-files wich can be used when/if converting
106 to latex.
108 22..55.. BBUUGGSS AANNDD FFEEAATTUURREESS
110 _h_t_m_l_2_s_g_m_l is still under development and will most probably contain
111 bugs. It also contain som features. All possible HTML and sgml tags
112 are not implemented. Unimplemented HTML tags will show up in the sgml
113 file where you have to hand edit them away. Some tags in sgml are also
114 unsupported. More specific: no math tags is implemented. You can check
115 the resulting sgml file with the command _s_g_m_l_c_h_e_c_k to discover any
116 leftover tags.
118 I have concentrated on making it work in english and in swedish. This
119 means that there are a lot of characters that probably not will work
120 OK, specialy when converting Applix footnotes. Look in the source and
121 try to put in the missing characters if you have any problems. And
122 pleas send the new improved version to mee.
124 22..66.. AAUUTTHHOORR
126 Peter Antman (firstname.lastname@example.org)
128 22..77.. SSEEEE AALLSSOO
130 sgml2latex(1), sgml2html(1), sgml2txt(1), sgml2info(1), sgml2rtf,