"Fossies" - the Fresh Open Source Software Archive

Member "html2sgml-0.3/doc/html2sgml.txt" (26 Aug 1997, 5732 Bytes) of package /linux/misc/old/html2sgml-0.3.tar.gz:


As a special service "Fossies" has tried to format the requested text file into HTML format (style: standard) with prefixed line numbers. Alternatively you can here view or download the uninterpreted source code file.

    1   html2sgml documentation
    2   Peter Antman
    3   Tue Aug 26 12:26:46 MET DST 1997
    4 
    5   11..  RREEAADDMMEE
    6 
    7   html2sgml is a program wich converts html to sgml accroding to
    8   linuxdoc.sgml.  With a file in linuxdoc.dtd format you can create nice
    9   typset books, well structured html-documents and so forth.
   10   linuxdoc.dtd is the format used in Linux HOWTOS, for example.
   11 
   12   html2sgml i tuned to work well with Applix HTML, and will convert any
   13   footnotes apearent in the applix word-file that was used to produce
   14   the html.
   15 
   16   To use html2sgml you need Perl. To use the image converting routines
   17   you also need: giftopnm, ppmtopgm and pnmtops
   18 
   19   To do something usefull with the resulting file you also need
   20   linuxdoc-sgml or the follow up sgml-tools
   21   http://www.xs4all.nl/~cg/sgmltools/.
   22 
   23   11..11..  GGeettttiinngg hhttmmll22ssggmmll
   24 
   25   The homepage of html2sgml is
   26   http://www.abc.se/~m9339/prog/html2sgml.html It is possible to ftp it
   27   from ftp://ftp.mc.hik.se/pub/users/mia95anp/html2sgml/.  It has also
   28   been upploded to ftp://ftp.redhat.com and
   29   ftp://sunsite.unc.edu/pub/linux/.
   30 
   31   11..22..  IInnssttaallllaattiioonn
   32 
   33   To install html2sgml unpack the tarfile and cd into the disrtibution.
   34   Type
   35 
   36   make install
   37 
   38   It will install the programs html2sgml and mkbook, som files in the
   39   specified documentdirectory, including _e_x_t_r_a_s a couple of script that
   40   shows some examples of how you can merg severall html-files into one
   41   to use with html2sgml. A manual page will be installed too.
   42 
   43   Edit the makefile to change where to install and where Perl is on your
   44   system.  The default is /usr/bin/perl and prefix = /usr/local
   45 
   46   11..33..  UUssaaggee
   47 
   48   See the manual page
   49 
   50   22..  MMaannuuaall ppaaggee
   51 
   52   22..11..  NNAAMMEE
   53 
   54   html2sgml -- convert html to sgml according to linuxdoc.dtd
   55 
   56   22..22..  SSYYNNOOPPSSIISS
   57 
   58   html2sgml _f_i_l_e_._h_t_m_l
   59 
   60   22..33..  DDEESSCCRRIIPPTTIIOONN
   61 
   62   _h_t_m_l_2_s_g_m_l is a fileconverter that converts html-files to sgml-files
   63   according to linuxdoc.dtd. It will ouput a file with the same name as
   64   the specified file but with the ending html changed to sgml.
   65 
   66   It will not work on every html-file because of the free format of
   67   html. It is tuned to work well with html produced from _A_p_p_l_i_x _H_T_M_L_-
   68   _e_d_i_t_o_r. If it finds a applix word file in the same directory and with
   69   the same name as the specified file, it will include any _f_o_o_t_n_o_t_e_s
   70   from the aw-file in the produced sgml-file.
   71 
   72   _h_t_m_l_2_s_g_m_l will also try to convert all included images of type gif to
   73   postscript.
   74 
   75   By default html2sgml produces a ducument of type _a_r_t_i_c_l_e. To change to
   76   _b_o_o_k you can use the script _m_k_b_o_o_k. It also fills in a dummy name. If
   77   there is a title tag in the html-file it will use that as a title for
   78   the sgml-file. To change this you have to hand edit the sgml-file.
   79 
   80   If there are more than one _H_1 tag these are used as the toplevel
   81   section.  Everything marked H1 will become a _s_e_c_t in sgml, and _H_2 will
   82   become sect1 and so forth. If there is only one ore no H1, H2 will be
   83   used instead. If there is no H* tags, then the document i broken by
   84   design :-)
   85 
   86   The resulting sgml-file can then be used by _s_g_m_l_-_t_o_o_l_s _(_w_a_s _l_i_n_u_x_d_o_c_-
   87   _s_g_m_l_) to produce various new fileformats, eg latex, info, rtf.
   88 
   89   22..44..  TTIIPPSS
   90 
   91   _h_t_m_l_2_s_g_m_l should work fine with straight html, that is, when no
   92   special layout formating has been done. For example: it can handle
   93   html table tags, but it can not handle them well if they are used to
   94   produce layout.
   95 
   96   The best working thing is to use it with _A_p_p_l_i_x _h_t_m_l. You can both
   97   write directly in Applix Word or import document to applix word. Try
   98   to use predefined styles for your document. You can create heading1,
   99   heading2, pre, quote and so forth. Open Applix HTML and use
  100   _F_i_l_e_-_>_I_m_p_o_r_t _w_o_r_d_s _d_o_c_u_m_e_n_t. You will then get the chance to tell
  101   Applix wich html-tags your defined styles should match, eg heading1 ->
  102   html_h1. Then use _F_o_r_m_a_t _-_> _H_T_M_L _d_o_c_u_m_e_n_t _s_e_t_t_i_n_g where you can fill
  103   in the title; here you can also fill in the alternative to export
  104   Applix images as gif files. This is good to do because html2sgml can
  105   convert the gif files to ps-files wich can be used when/if converting
  106   to latex.
  107 
  108   22..55..  BBUUGGSS AANNDD FFEEAATTUURREESS
  109 
  110   _h_t_m_l_2_s_g_m_l is still under development and will most probably contain
  111   bugs. It also contain som features. All possible HTML and sgml tags
  112   are not implemented. Unimplemented HTML tags will show up in the sgml
  113   file where you have to hand edit them away. Some tags in sgml are also
  114   unsupported. More specific: no math tags is implemented. You can check
  115   the resulting sgml file with the command _s_g_m_l_c_h_e_c_k to discover any
  116   leftover tags.
  117 
  118   I have concentrated on making it work in english and in swedish. This
  119   means that there are a lot of characters that probably not will work
  120   OK, specialy when converting Applix footnotes. Look in the source and
  121   try to put in the missing characters if you have any problems. And
  122   pleas send the new improved version to mee.
  123 
  124   22..66..  AAUUTTHHOORR
  125 
  126   Peter Antman (peter.antman@abc.se)
  127 
  128   22..77..  SSEEEE AALLSSOO
  129 
  130   sgml2latex(1), sgml2html(1), sgml2txt(1), sgml2info(1), sgml2rtf,
  131   sgml2lyx(1)
  132