This is diac -- a suite of lex programs programs for converting Ascii
German into diacritical German.

 Version 1.0 written by 
  Dorai Sitaram, Rice University, 1990   dorai@titan.rice.edu
 
 Version 1.1:
  General rewrite, using some Material from 
    H.Kaeslin, Behandlung der Umlaute bei der Verarbeitung deutscher 
    Texte unter Unix, in: it, Vol 1, 1988
  and Duden - die Rechtschreibung.
  Gustaf Neumann, Wirtschaftsuniversitaet Wien, October 1990
  neumann@dec4.wu-wien.ac.at         neumann@awiwuw11.bitnet

 Version 1.2: 
  Enhanced rule-set (sounds like IBM), 
        Nov 27 1990  -- Gustaf Neumann
  
 Verson 1.3:
  Separate programs diacu (for umlaut characters) and diacs (sharp s)
  to achieve independence of the two rule sets.
  8-bit-clean lex (eg. flex 2.3.6) is needed!
  Sharp-s-rules debugged with the help of Helmut Feldweg
  <ACQUI@MPI.NL>. Vielen Dank, Helmut!
        Sun Dec  2 20:06:21 1990 -- Gustaf Neumann

 Verson 1.4:
  Numerous improvements in the rulesets for umlaut characters
  and sharp s based on running diac against various dictionaries
  and newspaper text. Many thanks to Andreas Eisele
  <andreas@ims.uni-stuttgart.de> for testing various prereleases

                   Mar 11, 1996 -- Gustaf Neumann
       


    --------------------------------------------------------

The distribution contains:

     a) the CORE programs:

	diact	-- a program that accepts a TeX (LaTeX) source file
		   and expands all \input and \include files (located
                   in the current directory) recursively. 
	diacu	-- a program that performs Umlaut conversion 
		   (eg. from ue to Umlaut-u).
	diacs	-- a program that performs sharp s conversion
		   (from ss to sharp-s).
        diac    -- a shell script containing "diact | diacu | diacs".


     b) Utility programs:

        umltex	-- translate ascii umlaut characters into
                   corresponding TeX macros.
        texuml	-- translate umlaut TeX macros into
                   corresponding ascii umlaut characters.
        lagl	-- a program to convert .ll-files (containing comments
                   etc.) into valid lex programs.

    --------------------------------------------------------



		   
Installation:
   0) prerequisite: 8-bit-clean lex (eg. flex 2.5.2)
   1) make
   2) diac < testwords | more
      a) if your machine uses the iso8859-1 ascii set, you will see
         the output containing umlaut characters, you are all set.
      b) otherwise you might use umltex to view the result
         diac < testwords | umltex | more
      c) if you have an 8-bit-clean editor, you might edit texuml.l
         and insert the umlaut characters of you favourite Ascii set.
         go back to step 1.

 To prevent the conversion from Ascii German to umlauts or sharp-s,
 empty groups may be inserted (such as in Ka{}eslin).  In several
 cases it is not decideable whether ss or the sharp s should be used.
 Such cases are:

 input      default translation   alternate meaning   achieved by    

 Masse      Ma{\ss}e              Masse               Mas{}se 
 Fuessen    F{\"u}{\ss}en         F{\"u}ssen          Fues{}sen
 Busse      Busse                 Bu{\ss}e            Bu{}sse
 Flosse     Flosse                Flo{\ss}e           Flo{}sse
 Russe      Russe                 Ru{\ss}e            Ru{}sse
 

-gustaf
---------------------------------------------------------------------------
Wirtschaftsinformatik und Software-Technik        
Universitaet GH Essen, FB5
Altendorfer Strasse 97-101, Eingang B, D-45143 Essen
Tel.: +49 (0201) 81003-74, Fax:  +49 (0201) 81003-73
Gustaf.Neumann@uni-essen.de
