Corpora: Software for Corpus storing

Sven Hartrumpf (Sven.Hartrumpf@FernUni-Hagen.de)
Mon, 6 Sep 1999 08:47:29 +0200 (MET DST)

Dear Rudolf Muhr.

> 2) Which standard of text-encoding would you choose? (SGML, "Light" SGML,
> others)?

SGML is certainly a good choice for your task. (There are some tendencies
in favor of XML because it is much simpler for tools to process; but the
differences from a user's point of view are minor.)
However, the choice of SGML doesn't restrict your type of encoding very much;
the question is which DTD to use.
the CES (Corpus Encoding Standard) DTD as a starting point.

http://www.cs.vassar.edu/CES/

Best regards
Sven

------------------------------------------------------------------------------
Sven Hartrumpf e-mail: Sven.Hartrumpf@FernUni-Hagen.de
Computer Science VII (AI) phone: +49 2331 987 4553
University of Hagen fax: +49 2331 987 392
58084 Hagen - Germany http://pi7.fernuni-hagen.de/hartrumpf