Corpora: German Corpora for Adding Ambiguity Annotations

Sven.Hartrumpf@FernUni-Hagen.de
Wed, 4 Feb 1998 14:38:30 +0100

I am looking for a German corpus of written language (preferably a newspaper
corpus). The corpus should be preprocessed, e.g. sentences and words should
be marked up.
Is this available for ELRA's ECI/MCI corpus? Other corpora?

(I have prepared such a newspaper corpus with 8 million words in CES format
(http://www.cs.vassar.edu/CES), but I don't dare to go through the process
of negotiating about copyright etc. with the copyright holders in order to
share resources.)

In my research, I will add annotations for different syntactic and semantic
ambiguity problems.

Thanks for any help.

Sven Hartrumpf

*************************************************************************
* Sven Hartrumpf e-mail: Sven.Hartrumpf@FernUni-Hagen.de *
* Computer Science VII (AI) phone: +49 2331 987 4553 *
* University of Hagen fax: +49 2331 987 392 *
* 58084 Hagen - Germany *
* http://www.informatik.fernuni-hagen.de/pi7/hartrumpf *
*************************************************************************