Corpora: Russian

Natalia Zinovjeva (natalia@strindberg.ling.uu.se)
Thu, 1 Oct 1998 16:10:26 +0200 (DFT)

Dear list members,

Many thanks to those who responded to my message!

My question was as follows:

>What has been done for Russian in the area of computational linguistics?
>Are there any tagged corpora of Russian text available (or untagged
>corpora, if they are interesting in any other respect)?
>Are there any machine readable lexica for Russian available?
>Morphological descriptions? Grammars?
>Do people feel that the methods developed for languages such as English
>are adequate for Russian? (My own feeling is that they are not always
>adequate..)

Here are the answers and a list of people who responded:

Tomaz Erjavec
Ann Lawson
Tony Mcenery
Kjetil Ra Hauge
Anna Sågvall Hein
Elisabeth Seitz
Mike Scott
Rainer Siemund
Nicolas Turenne

---------------
A Russian translation of Orwell's 1984:
http://nl.ijs.si/ME/CD/docs/1984.html

A Russian library on the WWW:
http://www.alkar.net/moshkow/html-KOI

CFRL - Computer Fund of Russian Language
http://solaris3.ids-mannheim.de/~cfrl/

A Russian corpus is available at Upppsala University,
contact person:

Dr. Ingrid Maier
Institute of Slavic Languages
Box 513
S-751 20 Uppsala (Sweden)
(E-Mail: ingrid.maier@slaviska.uu.se)

Dr. Elizabeth Seitz, Digital Corpora and Databases: New Horizons in Slavic
Linguistics
http://www.uni-tuebingen.de/uni/nss/docs/seminar/ES/pub/corpora.html

The TELRI project, an infrastructure activity chiefly for language of
Central
and Eastern Europe and the Newly Independent States.
http://www.ids-mannheim.de/telri/

The Tuebingen University Slavic Department's website:
http://www.uni-tuebingen.de/uni/nss/

Morgan Nilsson's homepage:
http://www.hum.gu.se/~slamn/m_nils.htm

Thanks a lot,

Natalia Zinovjeva

-------------------------------
Natalia Zinovjeva
Department of Linguistics,
Uppsala University

e-mail:
natalia@stp.ling.uu.se
natalia@ling.gu.se