Corpora: massive text corporisation

From: Rykov V.V. (rykov@narod.ru)
Date: Fri Jun 01 2001 - 16:04:50 MET DST


    Hello !

    Maybe somebody remembers that I mentioned before that there is an enormous collection of Russian texts here, collected by Sergey Lesnikov at Komi Republic University.

    There are thousands of texts there, 4 Gb in total.

    Now he thinks that his task is to begin converting them into a corpus or corpora. I think that a corpus is something totally different, a different kind of unit. Maybe I am wrong.

    Maybe there are people who would be kind enough to find the time to give him some good advice?

    I am not sure I am a guru or No. 1 in Corpus Linguistics Philosophy.

    -- 
    Vladimir Rykov, PhD in Comp Linguistics, Linguistic Institute RAS, MOSCOW
    


