RE: [Corpora-List] Concordancing russian text

From: cfairon@fltr.ucl.ac.be
Date: Wed Jun 25 2003 - 23:49:41 MET DST

  • Next message: Alistair Knott: "[Corpora-List] Call for papers: 2003 Australasian Language Technology Workshop"

    Dear Ivonne,

    You can also try UNITEX (Free and Open Source) which comes with a large Russian
    dictionary: http://www-igm.univ-mlv.fr/~unitex/

    NOTE:
    There is a free online concordance service that will give you an idea of the
    UNITEX search possibilities. The name of this service is "GlossaNet Instant".
    http://glossa.fltr.ucl.ac.be/instant/

    It enables the users to search the web edition of a hundred newspapers (in 11
    languages) and to build concordances.

    GlossaNet uses UNITEX dictionaries and offers advanced search options (you can
    use grammatical codes, etc.). For instance, here are some valid queries for English:

    <be><PREP><DET><N>
    <be>going to <V>
    etc
     
    GlossaNet has currently only one Russian Newspaper: Moskovski Komsomolets (but
    the webmaster add new titles upon demand). You can search for simple 'tokens' or
    for more refined expressions.

    The nice thing with the online version is that you don't have to install
    anything (great for students).

    If you like GlossaNet instant... You can also register to the full monitoring
    service (free too): http://glossa.fltr.ucl.ac.be
    With this second service, you register your query once and get a new concordance
    every day/week (when the newspaper web site is updated).

    Feel free to ask for help if you want to use GlossaNet.

    Cedrick

    Cedrick Fairon
    Directeur du CENTAL
    Centre de traitement automatique du langage
    Université catholique de Louvain
    Place Blaise Pascal, 1
    1348 Louvain-la-Neuve
    Belgique
    http://cental.fltr.ucl.ac.be
    ->CENTAL is coorganizer of JADT2004 : www.jadt.org

    Quoting sidorov <sidorov@cic.ipn.mx>:

    > Dear Ivonne,
    >
    >
    >
    > You can use my concordancing program specially written for Russian several
    > years ago. It is free.
    >
    > Its advantage is that the program has an option to treat Russian morphology.
    >
    >
    >
    > Sincerely,
    >
    > Grigori
    >
    >
    >
    > Grigori Sidorov, PhD
    >
    > Natural Language Processing Laboratory,
    >
    > Center for Computing Research, National Polytechnic Institute,
    >
    > Mexico City, Mexico
    >
    > E-mail: <mailto:sidorov@cic.ipn.mx> sidorov@cic.ipn.mx
    >
    >
    >
    > -----Mensaje original-----
    > De: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] En nombre
    > de Webmaster CL
    > Enviado el: Martes, 24 de Junio de 2003 07:56 a.m.
    > Para: CORPORA@UIB.NO
    > Asunto: [Corpora-List] Concordancing russian text
    >
    >
    >
    > Dear members of the Corpora-Mailing List,
    >
    >
    >
    > I have been trying to put together a small corpus in Russian (Cyrillic
    > coding) for classroom concordancing, i.e. for the use with concordancers
    > such as MonoConc and the like. I am aware of the online availability of the
    > Uppsala corpus at <http://www.sfb441.uni-tuebingen.de/b1/korpora.html>
    > http://www.sfb441.uni-tuebingen.de/b1/korpora.html yet for presentation
    > purposes at a workshop I was hoping to present offline concordancing with a
    > standalone program. Currently, my problem lies not so much on the
    > linguistic/theoretical side of things but rather the technical realisation
    > of working with Cyrillic text. So far I have not succeeded in putting
    > together a *.txt-file in Russian that MonoConcPro would read. I thought that
    > it was actually capable of it though. Could somebody point me into the right
    > direction? Any help would be much appreciated.
    >
    > Best regards,
    >
    > Yvonne Breyer
    >
    >
    >
    > ----------------------------------------------
    >
    > Webmaster @ <http://www.corpus-linguistics.info/>
    > http://www.corpus-linguistics.info
    >
    > University of Essen, Germany
    >
    >

    -------------------------------------------------
    This mail sent through IMP: http://horde.org/imp/



    This archive was generated by hypermail 2b29 : Thu Jun 26 2003 - 22:45:02 MET DST