Re: Corpora: Parallel corpora and French software

From: Jean Veronis (Jean.Veronis@newsup.univ-mrs.fr)
Date: Tue Jun 06 2000 - 16:34:21 MET DST

    At 15:07 06/06/2000 +0100, NOELLE-VERONIQUE SERPOLLET wrote:

    >Ideally, I would like to gather a French/English parallel corpus
    >(with the texts being aligned if possible).

    In reply to another question on this list, I just mentioned the ARCADE
    corpus: ca. 1.5M words of Fr/En texts aligned at the sentence level:

    http://www.up.univ-mrs.fr/~veronis/arcade

    The corpus is distributed by ELRA:

    http://www.icp.grenet.fr/ELRA/home.html

    >Furthermore, are you aware of corpus tools (taggers/lemmatizers)
    >that I could use for my analyses of the French?
    >(I know about Cordial 6 Universites and will probably purchase it,
    >and I am currently working with ParaConc (Barlow, 1995)).
    >
    >I would be grateful if you could tell me where I could obtain
    >a tagger/concordancer which would enable me to retrieve occurrences
    >of the French subjunctive.

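    Cordial does it. In the sample output below (token number, word form,
    lemma, tag), the subjunctive "vienne" is tagged VSUBP1S: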

    1 Il il PPER3S
    2 faut falloir VINDP3S
    3 que que SUB
    4 je je PPER1S
    5 vienne venir VSUBP1S
    6 . . PCTFORTE
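
    Retrieving the subjunctives from output of this kind amounts to
    filtering on the tag column. A minimal sketch in Python follows. It
    assumes four whitespace-separated fields per line, as in the sample
    above, and that subjunctive tags begin with "VSUB"; that prefix is
    inferred from the single VSUBP1S example, not from the Cordial
    documentation. The names Token, parse_tagged and find_subjunctives are
    made up for illustration.

```python
from typing import List, NamedTuple


class Token(NamedTuple):
    index: int   # token position in the sentence
    form: str    # word form as it appears in the text
    lemma: str   # dictionary form
    tag: str     # part-of-speech tag


# The sample output quoted in the message above.
SAMPLE = """\
1 Il il PPER3S
2 faut falloir VINDP3S
3 que que SUB
4 je je PPER1S
5 vienne venir VSUBP1S
6 . . PCTFORTE
"""


def parse_tagged(text: str) -> List[Token]:
    """Parse lines of the form 'index form lemma tag' into Token records."""
    tokens = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines and anything that does not fit the assumed layout.
        if len(fields) != 4 or not fields[0].isdigit():
            continue
        index, form, lemma, tag = fields
        tokens.append(Token(int(index), form, lemma, tag))
    return tokens


def find_subjunctives(tokens: List[Token], prefix: str = "VSUB") -> List[Token]:
    """Return tokens whose tag starts with the assumed subjunctive prefix."""
    return [t for t in tokens if t.tag.startswith(prefix)]


hits = find_subjunctives(parse_tagged(SAMPLE))
for t in hits:
    print(t.index, t.form, t.lemma, t.tag)
```

    Matching on a tag prefix rather than on the word form avoids false hits
    on the conjunction "que", whose tag here is SUB rather than VSUB. Real
    output may contain multi-word tokens or extra columns, in which case the
    four-field check would need adjusting.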

    Jean Véronis
    http://www.up.univ-mrs.fr/~veronis


