Re: [Corpora-List] corpus of student translations - looking for references

From: Andrei Popescu-Belis (andrei.popescu-belis@issco.unige.ch)
Date: Wed Dec 08 2004 - 14:04:55 MET

    Spela Vintar wrote:
    > Dear Corpora Members,
    >
    > We have started with a group of students to compile a corpus of student
    > translations, planned as a parallel corpus of original texts and several
    > student translations, marked by their teachers.
    > Has anyone been involved in a similar project? Any advice concerning the
    > annotation scheme and corpus encoding, as well as the overall
    > methodology, would be greatly appreciated.
    >

    Hello,

    We developed a small pilot corpus (EN->FR and FR->EN) at the School of
    Translation of the University of Geneva, which is briefly described in:

    Popescu-Belis A., King M. & Benantar H. (2002) - Towards a Corpus of
    Corrected Human Translations. In: Handbook of the LREC 2002 Workshop
    "Machine Translation Evaluation: Human Evaluators Meet Automated
    Metrics", Las Palmas de Gran Canaria, Spain, p.17-21.
    http://issco-www.unige.ch/projects/isle/mteval-may02/mteval-lrec2002.ps.gz
    (the link is to the entire workbook of the workshop)

    The paper gives preliminary guidelines for the annotation of student
    translation mistakes (based on teachers' annotations on the paper
    version), and suggests some uses for such a corpus. Part of the data has
    already been used in an exercise that compared human and automated
    evaluation metrics. The exercise was organized at an LREC 2002 workshop,
    and is summarized in:

    Popescu-Belis A. (2003) - An experiment in comparative evaluation:
    humans vs. computers. In: Proc. of Machine Translation Summit IX, New
    Orleans, LA, USA, p.307-314.
    http://www.amtaweb.org/summit/MTSummit/FinalPapers/60-Popescu-final.pdf

    You could also look at the eCoLoRe and MeLLANGE European projects and
    try to find out whether they have more precise guidelines for making a
    student translation corpus, which is one of their objectives:
    http://mellange.eila.jussieu.fr/index.shtml.en
    http://ecolore.leeds.ac.uk/

    Best regards,
    Andrei Popescu-Belis

    -- 
    ISSCO/TIM/ETI, Université de Genève
    tél: +41 (0)22 379 86 81      40, bd. du Pont d'Arve
    fax: +41 (0)22 379 86 89      1211 Genève 4 - Suisse
           http://www.issco.unige.ch/staff/andrei
    


