Corpora: Annotation question

From: Nancy M. Ide (ide@cs.vassar.edu)
Date: Tue Mar 14 2000 - 19:00:46 MET

  • Next message: Eduard Hovy: "Corpora: Job in SPEECH RECOGNITION and DIALOGUE PROCESSING"

    Hello,

    I would like to know if people (or systems?) are actively using TEI
    elements such as <offset> and <distance> in markup of text for
    representing the analysis of relative temporal and spatial
    expressions, and attributes such as "reg" (for capturing
    normalization) and "exact" (for indicating fuzziness). If they are
    being used, then I'd be interested in what they're being used for,
    whether the uses require extreme degrees of agreement among annotators
    about the guidelines for annotation, what the 'lessons learned' are,
    etc.

    Also, I would like information on what are *people* and/or *systems*
    being expected to annotate, and what do the annotations capture, in
    particular for "distinguished expressions" (names, patterns).

    Thanks in advance,
    Nancy

    =======================================================

    Nancy Ide

    Professor and Chair
    Department of Computer Science, Vassar College
    Poughkeepsie, NY 12604-0520 USA
    Tel: +1 914 437-5988 Fax: +1 914 437-7498
    ide@cs.vassar.edu

    Chercheur Invite
    Equipe Langue et Dialogue, LORIA/CNRS
    Campus Scientifique - BP 239
    54506 Vandoeuvre-les-Nancy FRANCE
    Tel: +33 (0)3 83 59 20 47 Fax: +33 (0)3 83 41 30 79
    ide@loria.fr

    =======================================================



    This archive was generated by hypermail 2b29 : Tue Mar 14 2000 - 19:01:07 MET