Re: [Corpora-List] ACL proceedings paper in the American National Corpus

From: Alexander Yeh (asy@mitre.org)
Date: Fri Sep 27 2002 - 21:28:18 MET DST

  • Next message: Alexander Yeh: "Re: [Corpora-List] ACL proceedings paper in the American National Corpus"

    Nancy Ide wrote:

    > The American National Corpus Consortium, with permission from the
    > Association for Computational Linguistics, will include in the American
    > National Corpus a selection of recent papers written by American authors
    > and published in ACL proceedings and anthologies. Any authors who object
    > to having their papers included in the American National Corpus should
    > contact Nancy Ide (ide@cs.vassar.edu) to have their papers removed.
    >
    > Note that this applies to papers whose authors are native speakers of
    > American English only.

    Two questions. What is your definition of native speaker? and how are you
    going to determine who meets your definition?

    This is not as trivial as it may sound. When I was in school, there were a
    bunch of people who spent their entire lives in the US (were born in the
    US., etc.), but because their parents came from other countries and spoke
    English as a second language, one project in speech recognition did not
    consider that bunch of people "native" speakers.

    Also, determining who meets what ever standard you have may be tricky: a
    probably extreme example: I know of a person who was born in the US, has a
    last name typical of country A, but is growing-up in a household where
    American English and a language from country B is spoken (the mother comes
    from country B).

    Thanks
    -Alex Yeh

    >
    > =======================================================
    >
    > Nancy Ide
    >
    > Professor and Chair
    > Department of Computer Science, Vassar College
    > Poughkeepsie, NY 12604-0520 USA
    > Tel: +1 845 437-5988 Fax: +1 845 437-7498
    > ide@cs.vassar.edu
    >
    > Chercheur Associe
    > Equipe Langue et Dialogue, LORIA/CNRS
    > Campus Scientifique - BP 239
    > 54506 Vandoeuvre-les-Nancy FRANCE
    > Tel: +33 (0)3 83 59 20 47 Fax: +33 (0)3 83 41 30 79
    > ide@loria.fr
    >
    > =======================================================



    This archive was generated by hypermail 2b29 : Fri Sep 27 2002 - 21:36:37 MET DST