Corpora: A Survey of Open Language Archives

From: Steven Bird (sb@unagi.cis.upenn.edu)
Date: Tue Aug 15 2000 - 21:56:44 MET DST

    A Survey of Open Language Archives
    http://www.ldc.upenn.edu/exploration/survey.html

    By "Open Language Archive" we mean a digital repository of language
    data, documentation and description, including texts, recordings,
    dictionaries, grammars and field notes, where there is an intent to
    make the materials openly available. "Open Language Archive" is
    construed broadly, to include any such repository which has an
    accessible digital component, even if it is just an online catalog or
    a few digital holdings. It also encompasses organizations which
    publish language data in electronic form. (N.B. Our use of "open" is
    inspired by the Open Archives Initiative [www.openarchives.org].)

    Examples of the kinds of archives we have in mind are listed here:
    http://www.ldc.upenn.edu/exploration/archives.html

    The purpose of this survey is to identify:

    1. all existing digital archives of language materials, the nature of
        their holdings, and any electronic publication activities
        (whether digital media or web-based);

    2. all planned digital archives of language materials
        and their present status;

    3. any technical, legal or moral obstacles particular to
        archives of language materials.

    Additionally, we would like to find out about the metadata
    (i.e. catalog fields) used for classifying archived
    language materials.

    If you archive or publish digital language materials or the
    associated metadata, or plan to do so, please visit this URL
    and complete our online survey:

            http://www.ldc.upenn.edu/exploration/survey.html

    If you know of such an archive or publisher, please forward this
    message to the appropriate person.

    Many thanks,
    Steven Bird.

    --
    Steven.Bird@ldc.upenn.edu  http://www.ldc.upenn.edu/sb
    Assoc Director, LDC; Adj Assoc Prof, CIS & Linguistics
    Linguistic Data Consortium, University of Pennsylvania
    3615 Market St, Suite 200, Philadelphia, PA 19104-2608
    


