From: Alessandro Lenci (
Date: Sat Apr 20 2002 - 09:07:51 MET DST

  • Next message: Nadja Nesselhauf: "Re: Corpora: Learner Corpora"

                         COLING 2002 Post-Conference Workshop

                              INTERNATIONAL STANDARDIZATION

                      Center of Academia Activities, Academia Sinica
                                    Taipei, Taiwan

                                    August 31, 2002


    Language resources play an important role in recent corpus-based natural
    language processing research. A lot of effort has been focused on compiling
    various kinds of language resources, particularly in the US and European
    countries. In addition, standards represent a necessary step to consolidate
    technological achievements in this sector, to enhance and foster the
    exchange of know-how between research and industry, and to define
    infrastructures for the re-use and sharing of existing language resources
    through the specification of common formats and frameworks. Since 1993 the
    Commission of the European Union has been actively supporting the
    standardization process in human language technology, in particular by
    sponsoring the EAGLES initiative. This activity has extended to the
    framework of the EU-US International Research Co-operation, supported by
    NSF and the European Union (

    Compared to English and many European languages the availability and
    accessibility of Asian language resources is still limited. Moreover, there
    is more diversity of Asian languages from viewpoints of character sets and
    grammatical properties. Because of these peculiarities, Asian languages do
    not always fit with the existing linguistic resource standardization

    We have held two workshops on the same topic, the first was in January of
    2001 at Tokyo on invited basis and the second was in conjunction with the
    6th Natural Language Processing Pacific Rim Symposium (NLPRS 2001) in
    November of 2001 at Tokyo ( In this third
    workshop, we would like to put emphasis on standardization of Asian
    language resources, and to provide a chance to discuss research results and
    the possibilities of international collaboration on the development of
    Asian language resources in the future. The workshop also aims to introduce
    the status of Asian language resources to researchers in other regions.

    We invite papers on all topics related to language resources, in particular
    Asian language resources and their development including, but not limited to:

    * Text corpora
    * Machine-readable dictionaries
    * Lexicons
    * Grammars
    * Exchange and annotation schemata
    * Infrastructure for constructing and sharing language resources
    * Exchange formats
    * Best practices for creating and disseminating language resources
    * Metadata for resource classification and discovery
    * Strategies and priorities for EU-US and Asian cooperation
    * Standards for language resources (lexicons, corpora, ontologies, etc.)
    * Lexical standards and multilinguality
    * Standards for content management
    * Standards and applications
    * Standards and evaluation


    * Nicoletta Calzolari (co-chair) - Istituto di Linguistica
    Computazionale CNR, Pisa (Italy)
    * Key-Sun Choi (co-chair) - Korea Advanced Institute of Science and
    Technology (Korea)
    * Asanee Kawtrakul (co-chair) - Kasetsart University (Thailand)
    * Alessandro Lenci (co-chair) - Dipartimento di Linguistica - Universita di
    Pisa (Italy)
    * Tokunaga Takenobu (co-chair) - Tokyo Institute of Technology (Japan)
    * Steven Bird - University of Pennsylvania (US)
    * Nuria Bel - GILCUB (Spain)
    * Ehara Terumasa - NHK (Japan)
    * Christiane Fellbaum - Princeton University (USA)
    * Ralph Grishman - New York University (USA)
    * Chu-Ren Huang - Academia Sinica (Taiwan)
    * Hammam Riza - BPPT (Indonesia)
    * Kurohashi Sadao - University of Tokyo (Japan)
    * Martha Palmer - University of Pennsylvania (USA)
    * Hae-Chang Rim - Korea University (Korea)
    * Rajeev Sangal - Indian Institute of Information Technology (India)
    * Shirai Kiyoaki - Japan Advanced Institute of Science and Tecchnology (Japan)
    * Virach Sornlertlamvanich - NECTEC (Thailand)
    * Gregor Thurmair - SAIL Labs (Munich)
    * Benjamin Tsou - City University of HongKong (China)
    * Antonnio Zampolli - Istituto di Linguistica Computazionale ? CNR (Italy)

    Paper submission due
    April 30, 2002

    Notification of acceptance
    June 7, 2002

    Deadline for camera-ready papers
    June 29, 2002

    Workshop date
    August 31, 2002

    Center of Academia Activities, Academia Sinica, Taipei, Taiwan.


    A paper no more than 8 pages long should be sent via E-mail in the PDF
    format with all non-ASCII fonts embedded, no later than April 30, 2002 to
    Alessandro Lenci ( After acceptance
    notification, the authors are requested to make a camera-ready no more than
    8 pages long, and in the format prescribed by COLING 2002. Please see to get style sheet. The camera-ready
    should be sent electronically in the PDF format with all non-ASCII fonts
    embedded, no later than June 29, 2002 to Tokunaga Takenobu (

    This archive was generated by hypermail 2b29 : Sat Apr 20 2002 - 09:15:03 MET DST