Corpora: EXTENDED DEADLINE: Workshop on Asian Resources and International Standardization

From: Alessandro Lenci (
Date: Mon Apr 29 2002 - 17:53:14 MET DST

                 ********************* EXTENDED DEADLINE ****************

                             ----- NEW DEADLINE: MAY 10th -----

                          COLING 2002 Post-Conference Workshop

                       The 3rd Workshop on Asian Language Resources
                             and International Standardization

                               Center of Academia Activities,
                              Academia Sinica, Taipei, Taiwan
                                      August 31, 2002


    Language resources play an important role in recent corpus-based natural
    language processing research. A lot of effort has been focused on
    compiling various kinds of language resources, particularly in the US
    and European countries. In addition, standards represent a necessary
    step to consolidate technological achievements in this sector, to
    enhance and foster the exchange of know-how between research and
    industry, and to define infrastructures for the re-use and sharing of
    existing language resources through the specification of common formats
    and frameworks. Since 1993 the Commission of the European Union has been
    actively supporting the standardization process in human language
    technology, in particular by sponsoring the EAGLES initiative. This
    activity has extended to the framework of the EU-US International
    Research Co-operation, supported by NSF and the European Union (
    Compared to English and many European languages, the availability and
    accessibility of Asian language resources are still limited. Moreover,
    Asian languages are more diverse in their character sets and
    grammatical properties. Because of these peculiarities, Asian
    languages do not always fit the existing linguistic resource
    standardization frameworks.
    We have held two workshops on the same topic: the first took place in
    January 2001 in Tokyo on an invited basis, and the second was held in
    conjunction with the 6th Natural Language Processing Pacific Rim
    Symposium (NLPRS 2001) in November 2001 in Tokyo ( In this third
    workshop, we would like to put emphasis on standardization of Asian
    language resources, and to provide a chance to discuss research results
    and the possibilities of international collaboration on the development
    of Asian language resources in the future. The workshop also aims to
    introduce the status of Asian language resources to researchers in other
    We invite papers on all topics related to language resources, in
    particular Asian language resources and their development including, but
    not limited to:

    * Text corpora
    * Machine-readable dictionaries
    * Lexicons
    * Grammars
    * Exchange and annotation schemata
    * Infrastructure for constructing and sharing language resources
    * Exchange formats
    * Best practices for creating and disseminating language resources
    * Metadata for resource classification and discovery
    * Strategies and priorities for EU-US and Asian cooperation
    * Standards for language resources (lexicons, corpora, ontologies, etc.)

    * Lexical standards and multilinguality
    * Standards for content management
    * Standards and applications
    * Standards and evaluation


    * Nicoletta Calzolari (co-chair) - Istituto di Linguistica
    Computazionale CNR, Pisa (Italy)
    * Key-Sun Choi (co-chair) - Korea Advanced Institute of Science and
    Technology (Korea)
    * Asanee Kawtrakul (co-chair) - Kasetsart University (Thailand)
    * Alessandro Lenci (co-chair) - Dipartimento di Linguistica - Universita
    di Pisa (Italy)
    * Tokunaga Takenobu (co-chair) - Tokyo Institute of Technology (Japan)
    * Steven Bird - University of Pennsylvania (USA)
    * Nuria Bel - GILCUB (Spain)
    * Ehara Terumasa - NHK (Japan)
    * Christiane Fellbaum - Princeton University (USA)
    * Ralph Grishman - New York University (USA)
    * Chu-Ren Huang - Academia Sinica (Taiwan)
    * Hammam Riza - BPPT (Indonesia)
    * Kurohashi Sadao - University of Tokyo (Japan)
    * Martha Palmer - University of Pennsylvania (USA)
    * Hae-Chang Rim - Korea University (Korea)
    * Rajeev Sangal - International Institute of Information Technology
    Hyderabad (India)
    * Shirai Kiyoaki - Japan Advanced Institute of Science and Technology
    * Virach Sornlertlamvanich - NECTEC (Thailand)
    * Gregor Thurmair - SAIL Labs (Munich)
    * Benjamin Tsou - City University of Hong Kong (China)
    * Antonio Zampolli - Istituto di Linguistica Computazionale CNR


    Paper submission due *** NEW DATE!! ***

    ** May 10, 2002 **

    Notification of acceptance
    June 7, 2002

    Deadline for camera-ready papers
    June 29, 2002

    Workshop date
    August 31, 2002


    Center of Academia Activities, Academia Sinica, Taipei, Taiwan.


    Papers of no more than 8 pages should be sent by e-mail in PDF format
    with all non-ASCII fonts embedded, no later than May 10, 2002,
    to Alessandro Lenci ( After acceptance
    notification, authors are requested to prepare a camera-ready version of
    no more than 8 pages, in the format prescribed by COLING 2002. Please
    see to get style sheet. The
    camera-ready should be sent electronically in the PDF format with all
    non-ASCII fonts embedded, no later than June 29, 2002 to Tokunaga
    Takenobu (
