DEADLINE REMINDER: January 23, 1995

evelyne@research.att.com
Wed, 18 Jan 95 12:19:19 -0500

From: Evelyne Tzoukermann

CALL FOR PAPERS
FROM TEXTS TO TAGS: ISSUES IN MULTILINGUAL LANGUAGE ANALYSIS

EACL SIGDAT WORKSHOP
Dublin, Ireland - March 27, 1995

Third Announcement

This workshop is organized by the ACL special interest group SIGDAT
and will be held in conjunction with the meeting of the European
Chapter of the Association for Computational Linguistics. The meeting
will be co-chaired by Susan Armstrong, ISSCO, and Evelyne Tzoukermann,
AT&T Bell Laboratories.

Submission deadline: January 23
Notice of acceptance/rejection: February 10
Camera-ready copy due: March 1

With the growing amount of multilingual corpus data becoming
available, there is a pressing need to explore issues in
representation and analysis of these texts. Although extensive and
leading work has been accomplished for languages such as English,
many theoretical and concrete issues remain to be resolved in the
representation and tagging of other languages.

The focus of this workshop is on multilingual text analysis, from the
level of the text itself (e.g. tokenization and sentence separation) to
morphosyntactic analysis, specifically tagging. We intend to focus on
tagging because, from a computational point of view, part-of-speech
tagging is often an important prerequisite to further structural
analysis. Additionally, many NLP systems can make use of tagged
corpora for various applications.
However, tasks such as tokenization and tagging continue to raise
serious challenges in multilingual text analysis, due to differing
types of morphological characteristics across languages.

Topics of interest include (but are not limited to):

- tokenization and segmentation
- interfaces between morphological analysis and part-of-speech tagging
- size and choice of tagset
- defining and refining new tag sets
- mapping between tag sets
- universal vs. language-specific tags
- multilingual approaches to tagging

We invite submissions on topics that in general reflect an awareness
of differences and similarities in working on multilingual text.
We also welcome substantive descriptions of newly started and ongoing
projects.

Program Committee:

K. Church, USA
B. Gale, USA
J.-M. Lange, FR
G. Leech, UK
A. Voutilainen, FI

FORMAT FOR SUBMISSION: Authors should submit extended abstracts
(2000-3000 words), either electronically or in hard copy. Electronic
submissions must be either plain ASCII text or a PostScript file
following the EACL-95 stylesheet. Hard copy backup should include
two (2) copies of the paper. Abstracts should be sent to either of
the following addresses:

Evelyne Tzoukermann
AT&T Bell Laboratories
Room 2D-448, P.O. Box 636
600 Mountain Avenue
Murray Hill, NJ, 07944-0636
USA
tel. +1-908-582-2924
fax +1-908-582-7308
email evelyne@research.att.com

Susan Armstrong-Warwick
ISSCO, University of Geneva
54 route des Acacias
CH-1227 Geneve
Switzerland
tel. +41-22-705-7113
fax +41-22-300-1086
email susan@divsun.unige.ch