Corpora: Granada Workshop Call for Papers

Roberta Catizone (R.Catizone@dcs.shef.ac.uk)
Thu, 8 Jan 1998 11:55:29 GMT

ADAPTING LEXICAL AND CORPUS RESOURCES TO SUBLANGUAGES AND APPLICATIONS

a workshop to be held at the

FIRST INTERNATIONAL CONFERENCE
ON LANGUAGE RESOURCES AND EVALUATION

GRANADA, SPAIN, 26 MAY 1998

The workshop will provide a forum for those researchers involved in
the development of methods to integrate corpora and MRDs, with the aim of
adding adaptive capabilities to existing linguistic resources.

Organisers: Roberto Basili (University of Roma "Tor Vergata"),
Roberta Catizone (University of Sheffield),
Maria Teresa Pazienza (University of Roma "Tor Vergata"),
Paola Velardi (University of Roma "La Sapienza"),
Yorick Wilks (University of Sheffield)

WORKSHOP SCOPE AND AIMS

Lexicons, i.e., those components of an NLP system that contain "computable"
information about words, cannot be considered as static objects. Words may
behave very differently in different domains, and there are language
phenomena that do not generalize across sublanguages.
Lexicons are a snapshot of a given stage in the development of a language,
normally provided without support for adapting to change, whether caused
by language creativity and development or by a shift to
a previously unencountered domain.

The divergence of corpus usages from lexical norms has been studied
computationally at least since the late Sixties, but only recently
has the availability of large on-line corpora made it possible to establish
methods to cope systematically with this problem.
An emerging branch of research is now involved in studies and experiments
on corpus-driven linguistics, with the aim of complementing and
extending earlier work on lexicon acquisition based on Machine Readable
Dictionaries (MRDs): data are extracted from texts, as embodiments of
language in use, so as to capture lexical regularities and to code them
into operational
forms. The purpose of this workshop will be to provide an updated snapshot
of current work in the area, and promote discussion of how to make progress.

Central topics will be (though this list is in no way exclusive):

* corpus-driven tuning of MRDs to optimize domain-specific inferences
* terminology and jargon acquisition
* sense extensions
* acquisition of preference or subcategorization information from corpora
* taxonomy adaptation
* statistical weighting of senses etc. to domains
* use of MRDs to provide explanations of linguistic phenomena in corpora
* the scope of "lexical tuning"
* the evaluation of lexical tuning as a separate task, or as part
  of a more generic task

PROGRAM COMMITTEE

Yorick Wilks            University of Sheffield
Roberta Catizone        University of Sheffield
Paola Velardi           University of Roma "La Sapienza"
Maria Teresa Pazienza   University of Roma "Tor Vergata"
Roberto Basili          University of Roma "Tor Vergata"
Bran Boguraev           Brandeis University
Sergei Nirenburg        New Mexico State University
James Pustejovsky       Brandeis University
Ralph Grishman          New York University
Christiane Fellbaum     Princeton University

PAPER SUBMISSION

FORMATTING GUIDELINES:
Papers should not exceed 4000 words or 10 pages.

HARD COPIES:

Three hard copies should be sent to:

Paola Velardi
Dipartimento di Scienza dell'Informazione
via Salaria 113
00198 Roma
Italy

ELECTRONIC SUBMISSION:

Electronic submissions will be accepted in PostScript, Word for Mac, or RTF.
An FTP site will be available on demand.
Authors should send an info email to Paola Velardi
(velardi@dsi.uniroma1.it) even if they submit in paper form.
An electronic submission should be accompanied by a plain ASCII text.

# NAME : Name of first author
# TITLE: Title of the paper
# PAGES: Number of pages
# FILES: Name of file (if also submitted electronically)
# NOTE : Anything you'd like to add
# KEYS : Keywords
# EMAIL: Email of the first author
# ABSTR: Abstract of the paper
# . . . . . .

IMPORTANT DATES

Paper Submission Deadline (Hard Copy/Electronic)   February 20
Paper Notification                                 March 20
Camera-Ready Papers Due                            April 15
L&CT workshop                                      May 26

CONFERENCE INFORMATION

General information about the conference is at:
<http://www.icp.inpg.fr/ELRA/conflre.html>

Specific queries about the conference should be directed to:

LREC Secretariat
Facultad de Traduccion e Interpretacion
Dpto. de Traduccion e Interpretacion
C/ Puentezuelas, 55
18002 Granada, SPAIN
Tel: +34 58 24 41 00 - Fax: +34 58 24 41 04
reli98@goliat.ugr.es