New Corpus from the Linguistic Data Consortium

LDC Office (ldc@unagi.cis.upenn.edu)
Wed, 16 Oct 1996 16:47:12 EDT

Messages sorted by: [ date ][ thread ][ subject ][ author ]
Next message: Vivi Juul-Pedersen: "International Masters Degree"
Previous message: Kavi Mahesh: "CFP: NLP for the WWW (due Oct 25)"

Announcing a NEW RELEASE from the
LINGUISTIC DATA CONSORTIUM

Voice Across Hispanic America
VAHA

Voice Across Hispanic America (VAHA) is a corpus of Spanish telephone
speech, recorded digitally from 915 native speakers of Spanish in
various parts of the United States. With nearly 39,000 recorded and
transcribed utterances, VAHA will be useful for a variety of research
studies, but it is intended primarily for speech technology research
and development in telecommunications applications. It is patterned
after MACROPHONE (LDC94S21), an American English corpus that is
widely used for this purpose.

This corpus was collected by Texas Instruments in Dallas, TX, for the
Linguistic Data Consortium at the University of Pennsylvania.

Institutions that have membership in the LDC during the 1996
membership year will receive VAHA on request at no additional
charge, in the same manner as all other text and speech corpora
published by the LDC.

Nonmembers can receive a copy of VAHA for research purposes only for
a fee of $3500. If you would like to order a copy of this corpus,
please email your request to ldc@unagi.cis.upenn.edu. If you need
additional information before placing your order, or would like to
inquire about membership in the LDC, please send email or call (215)
898-0464.

Further information about the LDC and its available corpora can be
accessed on the Linguistic Data Consortium WWW Home Page at URL
http://www.ldc.upenn.edu/.

Next message: Vivi Juul-Pedersen: "International Masters Degree"
Previous message: Kavi Mahesh: "CFP: NLP for the WWW (due Oct 25)"