Corpora: SPECIAL ANNOUNCEMENT FROM ELRA - SPEECH RECOGNITION

Valerie Mapelli (info-elra@calva.net)
Mon, 19 Jan 1998 17:21:36 +0100 (MET)

[ We apologise for the duplicate posting of this announcement ]

EUROPEAN LANGUAGE RESOURCES ASSOCIATION
ELRA News
=====================================

*** SPECIAL ANNOUNCEMENT FROM ELRA - SPEECH RECOGNITION ***

Paris, France -- The European Language Resources Association, the central
distribution unit for Language Resources in Europe, today presents a special
announcement on resources in speech recognition. As of today, the
association can offer as many as 70 databases in the area of spoken
resources. The Language Resources available for Speech and Speech
Recognition are listed below.

*Speech Recognition for Telephone Applications*

American English: Siemens VoiceMail: 921 American speakers recorded 25,000
utterances over the digital telephone network.
Danish: Danish SpeechDat(M) database: 1,523 speakers.
Dutch: Dutch Polyphone database: Read & spontaneous speech over the
telephone from 5,050 Dutch speakers.
English: English SpeechDat(M) database: 1,000 speakers over digital
telephone lines.
French: FRESCO: 1,000 speakers over the telephone in France.
German: German SpeechDat(M) database: 1,000 speakers.
SieTill (Siemens Tillman): 730 speakers and 36,000 utterances (digit
sequences, dates, spelled names, ...).
Italian: COLLECT: 500 Italian speakers uttered the 10 Italian digits and 5
command words.
Swiss-French: Swiss-French SpeechDat(M) polyphone database: 5,000 speakers
answered 10 questions and read 28 items.

*Microphone-Based Databases*

Dutch: GRONINGEN: Over 20 hours of Dutch read speech material from 238 speakers.
English: TED (Translanguage English Database): 188 oral presentations in
English given at Eurospeech'93 in Berlin.
TEDPhone: Polyphone/SpeechDat-like recordings of 64 speakers in English and
in their native language.
French: BREF-80: Training data of 5,330 sentences read by 80 French speakers.
BREF-POLYGLOT: Training data of 3,193 sentences read by 6 French speakers.
German: PHONDAT 1: Read speech from 201 German speakers who read 450
different sentences each.
PHONDAT 2: 200 different sentences from a train inquiry task read by 16
German speakers.
SIEMENS 100: 100 sentences from the German newspaper Süddeutsche Zeitung,
read by 101 speakers.
SIEMENS 1000: 1,000 sentences from the German newspaper Süddeutsche Zeitung,
read by 10 speakers.
VERBMOBIL: German spontaneous speech databases recorded in a dialogue task.
Italian: Apasci: 100 speakers with 16,090 utterances and digits, 58,924
words and 641 minutes of speech.
EUROM1i: Over 60 speakers who pronounced numbers, sentences, and isolated
words using a close-talking microphone.

*Speaker Verification/Identification*

English: COST232 - Multi-English database: 797 calls received in Italy and
in the UK.
POLYCOST: Over 100 speakers with ca. 10 call sessions per speaker (English
spoken by foreigners).
German: PolyVar: 143 speakers with 3,600 recorded sessions.
SpeechDat Speaker Verification database: Subset of PolyVar with 20 speakers
who recorded 50 sessions.
Multilingual: M2VTS: Multilingual database using multimodal identification
of human faces (speech & image).

The European Language Resources Association, founded in Luxembourg in 1995,
provides the infrastructure for identifying, collecting, classifying,
validating, distributing, and exploiting language resources. Such resources
include both speech and text, terminology and software tools.

More details on these and all other ELRA databases can be found on the ELRA
Web site: http://www.icp.grenet.fr/ELRA/home

Contact: ELRA/ELDA, 87 Avenue d'Italie, 75013 Paris, France, Tel +33-1-45 86
53 00, E-mail: elra@calvanet.calvacom.fr
============<><><><><><><><><><><><>============
Valerie Mapelli Tel: +33 1 45 86 53 00
ELRA/ELDA Fax: +33 1 45 86 44 88
87, Avenue d'Italie E-mail: info-elra@calva.net
75013 PARIS http://www.icp.grenet.fr/ELRA

============<><><><><><><><><><><><>============
FIRST INTERNATIONAL CONFERENCE ON
LANGUAGE RESOURCES AND EVALUATION
GRANADA, SPAIN, 28-30 MAY 1998

http://www.icp.grenet.fr/ELRA/conflre.html
============<><><><><><><><><><><><>============