Corpora: ELRA News

Valerie Mapelli (mapelli@elda.fr)
Tue, 11 May 1999 17:18:14 +0200

[ We apologise for the duplicate posting of this announcement ]

___________________________________________________________
ELRA
European Language Resources Association
ELRA News
___________________________________________________________

*** ELRA NEW RESOURCES ***

We are happy to announce a new resource available via ELRA:
_______________________________________
· ELRA-S0063 German SpeechDat(II) FDB 4000
_______________________________________

The German SpeechDat(II) FDB 4000 consists of 4000 calls over
the German fixed network, stored on 17 CD-ROMs in the final
SpeechDat(II) database exchange format. The speech databases
made within the SpeechDat(II) project were validated by SPEX,
the Netherlands, to assess their compliance with the SpeechDat
format and content specifications.

The following items were recorded:
- 1 isolated digit
- 1 sequence of 10 isolated digits
- prompt sheet number = 5
- 9-11 digit telephone number (read)
- 15-16 digit credit card number (read, 150 different credit card
numbers were found)
- 6 digit PIN code (read)
- 1 natural number (read)
- 1 money amount (read)
- 2 yes/no questions (spontaneous, not prompted)
- 3 dates (1 spontaneous, e.g. birthday; 1 prompted text form; 1 relative
and general date form)
- 1 time of day (spontaneous)
- 1 time phrase (read)
- 3 application words
- 1 word spotting phrase
- 5 directory assistance names (1 spontaneous name, e.g. forename;
1 spontaneous city name; 1 read city name (from a list of the 500 most
frequent); 1 read company/agency name (from a list of the 500 most frequent);
1 read proper name, fore- and surname (from a list of 150 SDB names))
- 3 spellings (1 spontaneous, e.g. forename; 1 directory city name; 1
real/artificial word)
- 4 isolated words
- 9 phonetically rich sentences (read)

The speech files are stored as sequences of 8-bit, 8 kHz A-law samples
and are not compressed. Each prompted utterance is stored in a separate
file and has an accompanying ASCII SAM label file.
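
As an illustration of the storage format described above, the following
Python sketch expands 8-bit A-law codes to 16-bit linear samples using the
standard G.711 expansion, and reads a label file into a dictionary. It is
not part of the database distribution. It assumes that a speech file is a
headerless stream of one A-law byte per sample, and that a label file
consists of "MNEMONIC: value" lines; both points should be checked against
the SpeechDat documentation. All function names are hypothetical.

```python
from typing import Dict, List

SAMPLE_RATE_HZ = 8000  # from the announcement: 8 kHz, 8 bits per sample


def alaw_to_linear(code: int) -> int:
    """Expand one 8-bit G.711 A-law code to a signed 16-bit linear sample."""
    code ^= 0x55                       # undo the even-bit inversion of G.711
    mantissa = (code & 0x0F) << 4      # 4-bit mantissa, shifted into place
    segment = (code & 0x70) >> 4       # 3-bit segment (exponent)
    if segment == 0:
        magnitude = mantissa + 8
    else:
        magnitude = (mantissa + 0x108) << (segment - 1)
    # After the inversion, a set top bit marks a positive sample.
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(data: bytes) -> List[int]:
    """Decode a raw A-law byte stream (assumed headerless) to linear samples."""
    return [alaw_to_linear(b) for b in data]


def duration_seconds(data: bytes) -> float:
    """One byte per sample at 8 kHz, so duration is byte count / 8000."""
    return len(data) / SAMPLE_RATE_HZ


def parse_label_text(text: str) -> Dict[str, str]:
    """Read 'MNEMONIC: value' lines into a dict (assumed label layout).

    If a mnemonic occurs more than once, the last value wins.
    """
    fields: Dict[str, str] = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip():
            fields[key.strip()] = value.strip()
    return fields
```

With these helpers, a file read with open(path, "rb").read() can be passed
to decode_alaw() to obtain samples suitable for writing to a 16-bit WAV
file, and duration_seconds() gives the length of the utterance.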

=====================================
For further information, please contact:

ELRA/ELDA
55-57 rue Brillat-Savarin
F-75013 Paris, France
Tel: +33 01 43 13 33 33
Fax: +33 01 43 13 33 30
E-mail: mapelli@elda.fr

or visit our Web site:

http://www.icp.grenet.fr/ELRA/home.html
=====================================