Corpora: ELRA News - New speech resources 1/2

Valerie Mapelli (info-elra@calva.net)
Mon, 6 Apr 1998 13:50:02 +0200 (MET DST)

[ We apologise for the duplicate posting of this announcement ]

EUROPEAN LANGUAGE RESOURCES ASSOCIATION
ELRA News
=====================================

*** NEW SPEECH RESOURCES - Part 1 ***

The ELRA catalogue is growing up. Since our last news on this electronic list,
the following resources appeared in our catalogue.

****************************
* ELRA-S0046 PolyVar *
****************************

PolyVar is a speaker verification database comprising native and
non-native speakers of French, mainly from Switzerland but also
from other European countries. It consists of read and spontaneous
speech recorded by 143 speakers (85 male and 58 female) amounting
to 160 hours of speech. Each speaker recorded from 1 to 229 sessions,
giving a total of 3,600 recorded sessions. The data are provided with
orthographic annotation.

The number of calls per speaker is as follows:
· 13 speakers called 100 times
· 9 speakers called from 51 to 100 times
· 16 speakers called from 21 to 50 times
· 3 speakers called from 11 to 20 times
· 31 speakers called from 2 to 10 times
· 71 speakers called only once

Each speaker uttered up to 53 different items per session, including:
· 3 sequences of digits (1 ID number, 1 credit card number and 1 sequence
of 6 digits)
· 24 application words (17 words about tourism – Martigny)
· 10 read sentences
· 4 numbers (2 natural numbers, 2 amounts)
· 2 items with dates (1 read/1 spontaneous)
· 2 items with hours (1 read/1 spontaneous)
· 2 spelled words
· 3 spontaneous answers (questions about their gender, native language and
the weather)
· 1 comment
· 1 telephone enquiry

File format: 8-bit a-law
Standard in use: NIST
Sampling rate: 8 kHz
Medium: 8 CD-ROMs

Price for ELRA members:
for research use: 1,000 ECU
for commercial use: 2,000 ECU

Price for non-members:
for commercial use: 4,000 ECU
for research use: 2,000 ECU

**************************************************************
* ELRA-S0047 SpeechDat Speaker Verification database *
**************************************************************

This subset of PolyVar consists of 20 speakers which recorded 50 sessions.
The format in use is a-law with SAM headers.

Medium: 3 CD-ROMs

Price for ELRA members:
for research use: 750 ECU
for commercial use: 1500 ECU

Price for non members:
for research use: 1500 ECU
for commercial use: 3000 ECU

********************************************
For more information, please contact:
ELRA/ELDA
55-57 rue Brillat Savarin
75013 PARIS
Tel: +33 1 43 13 33 33
Fax: +33 1 43 13 33 30
E-mail: info-elra@calva.net
http://www.icp.grenet.fr/ELRA/home.html
********************************************