Corpora: ELRA News 1/2 (plain text)

From: Magali Duclaux (duclaux@elda.fr)
Date: Mon Apr 30 2001 - 16:10:51 MET DST

    [ We apologise for the duplicate posting of this announcement ]

    *********************************************************************************************************************************************************
    ELRA
    European Language Resources Association
    ELRA News
    *********************************************************************************************************************************************************
    We are happy to announce new resources available via ELRA in the framework
    of the Aurora project.

      *** AURORA PROJECT DATABASE ***

    [ELRA is releasing 5 databases made within the ETSI STQ-AURORA DSR working
    group.]

    AURORA PROJECT DATABASE 2.0

    The Aurora project is releasing a revised version of the Noisy TI digits
    database to follow on the work of ETSI. This CD set is a replacement for
    the previous set (version 1.0 consisted of 2 CDs, while version 2.0
    consists of 4 CDs).
    This database is intended for the evaluation of front-end feature
    extraction algorithms in background noise, but may also be used more
    widely by speech researchers to evaluate and compare the performance
    of noise-robust speech recognition algorithms.

    Compared to version 1.0 the changes are as follows:

    1) The files are restored to the energy level of the original speech
    in the TI digits database.
    2) One of the noise types added to the speech (babble) has been
    changed.
    3) There is an additional test set in which the noises are mismatched
    to those used in the training set.
    4) There is a convolutional distortion test.
    5) There is a clean training set.

    The CD-ROM will be used for the next round of ETSI Aurora standards
    evaluation.

    AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR FINNISH DATABASE

    This database is a subset of the Finnish SpeechDat-Car database, which
    was collected as part of the European Union funded SpeechDat-Car
    project. It contains isolated and connected Finnish digits spoken in the
    following driving conditions inside a car:

    1. 0 km/h with the car engine on
    2. 40-60 km/h with the car windows closed
    3. 40-60 km/h with the car windows open
    4. 100-120 km/h with no music in the background
    5. 100-120 km/h with music in the background

    The database also contains the software needed to run simulations
    using Entropic's HTK, which has been adopted as the "standard"
    HMM recogniser for the Aurora standard evaluation.

    AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR DANISH DATABASE

    The Aurora project was originally set up to establish a worldwide standard
    for the feature extraction software which forms the core of the front-end
    of a DSR (Distributed Speech Recognition) system. ETSI formally adopted
    this activity as work items 007 and 008. The two work items within ETSI
    are:
    - ETSI DES/STQ WI007: Distributed Speech Recognition - Front-End
      Feature Extraction Algorithm & Compression Algorithm
    - ETSI DES/STQ WI008: Distributed Speech Recognition - Advanced
      Feature Extraction Algorithm

    This database is a subset of the Danish SpeechDat-Car database, which
    was collected as part of the European Union funded SpeechDat-Car
    project. It contains isolated and connected Danish digits spoken in the
    following noise and driving conditions inside a car:

    1. High speed, good road
    2. Low speed, rough road
    3. Stopped with motor running
    4. Town traffic

    AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR GERMAN DATABASE

    This database is a subset of the German SpeechDat-Car database, which
    was collected as part of the European Union funded SpeechDat-Car
    project. It contains isolated and connected German digits spoken in the
    following noise and driving conditions inside a car:

    1. High speed, good road
    2. Low speed, rough road
    3. Stopped with motor running
    4. Town traffic

    AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR SPANISH DATABASE

    This database is a subset of the Spanish SpeechDat-Car database, which
    was collected as part of the European Union funded SpeechDat-Car
    project. It contains isolated and connected Spanish digits spoken in the
    following noise and driving conditions inside a car:

    1. Quiet environment: stopped with motor running.
    2. Low noise: town traffic and low speed, rough road.
    3. High noise: high speed, good road.

    =====================================
    For further information, please contact:
    ELRA/ELDA
    55-57 rue Brillat-Savarin
    F-75013 Paris, France
    Tel: +33 01 43 13 33 33
    Fax: +33 01 43 13 33 30
    E-mail: mapelli@elda.fr
    or visit the online catalogue on our Web site:
    http://www.icp.grenet.fr/ELRA/home.html
    or http://www.elda.fr
    =====================================



    This archive was generated by hypermail 2b29 : Mon Apr 30 2001 - 16:11:43 MET DST