Re: [Corpora-List] are there corpora of fast speech?

From: Ute Römer (ute.roemer@uni-koeln.de)
Date: Wed Jan 15 2003 - 09:09:36 MET

    Dear Dinoj and others,

    The Bergen Corpus of London Teenage Language (COLT), which Eric Atwell
    mentioned, comes with both transcripts and MP3 files (55 hours of
    spontaneous conversation on CDs -- I don't know how many words per minute
    you get). There is also a sound-text alignment, so you can search the
    corpus and follow hyperlinks to the sound files. The texts are
    orthographically transcribed (and word-class tagged); I doubt that you
    will find many phonemically (or even phonetically) transcribed corpora,
    if any at all -- who would want to phonetically transcribe 500,000 words
    or more?
    For more information on COLT see http://www.hit.uib.no/colt/

    Good luck with your research!
    Best wishes... Ute

    _______________________

    Ute Römer
    English Department
    University of Cologne
    Albertus-Magnus-Platz 1
    50923 Köln
    Germany

    Phone: 0049 (0)221 470 3038
    Email: ute.roemer@uni-koeln.de
    _______________________

    ----- Original Message -----
    From: "Dinoj Surendran" <dinoj@cs.uchicago.edu>
    To: <CORPORA@HIT.UIB.NO>
    Sent: Tuesday, January 14, 2003 7:43 PM
    Subject: [Corpora-List] are there corpora of fast speech?

    > Dear list members,
    >
    > Does anyone know if there is an (at least) phonetically transcribed corpus
    > of fast English speech? A corpus of spontaneous speech known to have
    > several fast speakers could also work. And while I would prefer to have
    > both the sound files and the transcription files, the latter alone would
    > still be of use.
    >
    > Thanks,
    >
    > Dinoj Surendran
    > Graduate Student
    > Computer Science Dept
    > University of Chicago


