AW: [Corpora-List] Query on Linking Text & Sound Files

From: Thomas Schmidt (thomas.schmidt@uni-hamburg.de)
Date: Mon Oct 21 2002 - 10:02:52 MET DST

  • Next message: Martin Wynne: "RE: [Corpora-List] 19th century electronic corpora"

    Dear Rita,

    we use TableTrans from the AG Toolkit (http://agtk.sourceforge.net/) to chop
    up sound files into conveniently sized pieces and our own EXMARaLDA-Software
    (http://www.rrz.uni-hamburg.de/exmaralda) to align the sound files with the
    transcript and output HTML for online delivery. (Check the examples on
    (German version of) the webpage to see if this is what you are looking for.)

    With kind regards,

            Thomas

    ---------------------------------------
    Thomas Schmidt
    SFB 538 'Mehrsprachigkeit' Teilprojekt Z
    Tel: ++ 49 (040) 42838-6425
    Fax: ++ 49 (040) 42838-6116
    http://www.rrz.uni-hamburg.de/exmaralda
    http://www.rrz.uni-hamburg.de/SFB538/
    ---------------------------------------

    > -----Ursprungliche Nachricht-----
    > Von: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no]Im
    > Auftrag von Rita Carol Simpson
    > Gesendet: Donnerstag, 17. Oktober 2002 16:40
    > An: corpora@hd.uib.no
    > Betreff: [Corpora-List] Query on Linking Text & Sound Files
    >
    >
    > Dear Colleagues,
    >
    > Does anyone have first-hand experience with any tools or techniques for
    > linking existing text transcripts of speech to sound files?
    >
    > Specifically, the MICASE project team is looking for input on how to go
    > about linking the sound files (currently in mp3 format, but convertable
    > to other formats) with the text transcripts, in relatively small
    > increments, and deliver the sound files online along with the transcripts
    > (which are already available online, via the website listed below).
    >
    > Ideally we would like to link the transcripts to the sound files in such a
    > way that from any point in the transcript you could click on a sound file
    > link and get to that portion of the transcript -- to the nearest, say,
    > 30-second or 1-minute increment. I realize the labor to do this for nearly
    > 200 hours of naturally-recorded speech may be significant, but we are
    > prepared to hire a number of research assistants to do as much of the
    > markup, chunking, & aligning tasks as possible.
    >
    > If you know of & have used any kind of software that will simplify or
    > streamline this process at all, or have suggestions about how we might go
    > about it, I would very much appreciate hearing from you.
    > (I do have a copy of the CSAE project's SoundWriter program, but because
    > I have not been able to find any documentation on it, I cannot properly
    > evaluate it; I suspect, however, that it's not exactly suited to our
    > goals -- specifically the eventual web-delivery aspect. If you've
    > successfully used this particular program, I would be interested in
    > finding out more about it.)
    >
    > I will be happy to post a summary to the list of any replies I get.
    >
    > Thank you in advance,
    >
    > Rita Simpson
    > _________________________________________________________________________
    >
    > Rita Simpson, PhD.
    > Project Director, Michigan Corpus of Academic Spoken English
    > English Language Institute, University of Michigan TEL: 734-763-7133
    > www.lsa.umich.edu/eli/micase/micase.htm www.hti.umich.edu/m/micase/
    > _________________________________________________________________________
    >
    >
    >
    >
    >



    This archive was generated by hypermail 2b29 : Mon Oct 21 2002 - 10:20:42 MET DST