Re: Corpora: Broadcast corpus

From: Christopher Cieri (ccieri@ldc.upenn.edu)
Date: Mon Jan 17 2000 - 19:00:23 MET

  • Next message: David Graff: "Re: Corpora: Broadcast corpus"

    Professor Chandrasekar,

    Thanks for your post. Someone from LDC did write to Professor Maucec
    directly. We probably should have copied the whole list since the
    information may be of interest to others. We will do that presently.

    Thanks and best wishes,
    Chris

    Raman Chandrasekar wrote:

    > LDC does have transcribed broadcast news. See
    > http://morph.ldc.upenn.edu/Catalog/by_type.html under the heading
    > Broadcast text . You'll see the following:
    >

                                  Broadcast text
                                       [text]
               LDC98T31 1996 CSR Hub-4 Language Model
               LDC97T22 1996 English Broadcast News Transcripts (Hub-4)
               LDC98T28 1997 English Broadcast News Transcripts (Hub-4)
               LDC98T24 1997 Mandarin Broadcast News Transcripts (Hub-4NE)
               LDC98T29 1997 Spanish Broadcast News Transcripts (Hub-4NE)
               LDC99T36 USC Marketplace Broadcast News Transcripts
    > However, access to these collections may require you to be a member.
    > I'm cc'ing LDC on this, hopefully they'll get back to you
    > directly.Regards, -- Raman Chandrasekar
    >
    > -----Original Message-----
    > From: Mirjam Sepesy Maucec [mailto:mirjam.sepesy@uni-mb.si]
    > Sent: Sunday, January 16, 2000 10:41 PM
    > To: corpora@hd.uib.no
    > Subject: Corpora: Broadcast corpus
    >
    > Hi,
    >
    > my research topic is domain based adaptation of language
    > model. For my work I hardly need a text corpus
    > with topic tags.
    > Broadcast corpus seems to be appropriate. Where can I get
    > it? I don't find it in LDC catalog. I also write 2
    > e-mails to Primary Source Media to get some information and
    > I got no answer.
    > Please, help!
    >
    > Mirjam
    >
    > --
    > _____________________________________________________________
    >
    > Mirjam Sepesy Maucec
    > Faculty of Electrical Engineering and Computer Science
    > University of Maribor
    > Smetanova 17
    > 2000 MARIBOR
    > tel: ++386 (062) 220 7225
    > e-mail: mirjam.sepesy@uni-mb.si
    >
    >
    >

    --
    Christopher Cieri
    Executive Director, Linguistic Data Consortium
    3615 Market Street, Philadelphia, PA 19104-2608 USA
    phone: 215-573-5489, fax: 215-573-2175
    mailto:Christopher.Cieri@ldc.upenn.edu
    http://www.ldc.upenn.edu
    



    This archive was generated by hypermail 2b29 : Mon Jan 17 2000 - 18:56:09 MET