Re: Corpora: Corpus of business emails?

From: Christopher Cieri (ccieri@ldc.upenn.edu)
Date: Tue Feb 01 2000 - 08:26:21 MET

    David,

    This isn't exactly what you requested, but the LDC distributes a
    Voicemail Corpus containing 1801 business messages that average 30
    seconds in duration. Both the audio and transcripts are available. If
    you are interested, please see:
            http://www.ldc.upenn.edu/Catalog/LDC98S77.html
    for more information.

    Good luck,
    Chris

    David Meyer wrote:
    >
    > Dear list members,
    >
    > Is anyone aware of the availability of a corpus of emails, especially
    > those having a business context? The use would be for the analysis of the
    > contents of the body for purposes of automated summarization and
    > information extraction. No information contained in the emails would be
    > released.
    > Thanks in advance.
    > Sincerely,
    >
    > David Meyer
    > Inxight Software


