[Corpora-List] a new dialogue data corpus: Dialogue Diversity Corpus -- (DDC)

From: William Mann (bill_mann@sil.org)
Date: Thu Oct 10 2002 - 16:30:41 MET DST

  • Next message: Diego Molla: "[Corpora-List] Call for Participation -- ANLP2002"

      
    Announcement

    DIALOGUE DIVERSITY CORPUS

    http://www-rcf.usc.edu/~billmann/diversity

    (apologies if you receive multiple copies)

    A new corpus is available for facilitating research on human dialogue.

    The Dialogue Diversity Corpus (DDC) gives direct access to a set of dialogue transcripts (13 sources, more than 12 hours of dialogue, all in English.). It also gives a set of links and methods for accessing hundreds of additional dialogues (principally in English.) Several sources provide speech data as well as transcripts.

    The dialogues in this corpus occurred in a very diverse collection of interactive situations. Thus it is a data resource for studies of the breadth of coverage of particular dialogue models, and for studies that compare dialogue from different situations.

    For smaller projects such as pilot studies, program testing and even some term papers, the direct access portion will be sufficient. The access methods may yield enough dialogue data for some much larger studies.

    The corpus is designed for data finding rather than for bulk processing. Taken as a whole, it is irregular and not homogeneous in any way. It is generally unsuitable for drawing any conclusions about dialogue taken as a single category.

    ===============
    William C. Mann
     
    bill_mann@sil.org



    This archive was generated by hypermail 2b29 : Thu Oct 10 2002 - 16:42:17 MET DST