RE: [Corpora-List] English-language paraphrase corpora

From: Paula Newman (paulan@earthlink.net)
Date: Tue Feb 01 2005 - 11:50:54 MET

  • Next message: David Evans: "Re: [Corpora-List] English-language paraphrase corpora"

    Olga,
    The website for the DUC (document understanding conference)
    run by US NIST contains clusters of relatively short articles
    on the same topics. http://www-nlpir.nist.gov/projects/duc/data.html
    Accessing the data requires obtaining some permissions, described
    on that web page.
    Paula

    > [Original Message]
    > From: Olga Shaumyan <olgas@sussex.ac.uk>
    > To: <corpora@uib.no>
    > Date: 2/1/2005 3:41:26 AM
    > Subject: [Corpora-List] English-language paraphrase corpora
    >
    >
    > Dear All,
    >
    > I am looking for English-language "comparable" corpora. I.e. I want,
    > e.g., 2 collections of articles from different sources describing same
    events.
    >
    > Alternatively, would anyone know off-hand how one would go about
    > constructing such comparable collections?
    >
    > (This is to be used for automatic paraphrasing.)
    >
    > Any pointers greatly appreciated,
    >
    > Olga
    > University of Sussex NLP group
    >
    >
    >
    >
    >



    This archive was generated by hypermail 2b29 : Tue Feb 01 2005 - 16:08:50 MET