Re: [Corpora-List] English-language paraphrase corpora

From: David Evans (devans@cs.columbia.edu)
Date: Tue Feb 01 2005 - 16:20:56 MET

    We have a system at Columbia that crawls the web and clusters documents
    into related sets:

    http://newsblaster.cs.columbia.edu/

    It has archives going back to 2001 or so.

    Dave

    Olga Shaumyan wrote:

    > Dear All,
    >
    > I am looking for English-language "comparable" corpora, i.e. I want,
    > e.g., 2 collections of articles from different sources describing the
    > same events.
    >
    > Alternatively, would anyone know off-hand how one would go about
    > constructing such comparable collections?
    >
    > (This is to be used for automatic paraphrasing.)
    >
    > Any pointers greatly appreciated,
    >
    > Olga
    > University of Sussex NLP group


