Re: [Corpora-List] English-language paraphrase corpora

From: radev@umich.edu
Date: Tue Feb 01 2005 - 14:39:10 MET

  • Next message: Paula Newman: "RE: [Corpora-List] English-language paraphrase corpora"

    Our system, a precursor to Google News is also active on the Web:

    www.newsinessence.com

    Using it, we have collected 50,000 or so clusters of related news.

    --
    Drago
    

    nielsen@dcs.kcl.ac.uk wrote: > > > If you don't mind collecting raw text, news.google.com does this. > > Leif > > > > > Dear All, > > > > I am looking for English-language "comparable" corpora. I.e. I want, > > e.g., 2 collections of articles from different sources describing same > > events. > > > > Alternatively, would anyone know off-hand how one would go about > > constructing such comparable collections? > > > > (This is to be used for automatic paraphrasing.) > > > > Any pointers greatly appreciated, > > > > Olga > > University of Sussex NLP group > > > > > > > > > > > > > > > > > > >

    -- Dragomir R. Radev radev@umich.edu Assistant Professor of Information, Electrical Engineering and Computer Science, and Linguistics, the University of Michigan, Ann Arbor Phone: 734-615-5225 Fax: 734-764-2475 http://www.si.umich.edu/~radev



    This archive was generated by hypermail 2b29 : Tue Feb 01 2005 - 14:40:31 MET