Re: Corpora: e-mail corpus

From: Bouma G. (gosse@let.rug.nl)
Date: Thu Aug 23 2001 - 16:43:18 MET DST

  • Next message: Stefan.Wermter: "Corpora: EmerNet book: Emergent Neural Computational Architectures"

    Paul,

    > Is there any email corpus available
    > for research groups somewhere?

    The lingspam corpus consists of messages posted to the
    linguist list and spam messages. It has been used as a
    test bench for spam filtering.

    There is a link to it from the publications page of Ion
    Androutsopoulos
    www.http://www.iit.demokritos.gr/~ionandr/publications.htm

    Depending on what you have in mind, this might be useful,

    best,

    Gosse.

    -- 
    Gosse Bouma, Alfa-informatica, RUG, Postbus 716, 9700 AS Groningen
    gosse@let.rug.nl      tel. +31-50-3635937      fax  +31-50-3636855
    



    This archive was generated by hypermail 2b29 : Thu Aug 23 2001 - 16:37:55 MET DST