Corpora: Summary: Foreign stop words

From: Alex Franz (alex@google.com)
Date: Wed Aug 16 2000 - 01:32:24 MET DST

    Thanks to everyone who responded to my query about non-English
    stopwords, including:

    Mike Scott <lexical@netcomuk.co.uk>
    Stefan Thomas Gries <StThGries@t-online.de>
    Jean Veronis <Jean.Veronis@newsup.univ-mrs.fr>
    Robert J. Kuhns <kuhns@world.std.com>
    Tim Buckwalter <tbuckwalter@tegic.com>

    I received the following information:

    - You can use Wordsmith (www.netcomuk.co.uk/~lexical/)
      to generate frequency lists.

    - Mike Barlow's site at http://www.ruf.rice.edu/~barlow/corpus.html
      has lists for German, French, and English.

    - Jean Veronis has a list for French on his homepage
      http://www.up.univ-mrs.fr/~veronis/

    - There are French stop word lists at

      http://www.comp.lancs.ac.uk/computing/research/ucrel/public/1485.html
      http://www.loria.fr/~bonhomme/sw/
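
    If none of the ready-made lists fits, a frequency list built from
    your own corpus can serve as a starting point for stop word
    candidates. The following is only a rough sketch in Python, not
    how Wordsmith or any of the sites above do it: the function name
    stopword_candidates, the top_n cutoff, and the sample sentence are
    all made up for illustration, and the output would still need
    manual review.

```python
import re
from collections import Counter


def stopword_candidates(text, top_n=10):
    """Return the top_n most frequent word forms in text (hypothetical helper)."""
    # [^\W\d_]+ matches runs of letters only, including accented ones,
    # so digits and underscores are not counted as words.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(tokens)
    # most_common() orders word forms by descending frequency.
    return [word for word, _ in counts.most_common(top_n)]


# Toy input; a real list needs a corpus of realistic size.
sample = "le chat et le chien et le lapin et le chat"
print(stopword_candidates(sample, top_n=2))
```

    High frequency alone is a crude criterion, so content words that
    happen to dominate a small or topical corpus will show up too.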

    --Alex


