Re: [Corpora-List] token clustering tool

From: Steven Bird (sb@cs.mu.oz.au)
Date: Wed May 12 2004 - 03:07:58 MET DST


    On Wed, 2004-05-12 at 09:07, Normand Peladeau wrote:
    > At 2004-05-11 03:24, you wrote:
    > > Dear all,
    > >
    > > Does anyone know of a tool (or algorithm), preferably available
    > > freely for research purposes, that takes as its input a corpus only
    > > and produces as its output clusters of tokens that occur close to
    > > each other relatively often?
    >
    > I created such a tool, but it is a commercial product...

    It's easy to write a program to do this using NLTK, and it's free.

    Please see: http://nltk.sourceforge.net/
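
To give a rough idea of what such a program might look like, here is a minimal sketch in plain Python (standard library only, so it does not use or represent NLTK's own API). It assumes one possible reading of the request: count token pairs that fall within a small window, keep pairs that are both frequent and strongly associated by pointwise mutual information, and merge linked tokens into clusters. The function name, the window size, and both thresholds are illustrative assumptions, not anything taken from NLTK.

```python
import math
from collections import Counter, defaultdict


def cooccurrence_clusters(tokens, window=2, min_count=2, min_pmi=0.0):
    """Group tokens that occur within `window` positions of each other
    relatively often (hypothetical helper, not part of NLTK)."""
    unigram = Counter(tokens)
    pair = Counter()
    # Count unordered pairs of distinct tokens inside the window.
    for i, tok in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != tok:
                pair[tuple(sorted((tok, other)))] += 1

    total = len(tokens)
    total_pairs = sum(pair.values())
    parent = {}  # union-find forest over tokens

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    # Link two tokens when the pair is frequent enough and its PMI
    # exceeds the threshold.
    for (a, b), n in pair.items():
        if n < min_count:
            continue
        p_pair = n / total_pairs
        p_a, p_b = unigram[a] / total, unigram[b] / total
        if math.log2(p_pair / (p_a * p_b)) > min_pmi:
            parent[find(a)] = find(b)

    clusters = defaultdict(set)
    for tok in list(parent):
        clusters[find(tok)].add(tok)
    return sorted(sorted(c) for c in clusters.values() if len(c) > 1)


text = ("new york is big . new york is old . "
        "san francisco is foggy . san francisco is hilly .")
clusters = cooccurrence_clusters(text.split(), window=1, min_count=2, min_pmi=3.0)
# clusters == [['francisco', 'san'], ['new', 'york']]
```

On a real corpus the thresholds would need tuning, and merging by linked pairs can chain unrelated tokens together through a frequent word, so a stricter clustering step may be preferable.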

    -Steven Bird


