Re: [Corpora-List] token clustering tool

From: Steven Bird (sb@cs.mu.oz.au)
Date: Wed May 12 2004 - 03:07:58 MET DST

Next message: Clive De Silva: "[Corpora-List] IDF values"

Previous message: Normand Peladeau: "Re: [Corpora-List] token clustering tool"
Maybe in reply to: Murk Wuite: "[Corpora-List] token clustering tool"
Next in thread: Alan M Wallington: "[Corpora-List] wanted: corpora of student-teacher interactions"
Next in thread: Jose Maria Gomez Hidalgo: "Re: [Corpora-List] token clustering tool"
Reply: Alan M Wallington: "[Corpora-List] wanted: corpora of student-teacher interactions"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Wed, 2004-05-12 at 09:07, Normand Peladeau wrote:
> At 2004-05-11 03:24, you wrote:
> > Dear all,
> >
> > Does anyone know of a tool (or algorithm), preferably available
> > freely
> > for research purposes, that takes as its input a corpus only and
> > produces as its output clusters of tokens that occur close to each
> > other
> > relatively often?
>
> I created such a software but it is a commercial product...

Its easy to write a program to do this using NLTK, and its free.

Please see: http://nltk.sourceforge.net/

-Steven Bird

Next message: Clive De Silva: "[Corpora-List] IDF values"
Previous message: Normand Peladeau: "Re: [Corpora-List] token clustering tool"
Maybe in reply to: Murk Wuite: "[Corpora-List] token clustering tool"
Next in thread: Alan M Wallington: "[Corpora-List] wanted: corpora of student-teacher interactions"
Next in thread: Jose Maria Gomez Hidalgo: "Re: [Corpora-List] token clustering tool"
Reply: Alan M Wallington: "[Corpora-List] wanted: corpora of student-teacher interactions"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Wed May 12 2004 - 03:59:06 MET DST