Re: [Corpora-List] Free n-gram Software Released

From: Tony Berber Sardinha (tony4@uol.com.br)
Date: Tue Oct 15 2002 - 13:45:58 MET DST

    Thanks to all who responded, and to Bill for putting up a new website and for
    emailing me the software.

    cheers
    tony.
    -------------------------------------
    Dr Tony Berber Sardinha
    LAEL, PUC/SP
    (Catholic University of Sao Paulo, Brazil)
    tony4@uol.com.br
    http://lael.pucsp.br/~tony
    [New website]

    ----- Original Message -----
    From: "William H. Fletcher" <fletcher@usna.edu>
    To: "Tony Berber Sardinha" <tony4@uol.com.br>; <CORPORA@hd.uib.no>
    Sent: Monday, October 14, 2002 09:40
    Subject: Re: [Corpora-List] Free n-gram Software Released

    > Tony (and others with the same problem),
    >
    > Several people from the list have given me useful feedback, which means they
    > have been able to access it. My website can be slow (I'm looking for
    > another provider), so I keep a current version at
    > http://www.chesapeake.net/~fletcher/kfNgramHelp.html as well.
    >
    > Please don't hesitate to suggest additional features you might like.
    >
    > Regards,
    > Bill
    >
    > PS I've had problems accessing your site and sending you e-mail directly.
    >
    >
    > ----- Original Message -----
    > From: "Tony Berber Sardinha" <tony4@uol.com.br>
    > To: "William H. Fletcher" <fletcher@usna.edu>; "corpora list - messages to
    > list" <CORPORA@hd.uib.no>
    > Sent: Saturday, October 12, 2002 6:37 AM
    > Subject: Re: [Corpora-List] Free n-gram Software Released
    >
    >
    > > Dear list members
    > >
    > > Has anyone managed to download this software? I've been trying since the
    > > message was posted, at different times of the day, without success. The
    > > page won't finish loading.
    > >
    > > cheers
    > > tony.
    > > -------------------------------------
    > > Dr Tony Berber Sardinha
    > > LAEL, PUC/SP
    > > (Catholic University of Sao Paulo, Brazil)
    > > tony4@uol.com.br
    > > http://lael.pucsp.br/~tony
    > > [New website]
    > >
    > > ----- Original Message -----
    > > From: "William H. Fletcher" <fletcher@usna.edu>
    > > To: < >
    > > Sent: Monday, September 30, 2002 18:39
    > > Subject: [Corpora-List] Free n-gram Software Released
    > >
    > >
    > > > The recent flurry of discussion on n-gram software inspired me to
    > > > revisit a project from last year. I reprogrammed kfNgram using aspects
    > > > of the "suffix array" approach described by Mikio Yamamoto and Kenneth
    > > > W. Church and further developed by Chunyu Kit and Yorick Wilks. The
    > > > result was a quantum leap in performance which makes it useful even for
    > > > large corpora. (It indexes the 25 million word CETENFolha corpus
    > > > announced here last week in about 10 minutes on my Pentium III machine
    > > > with 800 MHz processor and 256 MB RAM, then cranks out n-gram files in
    > > > under a minute.)
    > > >
    > > > kfNgram supports user-defined character sets and sort orders, and its
    > > > GUI (graphical user interface) makes it accessible even to casual users.
    > > >
    > > > This free Windows program is available at
    > > > http://miniappolis.com/KWiCFinder/kfNgramHelp.html
    > > > Suggestions and comments on its usability and performance will be
    > > > greatly appreciated.
    > > >
    > > > Bill Fletcher
    > > >
    > > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    > > >
    > > > William H. Fletcher 410.293.6362 [voice]
    > > > Associate Professor, German & Spanish 410.293.2729 [fax]
    > > > Language Studies Department
    > > > US Naval Academy
    > > > 589 McNair Road
    > > > Annapolis, MD 21402 - 5030
    > > >
    > > > fletcher@usna.edu
    > > > http://www.usna.edu/LangStudy/
    > > > http://kwicfinder.com/
    > > > http://miniappolis.com/
    > > >
    > > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    > > >
    > > >
    > > >
    > >
    > >
    >
    >
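
    For readers unfamiliar with the "suffix array" idea mentioned in the
    original announcement, here is a minimal Python sketch. It is not kfNgram's
    code and makes no claim about how kfNgram works internally. The function
    name ngram_counts and the sample sentence are made up for illustration. It
    also simplifies the technique: positions are sorted by their first n tokens
    only, where a full suffix array would sort by the whole suffix.

```python
from itertools import groupby


def ngram_counts(tokens, n):
    """Count word n-grams by sorting start positions (a naive, truncated suffix array).

    Illustrative only: a hypothetical helper, not kfNgram's implementation.
    """
    # Keep only the positions that are followed by a complete n-gram.
    starts = range(len(tokens) - n + 1)

    # Sorting positions by the n tokens that begin there places
    # identical n-grams next to each other in the array.
    suffix_array = sorted(starts, key=lambda i: tokens[i:i + n])

    # One pass over the sorted positions counts each run of equal n-grams.
    counts = {}
    for gram, run in groupby(suffix_array, key=lambda i: tuple(tokens[i:i + n])):
        counts[gram] = sum(1 for _ in run)
    return counts


if __name__ == "__main__":
    sample = "to be or not to be".split()
    for gram, freq in ngram_counts(sample, 2).items():
        print(" ".join(gram), freq)
```

    The point of the sort is that every repeated n-gram ends up in one
    contiguous run, so a single pass yields all frequencies. A real
    implementation for large corpora would need a faster construction than
    this slice-based sort.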



    This archive was generated by hypermail 2b29 : Tue Oct 15 2002 - 14:11:08 MET DST