Re: [Corpora-List] Free n-gram Software Released

From: William H. Fletcher (fletcher@usna.edu)
Date: Mon Oct 14 2002 - 14:40:51 MET DST


    Tony (and others with the same problem),

    Several people from the list have given me useful feedback, which means they
    have been able to access the page. My website can be slow (I'm looking for
    another provider), so I keep a current version at
    http://www.chesapeake.net/~fletcher/kfNgramHelp.html as well.

    Please don't hesitate to suggest additional features you might like.

    Regards,
    Bill

    PS I've had problems accessing your site and sending you e-mail directly.

    ----- Original Message -----
    From: "Tony Berber Sardinha" <tony4@uol.com.br>
    To: "William H. Fletcher" <fletcher@usna.edu>; "corpora list - messages to
    list" <CORPORA@hd.uib.no>
    Sent: Saturday, October 12, 2002 6:37 AM
    Subject: Re: [Corpora-List] Free n-gram Software Released

    > Dear list members
    >
    > Has anyone managed to download this software? I've been trying since the
    > message was posted, at different times of the day, without success. The
    > page won't finish loading.
    >
    > cheers
    > tony.
    > -------------------------------------
    > Dr Tony Berber Sardinha
    > LAEL, PUC/SP
    > (Catholic University of Sao Paulo, Brazil)
    > tony4@uol.com.br
    > http://lael.pucsp.br/~tony
    > [New website]
    >
    > ----- Original Message -----
    > From: "William H. Fletcher" <fletcher@usna.edu>
    > To: < >
    > Sent: Monday, September 30, 2002 6:39 PM
    > Subject: [Corpora-List] Free n-gram Software Released
    >
    >
    > > The recent flurry of discussion on n-gram software inspired me to revisit
    > > a project from last year. I reprogrammed kfNgram using aspects of the
    > > "suffix array" approach described by Mikio Yamamoto and Kenneth W. Church
    > > and further developed by Chunyu Kit and Yorick Wilks. The result was a
    > > quantum leap in performance which makes it useful even for large corpora.
    > > (It indexes the 25 million word CETENFolha corpus announced here last week
    > > in about 10 minutes on my Pentium III machine with 800 MHz processor and
    > > 256 MB RAM, then cranks out n-gram files in under a minute.)
    > >
    > > kfNgram supports user-defined character sets and sort orders, and its GUI
    > > (graphical user interface) makes it accessible even to casual users.
    > >
    > > This free Windows program is available at
    > > http://miniappolis.com/KWiCFinder/kfNgramHelp.html
    > > Suggestions and comments on its usability and performance will be greatly
    > > appreciated.
    > >
    > > Bill Fletcher
    > >
    > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    > >
    > > William H. Fletcher 410.293.6362 [voice]
    > > Associate Professor, German & Spanish 410.293.2729 [fax]
    > > Language Studies Department
    > > US Naval Academy
    > > 589 McNair Road
    > > Annapolis, MD 21402-5030
    > >
    > > fletcher@usna.edu
    > > http://www.usna.edu/LangStudy/
    > > http://kwicfinder.com/
    > > http://miniappolis.com/
    > >
    > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    > >
    > >
    > >
    >
    >
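
    The "suffix array" idea mentioned in the original announcement can be
    sketched roughly as follows. This is only an illustration in Python, not
    kfNgram's actual code: the function names build_suffix_array and
    ngram_counts are hypothetical, the tokenizer is a plain whitespace split,
    and the naive sort used here would not scale to a corpus of millions of
    words. The point it shows is that once all suffix positions are sorted,
    identical n-grams sit next to each other, so counts for any n can be read
    off from the same index.

```python
from itertools import groupby


def build_suffix_array(tokens):
    """Return token positions sorted by the suffix that starts at each one."""
    # Naive construction for illustration only (assumption: small input).
    return sorted(range(len(tokens)), key=lambda i: tokens[i:])


def ngram_counts(tokens, suffix_array, n):
    """Count n-grams by grouping adjacent suffixes sharing their first n tokens."""
    # Suffixes shorter than n tokens cannot start an n-gram, so skip them.
    starts = (i for i in suffix_array if i + n <= len(tokens))
    counts = {}
    for gram, group in groupby(starts, key=lambda i: tuple(tokens[i:i + n])):
        counts[gram] = sum(1 for _ in group)
    return counts


tokens = "to be or not to be".split()
sa = build_suffix_array(tokens)      # built once
bigrams = ngram_counts(tokens, sa, 2)   # reused for any n
trigrams = ngram_counts(tokens, sa, 3)
```

    The index is built once and then reused for each value of n, which is
    consistent with the pattern described above of a longer indexing step
    followed by fast n-gram output.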


