RE: [Corpora-List] good, free sentencizer?

From: Evgeniy Gabrilovich (gabr@cs.technion.ac.il)
Date: Tue Feb 11 2003 - 18:13:09 MET

  • Next message: ted pedersen: "[Corpora-List] Improvements to Latin American Registry"

    There is an adaptive sentence boundary detector
    developed by David Palmer and Marti Hearst.
    The software is available in source code (written in C)
    at http://elib.cs.berkeley.edu/src/satz/, and can
    be trained on your corpus.

    Evgeniy.

    --
    Evgeniy Gabrilovich
    Ph.D. student in Computer Science
    Department of Computer Science, Technion - Israel Institute of Technology
    Technion City, Haifa 32000, Israel
    E-mail: gabr@cs.technion.ac.il WWW: http://www.cs.technion.ac.il/~gabr
    Phone: (office) +972-4-8294948
    

    > -----Original Message----- > From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no]On > Behalf Of Joerg Schuster > Sent: Tuesday, February 11, 2003 15:47 > To: corpora@uib.no > Subject: [Corpora-List] good, free sentencizer? > > > Hello, > > does anybody know a good, free sentencizer? > > Thanks in advance, > Jörg Schuster > > > >



    This archive was generated by hypermail 2b29 : Tue Feb 11 2003 - 18:15:52 MET