Re: Corpora: Sentence splitting

Anoop Sarkar (anoop@linc.cis.upenn.edu)
Fri, 16 Oct 1998 17:56:30 EDT

Jeff Reynar and Adwait Ratnaparkhi have a probabilistic technique with
associated software for sentence boundary detection. Here are the details:

Jeffrey C. Reynar and Adwait Ratnaparkhi. A Maximum Entropy Approach to
Identifying Sentence Boundaries. In Proceedings of the Fifth Conference on
Applied Natural Language Processing, March 31-April 3, 1997. Washington, D.C.

There is some freely available software associated with this paper available
at
http://www.cis.upenn.edu/~adwait/statnlp.html

-Anoop.