Re: Corpora: Sentence splitter

Tony Valsamidis (tony@soi.city.ac.uk)
Thu, 07 Oct 1999 12:20:32 +0000

Martin

The Language Technology Group at Edinburgh have released a tokeniser
which disambiguates sentence boundaries. From the documentation:

[includes] a sentence boundary disambiguator which determines whether a
full-stop
is part of an abbreviation or a marker of a sentence boundary.

http://www.ltg.ed.ac.uk/software/ttt/index.html

Cheers

Tony

--
Tony Valsamidis                  Email : tony@soi.city.ac.uk (MIME,PGP)
Information Science Department
URL:http://www.soi.city.ac.uk/~tony
School of Informatics                  Tel: +44 171  477 8391
City University, London, UK, EC1V 0HB  Fax: +44 171  477 8584