Morphologically Analyzed and Disambiguated Turkish Texts available
Kemal Oflazer (ko@cs.bilkent.edu.tr)
Wed, 17 Jul 1996 09:18:11 +0300
Morphologically analyzed and disambiguated Turkish texts are now
available from URL http://www.nlp.cs.bilkent.edu.tr/. There are now two
texts available comprising a total of about 12,000 words. Contrary to
most tagged texts, we have represented what we think as the correct
morphological parse in a hierarchical fashion
with the inflectioal features after the last derived form being shown at
the top-most level, and the nesting levels indicating the derivations in
the lexical form. The disambiguation process also preprocesses the
morphologically analyzed to group all lexicalized and non-lexicalized
collocations.
Any comments and/or corrections are welcome.
--
Kemal Oflazer e-mail: ko@cs.bilkent.edu.tr
http://www.cs.bilkent.edu.tr/~ko/ko.html
Bilkent University tel: (90-312) 266-4133 (Sec)
Computer Engineering Department 266-4000 x1258 (Off)
Bilkent, ANKARA, 06533 TURKIYE 240-1627 (Home)
fax: (90-312) 266-4126
-------------------------------------------------------------
"The stone age was marked by man's clever use of crude tools;
the information age, to date, has been marked by man's crude use of
clever tools." -- Anonymous