Corpora: Latinate words in corpora

Chris Allen (chris.allen@ih.hh.se)
Wed, 20 Oct 1999 09:51:03 +0800

I wondered if anyone on this list could help me with an enquiry.

A student of mine is interested in obtaining frequency information for
Latinate words using a corpus. In particular, she would like to come up
with a top 10 list of the most frequent Latinate words in English.

Does anyone know of a corpus that is in some way 'tagged' according to
etymological origin? The only thing I can think of is the database of a
historically orientated dictionary such as the OED, which might be able
to supply such etymological information.

Thanks for your help,

Chris Allen
University of Halmstad
Jarnvagsgatan 8b
302 49 Halmstad
Sweden
Tel: +46 35 1012 96 (home)
+46 35 167372 (work)