Corpora: Q: POS / Parse Annotation of the Hansards

Keith J. Miller (keith@mitre.org)
Wed, 7 Jul 1999 18:10:12 -0400

I'm using the Hansard corpus (parallel French - English parliamentary
proceedings) for dissertation research on Machine Translation of
prepositions. I need both English and French data marked up with POS
and syntactic parse information. My primary concern is PP
attachment, specifically with respect to distinguishing between
verbal complement PPs and adjunct PPs.

Can anyone recommend a POS tagger or parser for either language
or a source for marked up versions of the Hansards?

Please reply to me directly; I will summarize. Thank you, and I
hope everyone is enjoying their summer (or winter, as the case may be).

----- Keith J. Miller
millerk@gusun.georgetown.edu
Georgetown University
Linguistics Department
Computational Linguistics