Re: Need corpora of COMPOUND NOUNS

Nick Youd (nick@logcam.co.uk)
Thu, 23 Feb 95 16:16:19 GMT

>
>
>
> I am currently working on the semantic interpretation of English
> nominal compounds (e.g. "rising prices", "aircraft flight arrival"),
> and I would greatly appreciate any pointers to corpora of English
> complex nominals.
>
>
> Thanks,
>
>
> Cecile Fabre
>

Cecile,
You might want to consider the DIY approach:
i) Get yourself a corpus (e.g. LOB), or collect one yourself. The latter is
a good idea if you want to confine yourself to a narrow semantic domain.
ii) If it is untagged, run it through a tagger. There are a number of
public-domain taggers which are reasonably good.
iii) Extract the noun compounds, using your own part-of-speech-based rules
to determine what counts as a compound. I may have a Perl program which might
be useful in this respect.
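
As a rough illustration of step iii (this is not the Perl program mentioned
above, just a minimal sketch in Python), the filter below collects maximal runs
of two or more consecutive noun-tagged tokens. It assumes the tagger output is
a list of (word, tag) pairs and that noun tags begin with "NN", as in
Penn-Treebank-style tagsets; the names extract_noun_compounds and is_noun are
made up for the example, and the rule should be replaced with whatever
definition of "compound" suits your study (e.g. one that also admits
participle + noun pairs such as "rising prices").

```python
from typing import Callable, Iterable, List, Tuple

TaggedToken = Tuple[str, str]  # (word, part-of-speech tag)


def is_noun(tag: str) -> bool:
    # Assumption: Penn-Treebank-style tags, where noun tags start with "NN".
    # Swap this predicate for one that matches your tagger's tagset.
    return tag.startswith("NN")


def extract_noun_compounds(
    tagged: Iterable[TaggedToken],
    noun_test: Callable[[str], bool] = is_noun,
    min_len: int = 2,
) -> List[str]:
    """Return maximal runs of at least min_len consecutive noun tokens."""
    compounds: List[str] = []
    run: List[str] = []
    for word, tag in tagged:
        if noun_test(tag):
            run.append(word)
            continue
        # A non-noun token ends the current run; keep it if long enough.
        if len(run) >= min_len:
            compounds.append(" ".join(run))
        run = []
    # Flush a run that reaches the end of the input.
    if len(run) >= min_len:
        compounds.append(" ".join(run))
    return compounds


sample = [
    ("The", "DT"), ("aircraft", "NN"), ("flight", "NN"), ("arrival", "NN"),
    ("was", "VBD"), ("delayed", "VBN"), ("by", "IN"), ("fog", "NN"), (".", "."),
]
print(extract_noun_compounds(sample))
```

On the sample sentence this keeps "aircraft flight arrival" and drops the
lone noun "fog". Passing a different noun_test is the intended way to
experiment with looser or stricter definitions of a compound.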

Bonne chance!

Nick