Re: Corpora: Corp[us|ora]

From: Darren Pearce (darrenp@cogs.susx.ac.uk)
Date: Wed Apr 04 2001 - 23:27:32 MET DST

  • Next message: lgerber: "RE: Corpora: Chomsky/Harris - one more fun question."

    AltaVista advanced search page counts:
            corpuses: ~500
            corpora: ~55,500

    I have 50M words of BNC data and the stats I have from that (based on
    dependencies) give:

            corpora: 11
            corpuses: 5

    Surprisingly low (and close)...

    Darren.

    On Wed, 4 Apr 2001, James L. Fidelholtz wrote:

    > On Wed, 4 Apr 2001, Harold Somers wrote:
    >
    > >Could users of this mailing list at least get it right?
    > >One corpus.
    > >Several corpora.
    > >It's not too much to ask is it?
    >
    > Harold:
    > Well, it depends what language you are using. In Spanish, the
    > plural of 'corpus' would be 'corpus', especially if you're not trying to
    > be snotty or hypercorrect. Even in English, except here on the list, I
    > might very well use 'corpuses'. Maybe someone can check the BNC and see
    > what they come up with.
    > Jim
    >
    > --
    > James L. Fidelholtz e-mail: jfidel@siu.buap.mx
    > Posgrado en Ciencias del Lenguaje tel.: +(52-2)229-5500 x5705
    > Instituto de Ciencias Sociales y Humanidades fax: +(01-2) 229-5681
    > Benemérita Universidad Autónoma de Puebla, MÉXICO
    >
    >
    >

    +-------------------------------------------------------------------------+
    | |
    | Darren Pearce |
    | COGS, Sussex University, Falmer, Brighton |
    | Mobile: 07950 255 448 |
    | Email: darrenmpearce@bigfoot.com |
    | |
    +-------------------------------------------------------------------------+



    This archive was generated by hypermail 2b29 : Wed Apr 04 2001 - 23:24:53 MET DST