Re: Corpora: minimum size of corpu?

From: Gordon and Pam Cain (gpcain@rivernet.com.au)
Date: Thu Feb 10 2000 - 07:41:48 MET

  • Next message: michael klotz: "Re: Corpora: minimum size of corpu?"

    Elaine Keown wrote:
    >
    > Hello,
    > What is the minimum size for a corpus? I just started Biblical Aramaic,
    > which is only 10 chapters, 276 verses. Is that large enough to do
    > meaningful corpus work on?. . .

    Elaine--

    Of course, if it is only _Biblical_ Aramaic you're interested in, then
    you either must use what you have, or expand the canon. The latter route
    is highly dubious!

    If you confine yourself to the 276 verses, then yes you can study it as
    a corpus, but very few things will repeat themselves with enough
    frequency for you to make certain statements about. Perhaps a few
    stylistic matters, or a few patterns associated with a few key words in
    your texts is all that will show themselves.

    I recently completed a study of student essays at my college, and though
    I had 29000 words of text (far more than you'll have!), and all of a
    very tightly constrained subject matter, I found I could only make a few
    statements about characteristic lexis with any confidence at all. Huge
    corpora do have their advantages, but such was not available to me, and
    appears not to be to you either.

    Of course, you could always go the road of adding contemporaneous
    Aramaic texts to your corpus, but with an ancient language that is
    frought with difficulty: Which end of four centuries do you date the
    Biblical material at? And did the Biblical authors use Aramaic in a way
    that differs significantly from the authors of other contemporaneous
    texts anyway? That is, were the Biblical writers proficient as native
    speakers, or did they use a dialect peculiar to their Jewish fellows?

    If you were game enough to tackle these questions, then you could
    perhaps work from a larger corpus, or compare the Biblical corpus
    against an extra-Biblical corpus. But you would of course have to
    qualify everything -- assumed dating of all texts, provenance of your
    corpus texts, and variation in subject matter. But that method could
    still yield you some interesting insights on how, say, the Aramiac of
    Daniel compares to the Aramic of royal proclamations or of business
    documents.

    Don't die the death of a thousand qualifications, but do be careful only
    to make responsible and supportable statements based on your small
    corpus.

    Small is beautiful, when that's all that's available!

    Take care
    Gordon

    --
    Gordon Cain 
    Teacher of ESOL
    TAFE International Education Centre
    Liverpool (Sydney) Australia
    gpcain@rivernet.com.au
    



    This archive was generated by hypermail 2b29 : Thu Feb 10 2000 - 07:42:45 MET