RE: [Corpora-List] variant log likelihood calculations

From: Rayson, Paul (rayson@exchange.lancs.ac.uk)
Date: Wed Dec 15 2004 - 11:05:53 MET

  • Next message: Serge HEIDEN: "[Corpora-List] [ATALA] Articuler les traitements sur corpus : déplacement de la journée au 12 février 2005"

    Dear Don,

    Glad you figured out the problem. Had me worried there for a moment!

    The version of the formula I use comes from the Cressie and Read paper(s) that we reference in the publications you listed. For more details, an on-line LL calculator, and the papers, see

    http://ucrel.lancs.ac.uk/llwizard.html

    Note that ln(0) is undefined, so I pre-define it to be zero. Another approach might be to use a very small value estimate for words with zero frequency.

    Regards,
    Paul.

    Dr. Paul Rayson
    Director of UCREL (University Centre for Computer Corpus Research on Language)
    Computing Department, Infolab21, South Drive, Lancaster University, Lancaster, LA1 4WA, UK.
    Web: http://www.comp.lancs.ac.uk/computing/users/paul/
    New telephone number: +44 1524 510357 Fax: +44 1524 510492

    -----Original Message-----
    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no]On
    Behalf Of Don Hardy
    Sent: 15 December 2004 05:41
    To: Don Hardy
    Cc: CORPORA
    Subject: Re: [Corpora-List] variant log likelihood calculations

    I just figured out what I was doing wrong. I wasn't carrying the Rayson
    and Garside calculation through for all cells of the contingency table.

    Thanks for the help.

    Don



    This archive was generated by hypermail 2b29 : Wed Dec 15 2004 - 11:26:12 MET