Re: [Corpora-List] N-gram string extraction

From: Christer Johansson (christer.johansson@lili.uib.no)
Date: Tue Aug 27 2002 - 18:31:48 MET DST


    May I suggest having a look at Ken Church's introduction to n-grams. You
    can find it by searching for the obvious keywords, e.g. on Google
    (Church Ngrams). Filename: kwc-ngrams.pdf

    The task can be done with simple combinations of paste, tail, sort,
    uniq -c, and small filtering programs written in awk. It is hard to do
    much better than that.
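
    As a rough illustration of that kind of pipeline, here is a minimal
    sketch for counting word bigrams. It is not taken from Church's
    tutorial; the function name bigrams, the tokenization with tr, the
    lowercasing, and the use of mktemp -d for scratch files are all my own
    assumptions.

```shell
#!/bin/sh
# Sketch only: count word bigrams in text read from standard input.
# "bigrams" is a hypothetical helper name, not something from the tutorial.
bigrams() {
    # Scratch directory for the two word lists (assumes mktemp -d exists).
    tmp=$(mktemp -d) || return 1

    # One lowercase word per line; awk 'NF' drops any empty lines.
    tr -cs '[:alpha:]' '\n' | tr '[:upper:]' '[:lower:]' | awk 'NF' > "$tmp/w1"

    # The same word list shifted up by one position.
    tail -n +2 "$tmp/w1" > "$tmp/w2"

    # Pair word i with word i+1, drop the final unpaired word,
    # then count identical pairs and list the most frequent first.
    paste "$tmp/w1" "$tmp/w2" |
        awk -F '\t' '$2 != ""' |
        sort | uniq -c |
        sort -k1,1nr -k2

    rm -rf "$tmp"
}
```

    Usage would be something like "bigrams < corpus.txt" (corpus.txt is a
    placeholder name). For trigrams, add a third list made with
    tail -n +3 and paste all three together; a further awk filter such as
    '$1 >= 5' keeps only the sequences seen at least five times.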


