[Corpora-List] Chargrams (letter ngrams) in the BNC

From: William Fletcher (fletcher@usna.edu)
Date: Tue Feb 03 2004 - 20:29:39 MET

  • Next message: Michal Finkelstein: "[Corpora-List] Spanish - common words"

    Bruce Lambert's recent query on letter n-grams prompted me to move this
    up my to-do list. Yesterday I posted a reference to lists of letter
    n-grams in words occurring 15 or more times in the BNC for n=1-3.

    Today I imported my "chargram" lists into a database for easy
    exploration by position (initial, medial, final) and frequency in types
    or token. Wildcard matching is available. While documentation is
    preliminary, the user interface is similar to that of the rest of the
    "Phrases in English" site.

    Online at
      http://pie.usna.edu/explorec.html
    Comments and suggestions welcomed.

    Bill

    - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    AssocProf William H. Fletcher
    Language Studies Department
    United States Naval Academy
    Annapolis MD 21402 5030

    410-293-6362 [voice]
    410-293-2729 [fax]
    Department
       http://usna.edu/LangStudy/
    Phrases in English
       http://pie.usna.edu/
    KWiCFinder
       http://kwicfinder.com/



    This archive was generated by hypermail 2b29 : Tue Feb 03 2004 - 20:29:57 MET