[Corpora-List] RE: Web/Corpora Questions

From: peetm (peet.morris@comlab.ox.ac.uk)
Date: Tue Sep 14 2004 - 12:09:27 MET DST

  • Next message: A.DeRoeck: "RE: [Corpora-List] corpus homogeneity"

    Hi - sorry if this isn't exactly corpora-specific, but, I've a few questions
    that I think members of this list might be able to help me with.

     

     

    1. I'm looking for any articles/information on the application of
    'grammar-analysis' to determine text-type/genre/style/register/other [delete
    as appropriate!]

     

    2. Are there any machine-readable lexicons/databases that contain
    information like:

     

    o - Indication of writer's/reader's 'Reading Age' (correct term?)

    o - Common Synonyms

    o - Common Misspellings

    o - Rarity/Density (e.g., *this* word is used infrequently)

                o - I know the last of these is typically ascertained through
    corpus analysis, but I thought I'd ask anyway!

    3. Does the WordNet database exist in any 'popular' format, e.g., Oracle,
    SQL-Server, Access?

     

    peetm



    This archive was generated by hypermail 2b29 : Tue Sep 14 2004 - 12:03:56 MET DST