Re: [Corpora-List] Punctuation

From: Keith Suderman (suderman@cs.vassar.edu)
Date: Mon Jan 17 2005 - 16:49:05 MET

  • Next message: Joel Tetreault: "[Corpora-List] Re: Reinforcement Learning packages for NLP?"

    Hello Jane,

    At 03:54 PM 1/13/2005 -0800, Jane A. Edwards wrote:
    >Oops! The fact you are addressing me personally makes me think
    >my posting may have been construed as critical of ANC.

    Not at all, and I apologize for any misunderstanding. I wasn't subscribed
    to the corpora list when Nancy forwarded your message to me (a problem
    since corrected), so my reply to the list bounced.

    >My intent was to focus on the interpretation of punctuation by users of
    >any corpus,

    And your comments were well taken. I hate to show my ignorance, but it
    never occurred to me that people would want to search the ANC for
    punctuation. Most search tools that I'm aware of are only concerned with
    the "words" in a corpus and very few index the punctuation in any
    meaningful way. I will definitely follow up on the references you posted.

    >Thank you very much for your posting, though, as I had not known
    >of those great properties of ANC:
    >- inclusion of written, spoken, AND written to be spoken;

    I should clarify this. We don't actually have any "written to be spoken"
    texts yet, but when we do they will be marked as such. ;)

    Cheers,
    Keith

    --------------------------------------------------
    Keith Suderman
    Technical Specialist
    American National Corpus
    suderman@cs.vassar.edu
    http://americannationalcorpus.org



    This archive was generated by hypermail 2b29 : Mon Jan 17 2005 - 16:47:05 MET