RE: [Corpora-List] pronunciation lexica

From: LAWSON, Ann (Lawsona@oup.co.uk)
Date: Wed Sep 04 2002 - 18:16:58 MET DST

  • Next message: Lucian Galescu: "Re: [Corpora-List] pronunciation lexica"

    Sorry - I should have added that this isn't public domain (sorry) but
    reasonable rates apply for academic researchers.
    Ann

    -----Original Message-----
    From: LAWSON, Ann [mailto:Lawsona@oup.co.uk]
    Sent: 04 September 2002 17:11
    To: corpora@hd.uib.no
    Subject: RE: [Corpora-List] pronunciation lexica

    Dear James,

    Here is some information about US English pronunciation resources (over
    165,000 headwords and over 255,000 wordforms) held by Oxford University
    Press and available for research or commercial use. More information can be
    found at www.oup.co.uk/digital_reference. I hope this information is useful
    to you,
    Ann Lawson

    Business Development Manager, Digital Reference
    Academic Division, Oxford University Press
    Great Clarendon Street, Oxford OX2 6DP, UK
    Tel: +44 (0) 1865 353255 Fax: +44 (0) 1865 353658
    Mob: +44 (0) 7979 953 100 Web: www.oup.co.uk
    Passionate about language? www.askoxford.com

    Natural Language Lexicons
    -------------------------
    Alongside OUP's formal dictionary resources, we hold extensive fully-tagged
    SGML databases of morphological and phonetic data purpose-built for natural
    language applications. These source lexicons currently exist for general
    vocabulary in English (UK and US), French, Spanish and Italian, with a
    smaller database in German. Further lexicons are also available in
    specialist reference areas, e.g. medical, business.

    Each headword lemma is provided with a full listing of its possible
    syntactic forms and spelling variants, along with information on their
    relationship to the headword form. In addition, a keyboard representation of
    the IPA pronunciation is given for every form. There is also information on
    domains in which the headwords are used, e.g. computing, engineering,
    zoology.

    English Lexicons
    ----------------
    Coverage: UK over 155,000 headwords; 250,000 wordforms; 25,000 proper nouns
    US over 165,000 headwords; 255,000 wordforms; 25,000 proper nouns; 2,000
    abbreviations

    Additional features: exclusive US or World English orthographic forms;
    phonetic variants; primary and secondary stress information; clear potential
    for subset generation using listed sources as benchmarks; up-to-date
    coverage of special-interest domains drawn from the ongoing Oxford reading
    programme

    -----Original Message-----
    From: James Magnuson [mailto:magnuson@paradox.psych.columbia.edu]
    Sent: 04 September 2002 15:22
    To: corpora@hd.uib.no
    Subject: [Corpora-List] pronunciation lexica

    Does anyone know of any large sets of American English pronunciations
    (preferably including inflected forms, etc. -- the more complete, the
    better, of course) in electronic format and preferably in the public
    domain?

    I have come across two, but I cannot determine their origin. One is
    labelled 'MIT' and has about 25,000 forms, and the other is labelled
    'MOBY' and has about 35,000 forms. If anyone knows the origin of
    these, I would also appreciate learning more about them.

    Thanks,

    jim

    --------------------------------------------------------
    James Magnuson
    Department of Psychology
    Columbia University
    1190 Amsterdam Ave., MC 5501
    New York City, New York 10027
    (212)854-5667
    magnuson@psych.columbia.edu



    This archive was generated by hypermail 2b29 : Wed Sep 04 2002 - 18:21:29 MET DST