Re: [Corpora-List] semantic similarity

From: Eric Atwell (eric@comp.leeds.ac.uk)
Date: Thu Jan 20 2005 - 18:33:44 MET

  • Next message: Antoinette Renouf: "RE: [Corpora-List] semantic similarity"

    George Demetriou used the word-definitions in LDOCE (Longman Dictionary of
    Contemporary English) online lexicon, to measure similarity between 2 words
    in a text by computing overlap between their definitions. See:

    Demetriou G and Atwell E. 2001.
    <a href="http://www.comp.leeds.ac.uk/eric/iwcs.ps"> A domain-independent
    semantic tagger for the study of meaning associations in English text.</a>
    In Harry Bunt, Ielka van der Sluis and Elias Thijsse (editors),
    Proceedings of the Fourth International Workshop on Computational
    Semantics (IWCS-4) pp.67-80. Tilburg, Netherlands.

    Xiao Yuan Duan used the same metric to compute semantic overlap between
    2 web-apges, by aggregating semantic overlaps between every pair of
    words across the 2 documents; see:

    Xiao Yuan DUAN, 2002, <a href="/research/pubs/theses/duan.pdf">
    Lexical Semantic Association Between Web Documents</a>
    MSc research thesis, School of Computing, Leeds University

    LDOCE is good for this becuase all definitions are in terms of a small
    defining vocabulary; but other online dictionaries can also work

    Hope this helps,

    Eric Atwell, Leeds University

    On Thu, 20 Jan 2005, Jana Diesner wrote:

    > Dear list members,
    >
    > We are looking for strategies, algorithms or code to automatically find
    > single terms or multiple adjacent terms that are semantically similar within
    > and across documents. The approach must not require POS tagging or an
    > initial input of a reference term to the system. The resulting clusters of
    > semantically similar terms suggested by the system do not need to be
    > exclusive. We are familiar with secondstring, the software developed by
    > William Cohen, and semantic similarity based on string-edit distances.
    >
    >
    >
    > Thank you very much.
    >
    > Jana
    >
    >
    >
    > ____________________
    >
    > Jana Diesner
    > Carnegie Mellon University
    >
    > jdiesner@andrew.cmu.edu
    >
    >
    >
    >

    -- 
    Eric Atwell, Senior Lecturer, Computer Vision and Language research group,
    School of Computing, University of Leeds, LEEDS LS2 9JT, England
    TEL: +44-113-2335430  FAX: +44-113-2335468  http://www.comp.leeds.ac.uk/eric
    



    This archive was generated by hypermail 2b29 : Thu Jan 20 2005 - 18:43:26 MET