Re: [Corpora-List] semantic similarity

From: Leonid Kontorovich (lkontoro@andrew.cmu.edu)
Date: Thu Jan 20 2005 - 17:57:48 MET

  • Next message: Nuno Seco: "RE: [Corpora-List] semantic similarity"

    Hi Jana,

    have you looked at Latent Dirichlet Allocation, developed by Blei, Jordan
    and Ng? Take a look at Blei's homepage:
    http://www.cs.berkeley.edu/~blei/

    in particular,
    Latent Dirichlet allocation. D. Blei, A. Ng, and M.
    Jordan. Journal of Machine Learning Research, 3:993-1022, January 2003.

    Dave Blei is now a postdoc at CMU, and I'm a grad student here -- so feel
    free to stop by.

    Best,
    -Leo

    On Thu, 20 Jan 2005, Jana Diesner wrote:

    > Dear list members,
    >
    > We are looking for strategies, algorithms or code to automatically find
    > single terms or multiple adjacent terms that are semantically similar within
    > and across documents. The approach must not require POS tagging or an
    > initial input of a reference term to the system. The resulting clusters of
    > semantically similar terms suggested by the system do not need to be
    > exclusive. We are familiar with secondstring, the software developed by
    > William Cohen, and semantic similarity based on string-edit distances.
    >
    >
    >
    > Thank you very much.
    >
    > Jana
    >
    >
    >
    > ____________________
    >
    > Jana Diesner
    > Carnegie Mellon University
    >
    > jdiesner@andrew.cmu.edu



    This archive was generated by hypermail 2b29 : Thu Jan 20 2005 - 17:51:25 MET