[Corpora-List] RE: [MT-List] term extraction info (3)

From: KERREMANS, Koen (Koen.Kerremans@ehb.be)
Date: Fri Oct 04 2002 - 10:01:06 MET DST

  • Next message: Ruslan Mitkov: "Re: [Corpora-List] term extraction info (2)"

    Several people also recommended the Trados software programme ExtraTerm.
    More info can be found at: http://www.trados.com

    Regards,

    Koen

    -----Oorspronkelijk bericht-----
    Van: mt-list-admin@eamt.org [mailto:mt-list-admin@eamt.org]Namens
    KERREMANS, Koen
    Verzonden: vrijdag 4 oktober 2002 9:45
    Aan: mt-list@eamt.org; corpora@hd.uib.no
    Onderwerp: [MT-List] term extraction info (2)

    !!I forgot some references. This is the new (complete) version of
    responses!!

    Hello,

    These are the references I got in answer to my question concerning "term
    extraction" (cf. see below). Each reference is preceded by the name of the
    person who gave me the information. Feel free to add more info to this list.

    Regards,

    Koen Kerremans

    1. Books:

    -Jerome Richalot: Pearson, J. (1998). Terms in context. Benjamins, John
    Publishing Company.
    -Jorge Vivaldi: Bourigault, D.; Jacquemin C. y M.-C. L'Homme (eds.) Recent
    Advances in Computational Terminology. John Benjamins Publishing Company.
    Amsterdam.

    2. Articles:

    -Piklu Gupta: Heid, U. (1999). "Extracting terminologically relevant
    collocations from German technical texts". [search via Google]
    -Klaus Fleischmann: L'homme, Benali, Bertrand, Laudique. (1996). "Definition
    of an evaluation grid for term extraction software". In Terminology 3:2.
    Benjamins Publishing Co.
    -Chantal Enguehard: Enguehard, C., Pantéra, L., "Automatic Natural
    Acquisition of a Terminology", Journal of quantitative linguistics, vol.2,
    n°1, pp.27-32, 1995.
    -Chantal Enguehard: C. Enguehard, B. Daille, E. Morin, “Tools for
    Terminology Processing”, The Indo-European Conference on Multilingual
    Communications Technologies (IEMCT), R. K. Arora, M. Kulkarni, H. Darbari
    (editors), Tata McGraw-Hill, pp.218-229, Pune, India, June 2002.
    -Jorge Vivaldi: Vivaldi, J. y H. Rodríguez (2000) "Improving term extraction
    by combining different techniques". Ananiadou S. y D. Maynard (eds.) in the
    proceedings of Workshop on Computational terminology for medical and
    biological Applications (NLP2000). Patras, 4 de junio. Pags. 61-68.
    -Jorge Vivaldi: Terminology, Vol. 7, num.1. John Benjamins. Pag. 31-47. John
    Benjamins Publishing Company, Amsterdam.
    -Vivaldi J. and H. Rodríguez (2002) Medical Term Extraction using EWN
    ontology. Proceedings of "Terminology and Knowledge Engineering 2002"
    (TKE'02). Nancy.

    3. Websites:

    -Johan Haller: http://www.iai.uni-sb.de/de/pub.html
    -John Kohl: http://www.xplanation.com [xplanation has a term-extraction tool
    that is part of the MT system that this company uses. It is pretty good at
    identifying noun phrases. They are located in Leuven. They also have
    controlled-English software]
    -Ross Smith: http://www.mkms.xerox.com [XEROX have a terminology management
    program called XTS which contains an extraction function]
    -François Rousselot:
    http://www-ensais.u-strasbg.fr/liia/LIIA_Products_Installers/install.htm
    [this tool is based on repeated segments: there is a small english
    documentation in the program]
    -Scheiden: http://www.biomath.jussieu.fr/ATALA/outil/ [section "Extraction
    de termes"]
    -Nicholas Hernandez: http://www.limsi.fr/Individu/jacquemi/ [Fastr is a
    parser for term and variant recognition. Fastr take as input a corpus and a
    list of terms and ouputs the indexed corpus in which terms and variants are
    recognized]

    4. Notes:

    -Sabine Kirchmeier-Andersen
    (http://www.id.cbs.dk/medarbej/ska/sabine_da.shtml) recommends Word Smith
    Tools and Quirk who both can use a LGP korpus in order to identify
    automatically frequent LSP candidates. She thinks the latest articles by
    Beatrice Daille et al. about term extraction describe the most efficient
    methodology.
    -Antal van den Bosch (http://ilk.kub.nl/~antalb/) did some experimenting
    with a memory-based shallow parser after which he extracted terms using the
    tf*idf method in statistics

    > -----Original Message-----
    > Hello,
    >
    > Does anyone know of good term extraction tools/methods? My purpose is to
    > compare some of the existing methodologies to one another and to evaluate
    > their performances on domain-specific texts. Good references or surveys of
    > term extraction tools/methods are welcome as well.
    >
    > Regards,
    >
    > Koen Kerremans
    >

    --
      For MT-List info, see http://www.eamt.org/mt-list.html
    



    This archive was generated by hypermail 2b29 : Fri Oct 04 2002 - 10:25:14 MET DST