Re: [Corpora-List] Q: How to identify duplicates in a largedocument collection

From: Marc Kupietz (kupietz@ids-mannheim.de)
Date: Wed Jan 05 2005 - 16:54:47 MET

  • Next message: Keith Herold: "Re: [Corpora-List] open-source program tranforming xml tags"

    Hi Bill,

    I'm currently preparing some platform-portable code to share.

    Regards,
    Marc

    Am Mittwoch, den 05.01.2005, 06:33 -0500 schrieb William Fletcher:
    > Hi Marc and Normand,
    >
    > How about sharing your code scripts? I am sure everyone would be grateful for an of-the-shelf solution that could be easily adapted to one's own needs or serve as inspiration for other applications.
    >
    > Regards,
    > Bill
    >

    -- 
    Marc Kupietz                                      Tel. (+49) 621/1581-409
    Institut für Deutsche Sprache, Dept. of Lexical Studies/Corpus Technology
    PO Box 101621, 68016 Mannheim, Germany        http://www.ids-mannheim.de/
    



    This archive was generated by hypermail 2b29 : Wed Jan 05 2005 - 16:54:31 MET