Re: Corpora: Measuring Text Reuse

From: Bill Mann (bill_mann@sil.org)
Date: Sat May 06 2000 - 13:10:06 MET DST


         Paul and all:
         
         A large amount of thought has gone into finding clear boundaries and
         categories of reuse, under labels such as copyright, plagiarism,
         translation, and intellectual property. It would be nice if that
         work could be used directly, or its concepts adapted, for corpus
         research. It seems worth looking at.
         
         Bill Mann

    ______________________________ Reply Separator _________________________________
    Subject: Corpora: Measuring Text Reuse
    Author: <p.clough@dcs.shef.ac.uk> at Internet
    Date: 5/5/00 9:39 AM

    Hi,
         
    I am a postgraduate working on a project looking at how text from a British
    news agency is being reused by various newspapers. I am interested in
    whether anyone else is working on anything similar or knows any other
    projects dealing with reuse. I am trying to get a grasp on how to actually
    define reuse, since I am dealing not only with verbatim copying of text but
    also with cases where text is paraphrased. Has anyone any ideas or opinions
    on what they consider as reuse of text, or what tools could be used to
    extract reused material?
         
    Thanks for any comments,
         
    Paul Clough.
    University of Sheffield.


