RE: [Corpora-List] Legal aspects of compiling corpora

From: Khalid CHOUKRI (choukri@elda.fr)
Date: Sun Jun 15 2003 - 09:37:56 MET DST

  • Next message: Mark Sanderson: "RE: [Corpora-List] Legal aspects of compiling corpora"

    Dear Colleagues

    I am not sure the way suggested by Adam is the right thing to do,
    ELRA and LDC have been trying to negotiate such rights in order to provide
    the users of corpora with good and legally-cleared resources and we are
    happy to help request and get the authorizations to use the data in a more
    sound and clean legal context.

    of course we can all do whatever we feel fair and then if no one sue us it
    is fine but just imagine that after your 5 years work someone come across
    your publication in which you refer to the data and managed to prevent you
    from such reference/publication or even ask you to delete all such data ..

    Best regards
    Khalid CHOUKRI
    European Language Resources Association

    At Friday 13/06/2003 15:36(), Adam Kilgarriff wrote:
    >On the one hand, if your enemies are rich enough you'll lose.
    >
    >On the other you're probably less worth sueing than Google and they are
    >still going strong (anyone out there from Google? Your contribution
    >most welcome), and it doesn't sound like you are doing anything with any
    >salient legal difference. (Getting authors' agreements takes huge
    >amounts of resources and isn't feasible; listing references doesn't
    >help.)
    >
    >People do get unhappy about their pictures and audio being grabbed from
    >the web for use in other people's databases, and I have heard of cases
    >of web developers having to rein in their ambitions because objections
    >have been made. As yet, mercifully, that hasn't happened with text -
    >people don't seem alarmed at the idea that the text they publish on the
    >web gets re-used. Let's all pray it stays that way (though sooner or
    >later we're bound to get chancers trying it on - can't help fearing the
    >web is in its honeymoon phase, and the racketeers will mess it all up
    >before too long).
    >
    >In the meantime - take courage! Do it!
    >
    >
    > Adam
    >
    >
    >=======================
    >Adam Kilgarriff
    >Lexicography MasterClass Ltd: http://www.lexmasterclass.com
    >adam@lexmasterclass.com
    >+44 (0)1273 705773
    > --and--
    >ITRI, University of Brighton
    >Lewes Road, Brighton BN2 0BL, UK
    >http://www.itri.brighton.ac.uk/~Adam.Kilgarriff
    >adam@itri.brighton.ac.uk
    >+44 (0)1273 642919
    >==============================
    >World is crazier and more of it than we think,
    >Incorrigibly plural
    > ---'Snow', Louis MacNeice
    >==============================
    >
    >
    > > -----Original Message-----
    > > From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no]
    >On
    > > Behalf Of delucca@nilc.icmc.usp.br
    > > Sent: 13 June 2003 13:49
    > > To: corpora@hd.uib.no
    > > Subject: [Corpora-List] Legal aspects of compiling corpora
    > >
    > >
    > > Dear Linguists and Lawyers,
    > >
    > > I am troubled with Legal aspects of corpora compiling. I am in
    > > doubt if is an illegal procedure storage webpages (or part of them)
    > > in a database (see at http://www.dictionarium.com/project.htm),
    > > not available to public, and display its contents as short
    >collocations
    > > less than 100 characters by time by search method.
    > >
    > > On the other hand, the Internet search engines uses cached (temporary
    >?)
    > > copies of the sites and display a short of the web pages.
    > >
    > > My procedure is wrong? Which the Legal difference? I need ask
    >permission
    > > for each website to storage its pages? If I mention the source and the
    > > author
    > > I will be protecting the copyrights?
    > >
    > >
    > > I look forward to hearing from you.
    > >
    > >
    > > Yours Sincerely,
    > >
    > >
    > > J. L. De Lucca
    > >
    > > -------------------------------------------------
    > > This mail sent through IMP: http://horde.org/imp/

    *************************************************************
    Khalid CHOUKRI mailto:choukri@elda.fr
    ELRA CEO
    Tel. +33 1 43 13 33 33 - Fax. +33 1 43 13 33 30
    Postal Mail: 55 Rue Brillat-Savarin, 75013 Paris France
    Home page: http://www.elda.fr/ or http://www.elra.info/
    LREC news: http://www.lrec-conf.org/
    ***************************************************************



    This archive was generated by hypermail 2b29 : Sun Jun 15 2003 - 09:52:20 MET DST