Re: Corpora: protein name list

From: George Demetriou (g.demetriou@dcs.shef.ac.uk)
Date: Thu Nov 01 2001 - 16:24:22 MET

  • Next message: Philip Resnik: "Re: Corpora: protein name list"

    You can find lists of protein names in the following sites:

    SCOP database:
    http://scop.mrc-lmb.cam.ac.uk/scop/

    CATH database:

    http://www.biochem.ucl.ac.uk/bsm/cath/index.html

    Protein Data Bank:

    ftp://ftp.rcsb.org/pub/pdb/

    Enzyme names from Expasy:

    ftp://www.expasy.ch/databases/enzyme

    Also, lists of protein names we have used in the PASTA project
    (http://www.dcs.shef.ac.uk/nlp/pasta/) can be made available on request.
    Some of the terms in the lexicons were derived from the above sources
    but were supplemented with protein names extracted from texts.

    George Demetriou

    %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
                          Dr George Demetriou

    Dept. of Computer Science Room: 219
    The University of Sheffield Tel: +44 (0) 114 2221894
    Regent Court FAX: +44 (0) 114 2221810
    211 Portobello Street e-mail: demetri@dcs.shef.ac.uk
    Sheffield, S1 4DP, UK
    %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

    > Dear Colleagues,
    >
    > I am collecting protein name list for bioinformatics research.
    > Does anyone know of public protein name list?
    >
    > Thanks.
    >
    > Hsin-Hsi Chen
    > National Taiwan Unversity



    This archive was generated by hypermail 2b29 : Thu Nov 01 2001 - 16:58:02 MET