Protein identification with N and C-terminal sequence tags in proteome projects

被引:76
作者
Wilkins, MR
Gasteiger, E
Tonella, L
Ou, K
Tyler, M
Sanchez, JC
Gooley, AA
Walsh, BJ
Bairoch, A
Appel, RD
Williams, KL
Hochstrasser, DF
机构
[1] Univ Hosp Geneva, Cent Lab Clin Chem, CH-1211 Geneva 14, Switzerland
[2] Univ Geneva, Dept Biochem Med, CH-1211 Geneva, Switzerland
[3] Macquarie Univ, Australian Proteome Anal Facil, Sydney, NSW 2109, Australia
[4] Univ Hosp Geneva, Med Informat Div, CH-1211 Geneva 14, Switzerland
关键词
protein identification; sequence tag; proteome; two-dimensional gel electrophoresis;
D O I
10.1006/jmbi.1998.1726
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genome sequences are available for increasing numbers of organisms. The proteomes (protein complement expressed by the genome) of many such organisms a-e being studied with two-dimensional (2D) gel electrophoresis. Here we have investigated the application of short N-terminal and C-terminal sequence tags to the identification of proteins separated on 2D gels. The theoretical N and C termini of 15,519 proteins, representing all SWISS-PROT entries for the organisms Mycoplasma genitalium, Bacillus subtilis, Escherichia coli, Saccharomyces cerevisae and human, were analysed. Sequence tags were found to be surprisingly specific, with N-terminal tags of four amino acid residues found to be unique for between 43% and 83% of proteins, and C-terminal tags of four amino acid residues unique for between 74% and 97% of proteins, depending on the species studied. Sequence tags of five amino acid residues were found to be even more specific. To utilise this specificity of sequence tags for protein identification, we created a world-wide web-accessible protein identification program, TagIdent (http://www.expasy.ch/www/tools.html), which matches sequence tags of up to six amino acid residues as well as estimated protein pi and mass against proteins in the SWISS-PROT database. We demonstrate the utility of this identification approach with sequence tags generated from 91 different E. coli proteins purified by 2D gel electrophoresis. Fifty-one proteins were unambiguously identified by virtue of their sequence tags and estimated pi and mass, and a further 11 proteins identified when sequence tags were combined with protein amino acid composition data. We conlcude that the TagIdent identification approach is best suited to the identification of proteins from prokaryotes whose complete genome sequences are available. The approach is less well suited to proteins from eukaryotes, as many eukaryotic proteins are not amenable to sequencing ain Edman degradation, and tag protein identification cannot be unambiguous unless an organism's complete sequence is available. (C) 1998 Academic Press Limited.
引用
收藏
页码:599 / 608
页数:10
相关论文
共 27 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] A NEW-GENERATION OF INFORMATION-RETRIEVAL TOOLS FOR BIOLOGISTS - THE EXAMPLE OF THE EXPASY WWW SERVER
    APPEL, RD
    BAIROCH, A
    HOCHSTRASSER, DF
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1994, 19 (06) : 258 - 260
  • [3] The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998
    Bairoch, A
    Apweiler, R
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 38 - 42
  • [4] PEPTIDE LADDER SEQUENCING BY MASS-SPECTROMETRY USING A NOVEL, VOLATILE DEGRADATION REAGENT
    BARTLETJONES, M
    JEFFERY, WA
    HANSEN, HF
    PAPPIN, DJC
    [J]. RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 1994, 8 (09) : 737 - 742
  • [5] Electroblotting of proteins to Teflon tape acid membranes for N- and C-terminal sequence analysis
    Burkhart, WA
    Moyer, MB
    Bailey, JM
    Miller, CG
    [J]. ANALYTICAL BIOCHEMISTRY, 1996, 236 (02) : 364 - 367
  • [6] GOOLEY AA, 1997, IN PRESS ELECTROPHOR, V18
  • [7] Direct database searching with MALDI-PSD spectra of peptides
    Griffin, PR
    MacCoss, MJ
    Eng, JK
    Blevins, RA
    Aaronson, JS
    Yates, JR
    [J]. RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 1995, 9 (15) : 1546 - 1551
  • [8] METHODS FOR INCREASING THE RESOLUTION OF TWO-DIMENSIONAL PROTEIN ELECTROPHORESIS
    HOCHSTRASSER, DF
    HARRINGTON, MG
    HOCHSTRASSER, AC
    MILLER, MJ
    MERRIL, CR
    [J]. ANALYTICAL BIOCHEMISTRY, 1988, 173 (02) : 424 - 435
  • [9] DEVELOPMENT OF POLYACRYLAMIDE GELS THAT IMPROVE THE SEPARATION OF PROTEINS AND THEIR DETECTION BY SILVER STAINING
    HOCHSTRASSER, DF
    PATCHORNIK, A
    MERRIL, CR
    [J]. ANALYTICAL BIOCHEMISTRY, 1988, 173 (02) : 412 - 423
  • [10] Of genomes and proteomes
    James, P
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1997, 231 (01) : 1 - 6