Identifying secretomes in people, pufferfish and pigs

被引:36
作者
Klee, EW
Carlson, DF
Fahrenkrug, SC
Ekker, SC
Ellis, LBM
机构
[1] Univ Minnesota, Arnold & Mabel Beckman Ctr Transposon Res, Minneapolis, MN 55455 USA
[2] Univ Minnesota, St Paul, MN 55108 USA
关键词
D O I
10.1093/nar/gkh286
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The proteins processed by the secretory pathway (secretome) are critical players in the development of multi-cellular eukaryotic organisms but have yet to be comprehensively studied at the genomic level. In this study, we use the Target P algorithm to predict human (13-20% of proteins found in individual datasets) and Fugu (14%) secretomes based on analysis of their nearly complete proteomes. We combine internal processing with prediction software to automate secreted protein identification and overcome one of the major challenges associated with EST data: identification of the minority of clones that encode N-terminally-complete proteins. We discuss the use of these methods to predict secreted proteins in EST-based consensus sequence sets, and we validate these predictions using an assay for cell-free cotranslational translocation. Analysis of TIGR Porcine Gene Index 4.0 as a test dataset resulted in the identification of 352 N-terminally-complete, putative secreted proteins. In functional agreement with our predictions, 34 of 40 (85%) of these cDNAs were verified to be cotranslationally translocated in an in vitro translation system. The methods developed here are specifically designed to accept partial open reading frames and improve secreted protein predictions in eukaryotic transcriptomes, and are valuable for the analysis and annotation of eukaryotic EST databases.
引用
收藏
页码:1414 / 1421
页数:8
相关论文
共 27 条
[1]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes [J].
Aparicio, S ;
Chapman, J ;
Stupka, E ;
Putnam, N ;
Chia, J ;
Dehal, P ;
Christoffels, A ;
Rash, S ;
Hoon, S ;
Smit, A ;
Gelpke, MDS ;
Roach, J ;
Oh, T ;
Ho, IY ;
Wong, M ;
Detter, C ;
Verhoef, F ;
Predki, P ;
Tay, A ;
Lucas, S ;
Richardson, P ;
Smith, SF ;
Clark, MS ;
Edwards, YJK ;
Doggett, N ;
Zharkikh, A ;
Tavtigian, SV ;
Pruss, D ;
Barnstead, M ;
Evans, C ;
Baden, H ;
Powell, J ;
Glusman, G ;
Rowen, L ;
Hood, L ;
Tan, YH ;
Elgar, G ;
Hawkins, T ;
Venkatesh, B ;
Rokhsar, D ;
Brenner, S .
SCIENCE, 2002, 297 (5585) :1301-1310
[4]   Prediction of protein signal sequences [J].
Chou, KC .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2002, 3 (06) :615-622
[5]   Ensembl 2002: accommodating comparative genomics [J].
Clamp, M ;
Andrews, D ;
Barker, D ;
Bevan, P ;
Cameron, G ;
Chen, Y ;
Clark, L ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Down, T ;
Durbin, R ;
Eyras, E ;
Gilbert, J ;
Hammond, M ;
Hubbard, T ;
Kasprzyk, A ;
Keefe, D ;
Lehvaslaiho, H ;
Iyer, V ;
Melsopp, C ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Rust, A ;
Schmidt, E ;
Searle, S ;
Slater, G ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Stupka, E ;
Ureta-Vidal, A ;
Vastrik, I ;
Birney, E .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :38-42
[6]   The Secreted Protein Discovery Initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: A bioinformatics assessment [J].
Clark, HF ;
Gurney, AL ;
Abaya, E ;
Baker, K ;
Baldwin, D ;
Brush, J ;
Chen, J ;
Chow, B ;
Chui, C ;
Crowley, C ;
Currell, B ;
Deuel, B ;
Dowd, P ;
Eaton, D ;
Foster, J ;
Grimaldi, C ;
Gu, QM ;
Hass, PE ;
Heldens, S ;
Huang, A ;
Kim, HS ;
Klimowski, L ;
Jin, YS ;
Johnson, S ;
Lee, J ;
Lewis, L ;
Liao, DZ ;
Mark, M ;
Robbie, E ;
Sanchez, C ;
Schoenfeld, J ;
Seshagiri, S ;
Simmons, L ;
Singh, J ;
Smith, V ;
Stinson, J ;
Vagts, A ;
Vandlen, R ;
Watanabe, C ;
Wieand, D ;
Woods, K ;
Xie, MH ;
Yansura, D ;
Yi, S ;
Yu, GY ;
Yuan, J ;
Zhang, M ;
Zhang, ZM ;
Goddard, A ;
Wood, WI .
GENOME RESEARCH, 2003, 13 (10) :2265-2270
[7]   Predicting subcellular localization of proteins based on their N-terminal amino acid sequence [J].
Emanuelsson, O ;
Nielsen, H ;
Brunak, S ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) :1005-1016
[8]   Porcine gene discovery by normalized cDNA-library sequencing and EST cluster assembly [J].
Fahrenkrug, SC ;
Smith, TPL ;
Freking, BA ;
Cho, J ;
White, J ;
Vallet, J ;
Wise, T ;
Rohrer, G ;
Pertea, E ;
Sultana, R ;
Quackenbush, J ;
Keele, JW .
MAMMALIAN GENOME, 2002, 13 (08) :475-478
[9]   The mouse secretome: Functional classification of the proteins secreted into the extracellular environment [J].
Grimmond, SM ;
Miranda, KC ;
Yuan, Z ;
Davis, MJ ;
Hume, DA ;
Yagi, K ;
Tominaga, N ;
Bono, H ;
Hayashizaki, Y ;
Okazaki, Y ;
Teasdale, RD .
GENOME RESEARCH, 2003, 13 (6B) :1350-1359
[10]   Secretion of the galectin family of mammalian carbohydrate-binding proteins [J].
Hughes, RC .
BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 1999, 1473 (01) :172-185