Mouse proteome analysis

Cited by: 83
Authors
Kanapin, A
Batalov, S
Davis, MJ
Gough, J
Grimmond, S
Kawaji, H
Magrane, M
Matsuda, H
Schönbach, C
Teasdale, RD
Yuan, Z
Affiliations
[1] EMBL Outstation, European Bioinformat Inst, Hinxton CB10 1SD, Cambs, England
[2] Osaka Univ, Grad Sch Informat Sci & Technol, Dept Bioinformat Engn, Toyonaka, Osaka 5608531, Japan
[3] Univ Queensland, Inst Mol Biosci, St Lucia, Qld 4072, Australia
[4] Univ Queensland, ARC Special Res Ctr Funct & Appl Genom, St Lucia, Qld 4072, Australia
[5] Novartis Res Fdn GNF, Genom Inst, San Diego, CA 92121 USA
[6] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA
[7] RIKEN, Genom Sci Ctr GSC, Bioinformat Grp, Knowledge Discovery Team, Yokohama, Kanagawa 2300045, Japan
[8] RIKEN, Genom Sci Ctr GSC, Yokohama Inst, Lab Genome Explorat Res Grp,Tsurumi Ku, Yokohama, Kanagawa 2300045, Japan
[9] NTT Software Corp, Naka Ku, Yokohama, Kanagawa 2318554, Japan
[10] RIKEN, Genome Sci Lab, Wako, Saitama 3510198, Japan
DOI
10.1101/gr.978703
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
This paper presents a general overview of the protein sequence set for the mouse transcriptome produced during the FANTOM2 sequencing project. We applied different algorithms to characterize protein sequences derived from a nonredundant representative protein set (RPS) and a variant protein set (VPS) of the mouse transcriptome. Functional characterization and assignment of Gene Ontology terms were done by analyzing the proteome with InterPro. The Superfamily database analyses gave a detailed structural classification according to SCOP and provided additional evidence for the functional characterization of the proteome data. The MDS database analysis revealed new domains that are not represented in existing protein domain databases. The transcriptome is thus a unique source of data for the detection of new functional groups. The data obtained for the RPS and VPS sets facilitated the comparison of different patterns of protein expression. A comparison with other existing mouse and human protein sequence sets (e.g., the International Protein Index) demonstrates the common patterns in mammalian proteomes. The analysis of membrane organization within the transcriptomes of multiple eukaryotes provides valuable statistics about the distribution of secretory and transmembrane proteins.
Pages: 1335-1344
Page count: 10