The International Protein Index: An integrated database for proteomics experiments

被引:574
作者
Kersey, PJ [1 ]
Duarte, J [1 ]
Williams, A [1 ]
Karavidopoulou, Y [1 ]
Birney, E [1 ]
Apweiler, R [1 ]
机构
[1] European Bioinformat Inst, EMBL Outstat, Hinxton CB10 1SD, Cambs, England
关键词
bioinformatics; databases; human genome; International Protein Index; proteomes;
D O I
10.1002/pmic.200300721
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Despite the complete determination of the genome sequence of several higher eukaryotes, their proteomes remain relatively poorly defined. Information about proteins identified by different experimental and computational methods is stored in different databases, meaning that no single resource offers full coverage of known and predicted proteins. IPI (the International Protein Index) has been developed to address these issues and offers complete nonredundant data sets representing the human, mouse and rat proteomes, built from the Swiss-Prot, TrEMBL, Ensembl and RefSeq databases.
引用
收藏
页码:1985 / 1988
页数:4
相关论文
共 9 条
[1]   The Mouse Genome Database (MGD): the model organism database for the laboratory mouse [J].
Blake, JA ;
Richardson, JE ;
Bult, CJ ;
Kadin, JA ;
Eppig, JT .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :113-115
[2]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[3]  
CLAMP M, 1906, NUCLEIC ACIDS RES, V31, P38
[4]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[5]   Clustering of highly homologous sequences to reduce the size of large protein databases [J].
Li, WZ ;
Jaroszewski, L ;
Godzik, A .
BIOINFORMATICS, 2001, 17 (03) :282-283
[6]   RefSeq and LocusLink: NCBI gene-centered resources [J].
Pruitt, KD ;
Maglott, DR .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :137-140
[7]   NCBI Reference Sequence Project: update and current status [J].
Pruitt, KD ;
Tatusova, T ;
Maglott, DR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :34-37
[8]   The EMBL Nucleotide Sequence Database: major new developments [J].
Stoesser, G ;
Baker, W ;
van den Broek, A ;
Garcia-Pastor, M ;
Kanz, C ;
Kulikova, T ;
Leinonen, R ;
Lin, Q ;
Lombard, V ;
Lopez, R ;
Mancuso, R ;
Nardone, F ;
Stoehr, P ;
Tuli, MA ;
Tzouvara, K ;
Vaughan, R .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :17-22
[9]   Genew: The Human Gene Nomenclature Database [J].
Wain, HM ;
Lush, M ;
Ducluzeau, F ;
Povey, S .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :169-171