UniProt: the universal protein knowledgebase

Cited by: 1870
Authors
Bateman, Alex [1]
Martin, Maria Jesus [1]
O'Donovan, Claire [1]
Magrane, Michele [1]
Alpi, Emanuele [1]
Antunes, Ricardo [1]
Bely, Benoit [1]
Bingley, Mark [1]
Bonilla, Carlos [1]
Britto, Ramona [1]
Bursteinas, Borisas [1]
Bye-A-Jee, Hema [1]
Cowley, Andrew [1]
Da Silva, Alan [1]
De Giorgi, Maurizio [1]
Dogan, Tunca [1]
Fazzini, Francesco [1]
Castro, Leyla Garcia [1]
Figueira, Luis [1]
Garmiri, Penelope [1]
Georghiou, George [1]
Gonzalez, Daniel [1]
Hatton-Ellis, Emma [1]
Li, Weizhong [1]
Liu, Wudong [1]
Lopez, Rodrigo [1]
Luo, Jie [1]
Lussi, Yvonne [1]
MacDougall, Alistair [1]
Nightingale, Andrew [1]
Palka, Barbara [1]
Pichler, Klemens [1]
Poggioli, Diego [1]
Pundir, Sangya [1]
Pureza, Luis [1]
Qi, Guoying [1]
Rosanoff, Steven [1]
Saidi, Rabie [1]
Sawford, Tony [1]
Shypitsyna, Aleksandra [1]
Speretta, Elena [1]
Turner, Edward [1]
Tyagi, Nidhi [1]
Volynkin, Vladimir [1]
Wardell, Tony [1]
Warner, Kate [1]
Watkins, Xavier [1]
Zaru, Rossana [1]
Zellner, Hermann [1]
Xenarios, Ioannis [2]
Affiliations
[1] EBI, EMBL, Wellcome Genome Campus, Cambridge CB10 1SD, England
[2] Ctr Med Univ Geneva, SIB, 1 Rue Michel Servet, CH-1211 Geneva 4, Switzerland
[3] Georgetown Univ, Med Ctr, Prot Informat Resource, 3300 Whitehaven St NW,Suite 1200, Washington, DC 20007 USA
Funding
Biotechnology and Biological Sciences Research Council (UK); National Institutes of Health (US);
Keywords
SEQUENCE VARIANTS; RESOURCE; DATABASE; CLASSIFICATION; CHROMOSOMES; GUIDELINES; UNIREF;
DOI
10.1093/nar/gkw1099
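Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
070307 [Chemical Biology]; 071010 [Biochemistry and Molecular Biology];
Abstract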
The UniProt knowledgebase is a large resource of protein sequences and associated detailed annotation. The database contains over 60 million sequences, of which over half a million sequences have been curated by experts who critically review experimental and predicted data for each protein. The remainder are automatically annotated based on rule systems that rely on the expert curated knowledge. Since our last update in 2014, we have more than doubled the number of reference proteomes to 5631, giving a greater coverage of taxonomic diversity. We implemented a pipeline to remove redundant highly similar proteomes that were causing excessive redundancy in UniProt. The initial run of this pipeline reduced the number of sequences in UniProt by 47 million. For our users interested in the accessory proteomes, we have made available sets of pan proteome sequences that cover the diversity of sequences for each species that is found in its strains and sub-strains. To help interpretation of genomic variants, we provide tracks of detailed protein information for the major genome browsers. We provide a SPARQL endpoint that allows complex queries of the more than 22 billion triples of data in UniProt (http://sparql.uniprot.org/).
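As a rough illustration of how the SPARQL endpoint mentioned in the abstract might be queried, the following minimal Python sketch builds a GET request URL that carries a query in the `query` parameter defined by the W3C SPARQL protocol. The endpoint URL is the one given in the abstract. The `up:` prefix, the `up:Protein` class, the example query, and the helper name `build_query_url` are assumptions made for illustration and do not come from this record. The sketch only constructs the URL and sends nothing over the network.

```python
from urllib.parse import urlencode

# Endpoint URL as printed in the abstract.
SPARQL_ENDPOINT = "http://sparql.uniprot.org/"

# Illustrative query only: the up: namespace and the up:Protein class are
# assumptions about the UniProt RDF vocabulary, not taken from this record.
EXAMPLE_QUERY = """\
PREFIX up: <http://purl.uniprot.org/core/>
SELECT ?protein
WHERE { ?protein a up:Protein . }
LIMIT 5
"""


def build_query_url(query: str, endpoint: str = SPARQL_ENDPOINT) -> str:
    """Return a GET URL with the query percent-encoded in the `query` parameter.

    Hypothetical helper: it follows the generic SPARQL protocol convention
    and is not part of any UniProt client library.
    """
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    print(build_query_url(EXAMPLE_QUERY))
```

The resulting URL could then be passed to any HTTP client. Which result formats the endpoint returns, and whether it still answers at this address, would need to be checked against the current UniProt documentation.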
Pages: D158-D169
Page count: 12