ConSurf: Using Evolutionary Data to Raise Testable Hypotheses about Protein Function

被引:417
作者
Celniker, Gershon [1 ,2 ,5 ]
Nimrod, Guy [1 ]
Ashkenazy, Haim [2 ]
Glaser, Fabian [3 ]
Martz, Eric [4 ]
Mayrose, Itay [5 ]
Pupko, Tal [2 ]
Ben-Tal, Nir [1 ]
机构
[1] Tel Aviv Univ, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Dept Cell Res & Immunol, IL-69978 Tel Aviv, Israel
[3] Technion Israel Inst Technol, Bioinformat Knowledge Unit, Lorry I Lokey Interdisciplinary Ctr Life Sci & En, IL-32000 Haifa, Israel
[4] Univ Massachusetts, Dept Microbiol, Amherst, MA 01003 USA
[5] Tel Aviv Univ, Dept Mol Biol & Ecol Plants, IL-69978 Tel Aviv, Israel
关键词
bioinformatics; databases; genomics; protein models; protein structures; MULTIPLE SEQUENCE ALIGNMENT; WEB SERVER; IDENTIFICATION; INFERENCE; SURFACES; POLYMORPHISMS; CONSERVATION; GENERATION; PREDICTION; INTERFACES;
D O I
10.1002/ijch.201200096
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Many mutations disappear from the population because they impair protein function and/or stability. Thus, amino acid positions that are essential for proper function evolve more slowly than others, or in other words, the slow evolutionary rate of a position reflects its importance. ConSurf (http://consurf.tau.ac.il), reviewed in this manuscript, exploits this to reveal key amino acid positions that are important for maintaining the native conformation(s) of the protein and its function, be it binding, catalysis, transport, etc. Given the sequence or 3D structure of the query protein as input, a search for similar sequences is conducted and the sequences are aligned. The multiple sequence alignment is subsequently used to calculate the evolutionary rates of each amino acid site, using Bayesian or maximum-likelihood algorithms. Both algorithms take into account the evolutionary relationships between the sequences, reflected in phylogenetic trees, to alleviate problems due to uneven (biased) sampling in sequence space. This is particularly important when the number of sequences is low. The ConSurf-DB, a new release of which is presented here, provides precalculated ConSurf conservation analysis of nearly all available structures in the Protein DataBank (PDB). The usefulness of ConSurf for the study of individual proteins and mutations, as well as a range of large-scale, genome-wide applications, is reviewed.
引用
收藏
页码:199 / 206
页数:8
相关论文
共 52 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Discriminative modelling of context-specific amino acid substitution probabilities [J].
Angermueller, Christof ;
Biegert, Andreas ;
Soeding, Johannes .
BIOINFORMATICS, 2012, 28 (24) :3240-3247
[3]  
[Anonymous], PYMOL MOL GRAPH SYST
[4]   Reorganizing the protein space at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bower, Lawrence ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Da Silva, Alan ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Garavelli, John ;
Castro, Leyla Garcia ;
Gardner, Michael ;
Hieta, Reija ;
Huntley, Rachael ;
Jacobsen, Julius ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pundir, Sangya ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Wardell, Tony ;
Watkins, Xavier ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Goujon, Mickael ;
McWilliam, Hamish ;
Lopez, Rodrigo ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Bridge, Alan ;
Poux, Sylvain ;
Redaschi, Nicole .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D71-D75
[5]   ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information [J].
Armon, A ;
Graur, D ;
Ben-Tal, N .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) :447-463
[6]   ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids [J].
Ashkenazy, Haim ;
Erez, Elana ;
Martz, Eric ;
Pupko, Tal ;
Ben-Tal, Nir .
NUCLEIC ACIDS RESEARCH, 2010, 38 :W529-W533
[7]   ConSeq: the identification of functionally and structurally important residues in protein sequences [J].
Berezin, C ;
Glaser, F ;
Rosenberg, J ;
Paz, I ;
Pupko, T ;
Fariselli, P ;
Casadio, R ;
Ben-Tal, N .
BIOINFORMATICS, 2004, 20 (08) :1322-1324
[8]   SNAP: predict effect of non-synonymous polymorphisms on function [J].
Bromberg, Yana ;
Rost, Burkhard .
NUCLEIC ACIDS RESEARCH, 2007, 35 (11) :3823-3835
[9]   Crystal structures of oseltamivir-resistant influenza virus neuraminidase mutants [J].
Collins, Patrick J. ;
Haire, Lesley F. ;
Lin, Yi Pu ;
Liu, Junfeng ;
Russell, Rupert J. ;
Walker, Philip A. ;
Skehel, John J. ;
Martin, Stephen R. ;
Hay, Alan J. ;
Gamblin, Steven J. .
NATURE, 2008, 453 (7199) :1258-U61
[10]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797