Detecting site-specific biochemical constraints through substitution mapping

被引:9
作者
Dutheil, Julien [1 ,2 ]
机构
[1] Univ Aarhus, BiRC Bioinformat Res Ctr, DK-8000 Aarhus C, Denmark
[2] Univ Montpellier 2, Inst Sci Evolut, CNRS, UM2, F-34095 Montpellier, France
关键词
constraint; protein; phylogeny; substitution mapping; Markov model;
D O I
10.1007/s00239-008-9139-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The neutral theory of molecular evolution states that most mutations are deleterious or neutral. It results that the evolutionary rate of a given position in an alignment is a function of the level of constraint acting on this position. Inferring evolutionary rates from a set of aligned sequences is hence a powerful method to detect functionally and/or structurally important positions in a protein. Some positions, however, may be constrained while having a high substitution rate, providing these substitutions do not affect the biochemical property under constraint. Here, I introduce a new evolutionary rate measure accounting for the evolution of specific biochemical properties (e.g., volume, polarity, and charge). I then present a new statistical method based on the comparison of two rate measures: a site is said to be constrained for property X if it shows an unexpectedly high conservation of X knowing its total evolutionary rate. Compared to single-rate methods, the two-rate method offers several advantages: it (i) allows assessment of the significance of the constraint, (ii) provides information on the type of constraint acting on each position, and (iii) detects positions that are not proposed by previous methods. I apply this method to a 200-sequence data set of triosephosphate isomerase and report significant cases of positions constrained for polarity, volume, or charge. The three-dimensional localization of these positions shows that they are of potential interest to the molecular evolutionist and to the biochemist.
引用
收藏
页码:257 / 265
页数:9
相关论文
共 36 条
[1]  
[Anonymous], 2006, R LANG ENV STAT COMP
[2]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]  
Chessel D., 2004, R NEWS, V4, P6, DOI DOI 10.2307/3780087
[5]   A model-based approach for detecting coevolving positions in a molecule [J].
Dutheil, J ;
Pupko, T ;
Jean-Marie, A ;
Galtier, N .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (09) :1919-1928
[6]   Detecting groups of coevolving positions in a molecule: a clustering approach [J].
Dutheil, Julien ;
Galtier, Nicolas .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (1)
[7]   Bio++:: a set of C++ libraries for sequence analysis, phylogenetics, molecular evolution and population genetics [J].
Dutheil, Julien ;
Gaillard, Sylvain ;
Bazin, Eric ;
Glemin, Sylvain ;
Ranwez, Vincent ;
Galtier, Nicolas ;
Belkhir, Khalid .
BMC BIOINFORMATICS, 2006, 7 (1)
[8]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[9]  
Felsenstein Joseph, 2004, Inferring_phylogenies, V2
[10]   ConSurf: Identification of Functional Regions in Proteins by Surface-Mapping of Phylogenetic Information [J].
Glaser, F ;
Pupko, T ;
Paz, I ;
Bell, RE ;
Bechor-Shental, D ;
Martz, E ;
Ben-Tal, N .
BIOINFORMATICS, 2003, 19 (01) :163-164