Phylogenetic profiles reveal evolutionary relationships within the "twilight zone" of sequence similarity

Cited by: 32
Authors
Chang, Gue Su [1]
Hong, Yoojin [2]
Ko, Kyung Dae [1]
Bhardwaj, Gaurav [1]
Holmes, Edward C. [1]
Patterson, Randen L. [1,3]
van Rossum, Damian B. [1,3]
Affiliations
[1] Penn State Univ, Dept Biol, State Coll, PA 16802 USA
[2] Penn State Univ, Dept Comp Sci & Engn, State Coll, PA 16802 USA
[3] Penn State Univ, Ctr Computat Prote, State Coll, PA 16802 USA
Keywords
ab initio; retroelements; reverse transcriptase; GDDA-BLAST
DOI
10.1073/pnas.0803860105
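Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification codes
07; 0710; 09
Abstract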
Inferring evolutionary relationships among highly divergent protein sequences is a daunting task. In particular, when pairwise sequence alignments between protein sequences fall below 25% identity, the phylogenetic relationships among sequences cannot be estimated with statistical certainty. Here, we show that phylogenetic profiles generated with the Gestalt Domain Detection Algorithm-Basic Local Alignment Tool (GDDA-BLAST) are capable of deriving, ab initio, phylogenetic relationships for highly divergent proteins in a quantifiable and robust manner. Notably, the results from our computational case study of the highly divergent family of retroelements accord with previous estimates of their evolutionary relationships. Taken together, these data demonstrate that GDDA-BLAST provides an independent and powerful measure of evolutionary relationships that does not rely on potentially subjective sequence alignment. We demonstrate that evolutionary relationships can be measured with phylogenetic profiles, and therefore propose that these measurements can provide key insights into relationships among distantly related and/or rapidly evolving proteins.
Pages: 13474-13479
Page count: 6