An approach of orthology detection from homologous sequences under minimum evolution

被引:26
作者
Kim, Kyung Mo [1 ,2 ]
Sung, Samsun [1 ]
Caetano-Anolles, Gustavo [2 ]
Han, Jae Yong [3 ]
Kim, Heebal [1 ]
机构
[1] Seoul Natl Univ, Dept Agr Biotechnol, Lab Bioinformat & Populat Genet, Seoul 151742, South Korea
[2] Univ Illinois, Dept Crop Sci, Urbana, IL 61801 USA
[3] Seoul Natl Univ, Dept Agr Biotechnol, Lab Anim Genet Engn, Seoul 151742, South Korea
关键词
D O I
10.1093/nar/gkn485
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In the field of phylogenetics and comparative genomics, it is important to establish orthologous relationships when comparing homologous sequences. Due to the slight sequence dissimilarity between orthologs and paralogs, it is prone to regarding paralogs as orthologs. For this reason, several methods based on evolutionary distance, phylogeny and BLAST have tried to detect orthologs with more precision. Depending on their algorithmic implementations, each of these methods sometimes has increased false negative or false positive rates. Here, we developed a novel algorithm for orthology detection that uses a distance method based on the phylogenetic criterion of minimum evolution. Our algorithm assumes that sets of sequences exhibiting orthologous relationships are evolutionarily less costly than sets that include one or more paralogous relationships. Calculation of evolutionary cost requires the reconstruction of a neighbor-joining (NJ) tree, but calculations are unaffected by the topology of any given NJ tree. Unlike tree reconciliation, our algorithm appears free from the problem of incorrect topologies of species and gene trees. The reliability of the algorithm was tested in a comparative analysis with two other orthology detection methods using 95 manually curated KOG datasets and 21 experimentally verified EXProt datasets. Sensitivity and specificity estimates indicate that the concept of minimum evolution could be valuable for the detection of orthologs.
引用
收藏
页数:12
相关论文
共 30 条
[1]  
[Anonymous], 1989, Cladistics, DOI DOI 10.1111/J.1096-0031.1989.TB00562.X
[2]   Cytoglobin:: A novel globin type ubiquitously expressed in vertebrate tissues [J].
Burmester, T ;
Ebner, B ;
Weich, B ;
Hankeln, T .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (04) :416-421
[3]   PHYLOGENETIC ANALYSIS - MODELS AND ESTIMATION PROCEDURES [J].
CAVALLISFORZA, LL ;
EDWARDS, AWF .
EVOLUTION, 1967, 21 (03) :550-+
[4]   Assessing Performance of Orthology Detection Strategies Applied to Eukaryotic Genomes [J].
Chen, Feng ;
Mackey, Aaron J. ;
Vermunt, Jeroen K. ;
Roos, David S. .
PLOS ONE, 2007, 2 (04)
[5]   Homology - a personal view on some of the problems [J].
Fitch, WM .
TRENDS IN GENETICS, 2000, 16 (05) :227-231
[6]   DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS [J].
FITCH, WM .
SYSTEMATIC ZOOLOGY, 1970, 19 (02) :99-&
[7]   FITTING THE GENE LINEAGE INTO ITS SPECIES LINEAGE, A PARSIMONY STRATEGY ILLUSTRATED BY CLADOGRAMS CONSTRUCTED FROM GLOBIN SEQUENCES [J].
GOODMAN, M ;
CZELUSNIAK, J ;
MOORE, GW ;
ROMEROHERRERA, AE ;
MATSUDA, G .
SYSTEMATIC ZOOLOGY, 1979, 28 (02) :132-163
[8]  
GRAUR D, 2000, FUNDAMENTALS MOL EVO
[9]  
HALL T. A., 1999, Nucleic Acids Symp Ser, V41, P95, DOI [10.1021/bk-1999-0734.ch008, DOI 10.1021/BK-1999-0734.CH008]
[10]   KEGG: Kyoto Encyclopedia of Genes and Genomes [J].
Kanehisa, M ;
Goto, S .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :27-30