NEIGHBOR JOINING AND MAXIMUM-LIKELIHOOD WITH RNA SEQUENCES - ADDRESSING THE INTERDEPENDENCE OF SITES

Cited by: 80
Authors
TILLIER, ERM
COLLINS, RA
Affiliations
[1] UNIV TORONTO, DEPT BOT, TORONTO, ON, CANADA
[2] UNIV TORONTO, DEPT MOLEC & MED GENET, TORONTO, ON, CANADA
[3] CANADIAN INST ADV RES, EVOLUTIONARY BIOL PROGRAM, TORONTO, ON, CANADA
Keywords
COMPENSATORY SUBSTITUTIONS; EVOLUTIONARY MODEL; RNA EVOLUTION; MAXIMUM LIKELIHOOD; NEIGHBOR-JOINING; PHYLOGENETIC ANALYSIS; COMPUTER SIMULATIONS; STATISTICAL TESTS;
DOI
10.1093/oxfordjournals.molbev.a040195
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Intrastrand base pairings give ribosomal and other RNA molecules characteristic structures that are important for their function. In order to maintain these structures, a substitution at one paired site may have to be compensated for by an appropriate substitution at the complementary site. Thus paired sites do not evolve independently of one another. Most current methods for inferring phylogeny from molecular sequences assume that the sites are independent and will therefore give statistically unreliable and possibly erroneous results when used on structured RNA sequences. We analyze a new probabilistic model for the evolution of double-stranded RNA molecules that considers substitutions of the base pairs rather than of each of the bases independently. The new model, called the double-stranded model, was incorporated into the neighbor-joining distance and maximum likelihood methods. Computer simulations show that maximum likelihood is very robust to the violation of the assumption of the independence of sites. In contrast, the neighbor-joining method is sensitive to such violations: the double-stranded model can provide a significant increase in the chance of obtaining the correct tree topologies with neighbor joining when distances are large and the tree is difficult to obtain. The new model also leads to lower but more realistic estimates for the statistical confidence in the branch lengths and tree topologies.
Pages: 7-15
Page count: 9
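
The abstract's central idea, scoring substitutions per base pair rather than per base, can be illustrated with a small sketch. This is not the paper's double-stranded model, whose rate parameters are not given in this record. It assumes a hypothetical equal-rates (Jukes-Cantor style) correction generalized to n states, a toy pair of sequences, and a hand-written list of paired positions; all function names are illustrative.

```python
import math


def pair_units(seq, pairs):
    """Treat each base pair (i, j) as one two-letter character, e.g. 'GC'."""
    return [seq[i] + seq[j] for i, j in pairs]


def site_units(seq, pairs):
    """Treat the two bases of each pair as separate, independent characters."""
    return [seq[k] for i, j in pairs for k in (i, j)]


def p_distance(units_a, units_b):
    """Proportion of aligned characters that differ between two sequences."""
    if len(units_a) != len(units_b) or not units_a:
        raise ValueError("inputs must be non-empty and of equal length")
    diffs = sum(1 for x, y in zip(units_a, units_b) if x != y)
    return diffs / len(units_a)


def equal_rates_distance(p, n_states):
    """Equal-rates correction for n_states character states.

    Assumption for illustration only: every state changes to every other
    state at the same rate. With n_states=4 this is the Jukes-Cantor formula.
    """
    b = (n_states - 1) / n_states
    if p >= b:
        return math.inf  # saturated: the correction is undefined
    return -b * math.log(1 - p / b)


# Toy stem with three base pairs; the second pair changes G-C -> A-U,
# i.e. one compensatory event that alters two sites at once.
seq_a = "GGCAAAGCC"
seq_b = "GACAAAGUC"
pairs = [(0, 8), (1, 7), (2, 6)]  # hypothetical pairing, 0-based indices

p_sites = p_distance(site_units(seq_a, pairs), site_units(seq_b, pairs))
p_pairs = p_distance(pair_units(seq_a, pairs), pair_units(seq_b, pairs))

d_sites = equal_rates_distance(p_sites, 4)   # per site, 6 characters
d_pairs = equal_rates_distance(p_pairs, 16)  # per pair, 3 characters

print(f"sites: n=6, p={p_sites:.3f}, d={d_sites:.3f}")
print(f"pairs: n=3, p={p_pairs:.3f}, d={d_pairs:.3f}")
```

In this toy case both views give the same proportion of differing characters, but the pair view rests on three characters instead of six. Counting each pair once halves the apparent sample size, which is one way to see why a pair-based model would report lower confidence than a model that treats paired sites as independent. Note that the two distances are in different units (substitutions per site versus per pair) and are not directly comparable.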