EShadow: A tool for comparing closely related sequences

被引:40
作者
Ovcharenko, I [1 ]
Boffelli, D
Loots, GG
机构
[1] Lawrence Livermore Natl Lab, EEBI, Livermore, CA 94550 USA
[2] Lawrence Livermore Natl Lab, Genome Biol Div, Livermore, CA 94550 USA
[3] Lawrence Berkeley Lab, Dept Genome Sci, Berkeley, CA 94720 USA
关键词
D O I
10.1101/gr.1773104
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Primate sequence comparisons are difficult to interpret due to the high degree Of Sequence similarity shared between such closely related species. Recently, a novel method, phylogenetic shadowing, has been pioneered for predicting functional elements in the human genome through the analysis of multiple primate sequence alignments. We have expanded this theoretical approach to create a computational tool, eShadow, for the identification of elements Under selective pressure in multiple sequence alignments of closely related genomes, such as in comparisons of human-to-primate or mouse-to-rat DNA. This tool integrates two different statistical methods and allows for the dynamic visulaization of the resulting conservation profile. eShadow also includes a versatile optimization module capable of training the Underlying Hidden Markov Model to differentially predict functional sequences. This module grants the tool high flexibility in the analysis Of multiple sequence alignments and in comparing sequences with different divergence rates. Here, we describe the eShadow comparative tool and its potential Uses for analyzing both multiple nucleotide and protein alignments to predict putative functional elements.
引用
收藏
页码:1191 / 1198
页数:8
相关论文
共 43 条
[41]   CLUSTAL-W - IMPROVING THE SENSITIVITY OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT THROUGH SEQUENCE WEIGHTING, POSITION-SPECIFIC GAP PENALTIES AND WEIGHT MATRIX CHOICE [J].
THOMPSON, JD ;
HIGGINS, DG ;
GIBSON, TJ .
NUCLEIC ACIDS RESEARCH, 1994, 22 (22) :4673-4680
[42]   Identification and characterization of subfamily-specific signatures in a large protein superfamily by a hidden Markov model approach [J].
Truong, K ;
Ikura, M .
BMC BIOINFORMATICS, 2002, 3 (1)
[43]   Initial sequencing and comparative analysis of the mouse genome [J].
Waterston, RH ;
Lindblad-Toh, K ;
Birney, E ;
Rogers, J ;
Abril, JF ;
Agarwal, P ;
Agarwala, R ;
Ainscough, R ;
Alexandersson, M ;
An, P ;
Antonarakis, SE ;
Attwood, J ;
Baertsch, R ;
Bailey, J ;
Barlow, K ;
Beck, S ;
Berry, E ;
Birren, B ;
Bloom, T ;
Bork, P ;
Botcherby, M ;
Bray, N ;
Brent, MR ;
Brown, DG ;
Brown, SD ;
Bult, C ;
Burton, J ;
Butler, J ;
Campbell, RD ;
Carninci, P ;
Cawley, S ;
Chiaromonte, F ;
Chinwalla, AT ;
Church, DM ;
Clamp, M ;
Clee, C ;
Collins, FS ;
Cook, LL ;
Copley, RR ;
Coulson, A ;
Couronne, O ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Daly, M ;
David, R ;
Davies, J ;
Delehaunty, KD ;
Deri, J ;
Dermitzakis, ET .
NATURE, 2002, 420 (6915) :520-562