Upcoming challenges for multiple sequence alignment methods in the high-throughput era

被引:139
作者
Kemena, Carsten [1 ]
Notredame, Cedric [1 ]
机构
[1] Pompeus Fabre Univ, Ctr Genom Regulat, Barcelona 08003, Spain
关键词
PROTEIN-STRUCTURE ALIGNMENT; ACCURATE; ALGORITHM; BENCHMARK; CONSISTENCY; HOMOLOGY; COFFEE; IDENTIFICATION; PREDICTION; MUSCLE;
D O I
10.1093/bioinformatics/btp452
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This review focuses on recent trends in multiple sequence alignment tools. It describes the latest algorithmic improvements including the extension of consistency-based methods to the problem of template-based multiple sequence alignments. Some results are presented suggesting that template-based methods are significantly more accurate than simpler alternative methods. The validation of existing methods is also discussed at length with the detailed description of recent results and some suggestions for future validation strategies. The last part of the review addresses future challenges for multiple sequence alignment methods in the genomic era, most notably the need to cope with very large sequences, the need to integrate large amounts of experimental data, the need to accurately align non-coding and non-transcribed sequences and finally, the need to integrate many alternative methods and approaches.
引用
收藏
页码:2455 / 2465
页数:11
相关论文
共 79 条
[61]   HOMSTRAD: recent developments of the Homologous Protein Structure Alignment Database [J].
Stebbings, LA ;
Mizuguchi, K .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D203-D207
[62]  
Stoye J, 1997, ISMB-97 - FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY, PROCEEDINGS, P303
[63]   DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment [J].
Subramanian, Amarendran R. ;
Kaufmann, Michael ;
Morgenstern, Burkhard .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2008, 3 (1)
[64]   DIALIGN-T: An improved algorithm for segment-based multiple sequence alignment [J].
Subramanian, AR ;
Weyer-Menkhoff, J ;
Kaufmann, M ;
Morgenstern, B .
BMC BIOINFORMATICS, 2005, 6 (1)
[65]   IDENTIFICATION OF PROTEIN-SEQUENCE HOMOLOGY BY CONSENSUS TEMPLATE ALIGNMENT [J].
TAYLOR, WR .
JOURNAL OF MOLECULAR BIOLOGY, 1986, 188 (02) :233-258
[66]   BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs [J].
Thompson, JD ;
Plewniak, F ;
Poch, O .
BIOINFORMATICS, 1999, 15 (01) :87-88
[67]   BAliBASE 3.0: Latest developments of the multiple sequence alignment benchmark [J].
Thompson, JD ;
Koehl, P ;
Ripp, R ;
Poch, O .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 (01) :127-136
[68]   CLUSTAL-W - IMPROVING THE SENSITIVITY OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT THROUGH SEQUENCE WEIGHTING, POSITION-SPECIFIC GAP PENALTIES AND WEIGHT MATRIX CHOICE [J].
THOMPSON, JD ;
HIGGINS, DG ;
GIBSON, TJ .
NUCLEIC ACIDS RESEARCH, 1994, 22 (22) :4673-4680
[69]   SABmark - a benchmark for sequence alignment that covers the entire known fold space [J].
Van Walle, I ;
Lasters, I ;
Wyns, L .
BIOINFORMATICS, 2005, 21 (07) :1267-1268
[70]   MOTIF RECOGNITION AND ALIGNMENT FOR MANY SEQUENCES BY COMPARISON OF DOT-MATRICES [J].
VINGRON, M ;
ARGOS, P .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 218 (01) :33-43