Multiple alignment of complete sequences (MACS) in the post-genomic era

被引:44
作者
Lecompte, O [1 ]
Thompson, JD [1 ]
Plewniak, F [1 ]
Thierry, JC [1 ]
Poch, O [1 ]
机构
[1] ULP, INSERM, CNRS, Inst Genet & Biol Mol & Cellulaire,Lab Biol & Gen, F-67404 Illkirch Graffenstaden, France
关键词
bioinformatics; sequence analysis; functional genomics; genome annotation;
D O I
10.1016/S0378-1119(01)00461-9
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Multiple alignment, since its introduction in the early seventies, has become a cornerstone of modem molecular biology. It has traditionally been used to deduce structure / function by homology, to detect conserved motifs and in phylogenetic studies. There has recently been some renewed interest in the development of multiple alignment techniques, with current opinion moving away from a single all-encompassing algorithm to iterative and / or co-operative strategies. The exploitation of multiple alignments in genome annotation projects represents a qualitative leap in the functional analysis process, opening the way to the study of the co-evolution of validated sets of proteins and to reliable phylogenomic analysis. However, the alignment of the highly complex proteins detected by today's advanced database search methods is a daunting task, In addition, with the explosion of the sequence databases and with the establishment of numerous specialized biological databases, multiple alignment programs must evolve if they are to successfully rise to the new challenges of the post-genomic era. The way forward is clearly an integrated system bringing together sequence data, know-ledge-based systems and prediction methods with their inherent unreliability. The incorporation of such heterogeneous, often non-consistent, data will require major changes to the fundamental alignment algorithms used to date. Such an integrated multiple alignment system will provide an ideal workbench for the validation, propagation and presentation of this information in a format that is concise, clear and intuitive. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:17 / 30
页数:14
相关论文
共 140 条
  • [51] HEIN J, 1990, METHOD ENZYMOL, V183, P626
  • [52] Increased coverage of protein families with the Blocks Database servers
    Henikoff, JG
    Greene, EA
    Pietrokovski, S
    Henikoff, S
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 228 - 230
  • [53] Gene families: The taxonomy of protein paralogs and chimeras
    Henikoff, S
    Greene, EA
    Pietrokovski, S
    Bork, P
    Attwood, TK
    Hood, L
    [J]. SCIENCE, 1997, 278 (5338) : 609 - 614
  • [54] Mycoplasma pneumoniae and Mycoplasma genitalium: a comparison of two closely related bacterial species
    Herrmann, R
    Reiner, B
    [J]. CURRENT OPINION IN MICROBIOLOGY, 1998, 1 (05) : 572 - 579
  • [55] Comparative analysis of the genomes of the bacteria Mycoplasma pneumoniae and Mycoplasma genitalium
    Himmelreich, R
    Plagens, H
    Hilbert, H
    Reiner, B
    Herrmann, R
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (04) : 701 - 712
  • [56] The GeneQuiz Web server: protein functional analysis through the Web
    Hoersch, S
    Leroy, C
    Brown, NP
    Andrade, MA
    Sander, C
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 2000, 25 (01) : 33 - 35
  • [57] The PROSITE database, its status in 1999
    Hofmann, K
    Bucher, P
    Falquet, L
    Bairoch, A
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 215 - 219
  • [58] Protein folds and families: sequence and structure alignments
    Holm, L
    Sander, C
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 244 - 247
  • [59] The FSSP database: Fold classification based on structure structure alignment of proteins
    Holm, L
    Sander, C
    [J]. NUCLEIC ACIDS RESEARCH, 1996, 24 (01) : 206 - 209
  • [60] Evolution of simple sequence in proteins
    Huntley, M
    Golding, GB
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2000, 51 (02) : 131 - 140