Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee

被引:78
作者
Chang, Jia-Ming
Di Tommaso, Paolo
Taly, Jean-Francois
Notredame, Cedric [1 ]
机构
[1] Ctr Genom Regulat CRG, Bioinformat & Genom Program, Barcelona 08003, Spain
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
MEMBRANE-PROTEINS; DATABASE; TOPOLOGY; STRATEGY;
D O I
10.1186/1471-2105-13-S4-S1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Transmembrane proteins (TMPs) constitute about 20 similar to 30% of all protein coding genes. The relative lack of experimental structure has so far made it hard to develop specific alignment methods and the current state of the art (PRALINE T) only manages to recapitulate 50% of the positions in the reference alignments available from the BAliBASE2-ref7. Methods: We show how homology extension can be adapted and combined with a consistency based approach in order to significantly improve the multiple sequence alignment of alpha-helical TMPs. TM-Coffee is a special mode of PSI-Coffee able to efficiently align TMPs, while using a reduced reference database for homology extension. Results: Our benchmarking on BAliBASE2-ref7 alpha-helical TMPs shows a significant improvement over the most accurate methods such as MSAProbs, Kalign, PROMALS, MAFFT, ProbCons and PRALINE T. We also estimated the influence of the database used for homology extension and show that highly non-redundant UniRef databases can be used to obtain similar results at a significantly reduced computational cost over full protein databases. TM-Coffee is part of the T-Coffee package, a web server is also available from http://tcoffee.crg.cat/tmcoffee and a freeware open source code can be downloaded from http://www.tcoffee.org/Packages/Stable/Latest.
引用
收藏
页数:7
相关论文
共 20 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations
    Bahr, A
    Thompson, JD
    Thierry, JC
    Poch, O
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 323 - 326
  • [3] Atypical membrane topology and heteromeric function of Drosophila odorant receptors in vivo
    Benton, R
    Sachse, S
    Michnick, SW
    Vosshall, LB
    [J]. PLOS BIOLOGY, 2006, 4 (02) : 240 - 257
  • [4] NEW ALIGNMENT STRATEGY FOR TRANSMEMBRANE PROTEINS
    CSERZO, M
    BERNASSAU, JM
    SIMON, I
    MAIGRET, B
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (03) : 388 - 396
  • [5] ProbCons: Probabilistic consistency-based multiple sequence alignment
    Do, CB
    Mahabhashyam, MSP
    Brudno, M
    Batzoglou, S
    [J]. GENOME RESEARCH, 2005, 15 (02) : 330 - 340
  • [6] On the accuracy of homology modeling and sequence alignment methods applied to membrane proteins
    Forrest, Lucy R.
    Tang, Christopher L.
    Honig, Barry
    [J]. BIOPHYSICAL JOURNAL, 2006, 91 (02) : 508 - 517
  • [7] Recent developments in the MAFFT multiple sequence alignment program
    Katoh, Kazutaka
    Toh, Hiroyuki
    [J]. BRIEFINGS IN BIOINFORMATICS, 2008, 9 (04) : 286 - 298
  • [8] Upcoming challenges for multiple sequence alignment methods in the high-throughput era
    Kemena, Carsten
    Notredame, Cedric
    [J]. BIOINFORMATICS, 2009, 25 (19) : 2455 - 2465
  • [9] Kalign - an accurate and fast multiple sequence alignment algorithm
    Lassmann, T
    Sonnhammer, ELL
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [10] MSAProbs: multiple sequence alignment based on pair hidden Markov models and partition function posterior probabilities
    Liu, Yongchao
    Schmidt, Bertil
    Maskell, Douglas L.
    [J]. BIOINFORMATICS, 2010, 26 (16) : 1958 - 1964