HMM Logos for visualization of protein families -: art. no. 7

被引:179
作者
Schuster-Böckler, B
Schultz, J
Rahmann, S [1 ]
机构
[1] Free Univ Berlin, Dept Math & Comp Sci, D-1000 Berlin, Germany
[2] Univ Wurzburg, Biozentrum, Dept Bioinformat, D-97074 Wurzburg, Germany
[3] Max Planck Inst Mol Genet, Dept Computat Mol Biol, D-14195 Berlin, Germany
关键词
D O I
10.1186/1471-2105-5-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Profile Hidden Markov Models (pHMMs) are a widely used tool for protein family research. Up to now, however, there exists no method to visualize all of their central aspects graphically in an intuitively understandable way. Results: We present a visualization method that incorporates both emission and transition probabilities of the pHMM, thus extending sequence logos introduced by Schneider and Stephens. For each emitting state of the pHMM, we display a stack of letters. The stack height is determined by the deviation of the position's letter emission frequencies from the background frequencies. The stack width visualizes both the probability of reaching the state ( the hitting probability) and the expected number of letters the state emits during a pass through the model ( the state's expected contribution). A web interface offering online creation of HMM Logos and the corresponding source code can be found at the Logos web server of the Max Planck Institute for Molecular Genetics http:// logos.molgen.mpg.de. Conclusions: We demonstrate that HMM Logos can be a useful tool for the biologist: We use them to highlight differences between two homologous subfamilies of GTPases, Rab and Ras, and we show that they are able to indicate structural elements of Ras.
引用
收藏
页数:8
相关论文
共 13 条
[1]  
[Anonymous], HMMER USERS GUIDE BI
[2]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[3]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[4]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[5]  
HUGHEY R, 2003, UCSCCRL9911 U CAL BA
[6]   HIDDEN MARKOV-MODELS IN COMPUTATIONAL BIOLOGY - APPLICATIONS TO PROTEIN MODELING [J].
KROGH, A ;
BROWN, M ;
MIAN, IS ;
SJOLANDER, K ;
HAUSSLER, D .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 235 (05) :1501-1531
[7]   Recent improvements to the SMART domain-based sequence annotation resource [J].
Letunic, I ;
Goodstadt, L ;
Dickens, NJ ;
Doerks, T ;
Schultz, J ;
Mott, R ;
Ciccarelli, F ;
Copley, RR ;
Ponting, CP ;
Bork, P .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :242-244
[8]   The mammalian Rab family of small GTPases: Definition of family and subfamily sequence motifs suggests a mechanism for functional specificity in the Ras superfamily [J].
Pereira-Leal, JB ;
Seabra, MC .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 301 (04) :1077-1087
[9]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[10]  
RAHMANN S, 2003, STAT APPL GENET MOL, V2