SEQUENCE LOGOS - A NEW WAY TO DISPLAY CONSENSUS SEQUENCES

被引:2493
作者
SCHNEIDER, TD
STEPHENS, RM
机构
[1] National Cancer Institute, Frederick Cancer Research and Development Center, Laboratory of Mathematical Biology, Frederick, MD 21701, PO Box B
关键词
D O I
10.1093/nar/18.20.6097
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to Its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured In bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns. © 1990 Oxford University Press.
引用
收藏
页码:6097 / 6100
页数:4
相关论文
共 20 条