Unsupervised segmentation of continuous genomic data

被引:73
作者
Day, Nathan
Hemmaplardh, Andrew
Thurman, Robert E. [1 ]
Stamatoyannopoulos, John A.
Noble, William S.
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[2] Univ Washington, Div Med Genet, Seattle, WA 98195 USA
[3] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
关键词
D O I
10.1093/bioinformatics/btm096
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The advent of high-density, high-volume genomic data has created the need for tools to summarize large datasets at multiple scales. HMMSeg is a command-line utility for the scale-specific segmentation of continuous genomic data using hidden Markov models (HMMs). Scale specificity is achieved by an optional wavelet-based smoothing operation. HMMSeg is capable of handling multiple datasets simultaneously, rendering it ideal for integrative analysis of expression, phylogenetic and functional genomic data.
引用
收藏
页码:1424 / 1426
页数:3
相关论文
共 10 条
  • [1] EDDY SR, 1995, P 3 INT C INT SYST M, P114
  • [2] *ENCODE CONS, 2007, IN PRESS NATURE
  • [3] The ENCODE (ENCyclopedia of DNA elements) Project
    Feingold, EA
    Good, PJ
    Guyer, MS
    Kamholz, S
    Liefer, L
    Wetterstrand, K
    Collins, FS
    Gingeras, TR
    Kampa, D
    Sekinger, EA
    Cheng, J
    Hirsch, H
    Ghosh, S
    Zhu, Z
    Pate, S
    Piccolboni, A
    Yang, A
    Tammana, H
    Bekiranov, S
    Kapranov, P
    Harrison, R
    Church, G
    Struhl, K
    Ren, B
    Kim, TH
    Barrera, LO
    Qu, C
    Van Calcar, S
    Luna, R
    Glass, CK
    Rosenfeld, MG
    Guigo, R
    Antonarakis, SE
    Birney, E
    Brent, M
    Pachter, L
    Reymond, A
    Dermitzakis, ET
    Dewey, C
    Keefe, D
    Denoeud, F
    Lagarde, J
    Ashurst, J
    Hubbard, T
    Wesselink, JJ
    Castelo, R
    Eyras, E
    Myers, RM
    Sidow, A
    Batzoglou, S
    [J]. SCIENCE, 2004, 306 (5696) : 636 - 640
  • [4] The human genome browser at UCSC
    Kent, WJ
    Sugnet, CW
    Furey, TS
    Roskin, KM
    Pringle, TH
    Zahler, AM
    Haussler, D
    [J]. GENOME RESEARCH, 2002, 12 (06) : 996 - 1006
  • [5] Wavelets in bioinformatics and computational biology:: state of art and perspectives
    Liò, P
    [J]. BIOINFORMATICS, 2003, 19 (01) : 2 - 9
  • [6] TigrScan and GlimmerHMM:: two open source ab initio eukaryotic gene-finders
    Majoros, WH
    Pertea, M
    Salzberg, SL
    [J]. BIOINFORMATICS, 2004, 20 (16) : 2878 - 2879
  • [7] Percival DB, 2000, WAVELET METHODS TIME
  • [8] A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION
    RABINER, LR
    [J]. PROCEEDINGS OF THE IEEE, 1989, 77 (02) : 257 - 286
  • [9] THURMAN RE, 2007, IN PRESS GENOME RES
  • [10] Young S., 1995, HTK BOOK