Footer:: A quantitative comparative genomics method for efficient recognition of cis-regulatory elements

被引:17
作者
Corcoran, DL
Feingold, E
Dominick, J
Wright, M
Harnaha, J
Trucco, M
Giannoukakis, N
Benos, PV [1 ]
机构
[1] Univ Pittsburgh, Sch Med, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA
[2] Univ Pittsburgh, Sch Med, Dept Human Genet, GSPH, Pittsburgh, PA 15261 USA
[3] Univ Pittsburgh, Sch Med, Dept Computat Biol, Pittsburgh, PA 15261 USA
[4] Univ Pittsburgh, Sch Med, Inst Canc, Pittsburgh, PA 15261 USA
[5] Childrens Hosp Pittsburgh, Pittsburgh, PA 15213 USA
关键词
D O I
10.1101/gr.2952005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The search for mammalian DNA regulatory regions poses a challenging problem in computational biology. The short length of the DNA patterns compared with the size of the promoter regions and the degeneracy of the patterns makes their identification difficult. One way to overcome this problem is to use evolutionary information to reduce the number of false-positive predictions. We developed a novel method for pattern identification that compares a pair of Putative binding sites in two species (e.g., human and mouse) and assigns two probability scores based on the relative position of the sites in the promoter and their agreement with a known model of binding preferences. We tested the algorithm's ability to predict known binding sites on various promoters. Overall, it exhibited 83% sensitivity and the specificity was 72%, which is a clear improvement over existing methods. Our algorithm also successfully predicted two novel NF-kappa B binding sites in the promoter region of the mouse autotaxin gene (ATX, ENPP2), which we were able to verify by using chromatin immunoprecipitation assay coupled with quantitative real-time PCR.
引用
收藏
页码:840 / 847
页数:8
相关论文
共 36 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] BARASH Y, 2003, 7 ANN INT C COMP MOL
  • [3] Benos P V, 2001, Pac Symp Biocomput, P115
  • [4] Additivity in protein-DNA interactions: how good an approximation is it?
    Benos, PV
    Bulyk, ML
    Stormo, GD
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (20) : 4442 - 4451
  • [5] Probabilistic code for DNA recognition by proteins of the EGR family
    Benos, PV
    Lapedes, AS
    Stormo, GD
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2002, 323 (04) : 701 - 727
  • [6] SELECTION OF DNA-BINDING SITES FOR ZINC FINGERS USING RATIONALLY RANDOMIZED DNA REVEALS CODED INTERACTIONS
    CHOO, Y
    KLUG, A
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (23) : 11168 - 11172
  • [7] Finding functional features in Saccharomyces genomes by phylogenetic footprinting
    Cliften, P
    Sudarsanam, P
    Desikan, A
    Fulton, L
    Fulton, B
    Majors, J
    Waterston, R
    Cohen, BA
    Johnston, M
    [J]. SCIENCE, 2003, 301 (5629) : 71 - 76
  • [8] CORCORAN DL, 2005, IN PRESS NUCL ACIDS
  • [9] Searching for regulatory elements in human noncoding sequences
    Duret, L
    Bucher, P
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (03) : 399 - 406
  • [10] Genomic targets of the human c-Myc protein
    Fernandez, PC
    Frank, SR
    Wang, LQ
    Schroeder, M
    Liu, SX
    Greene, J
    Cocito, A
    Amati, B
    [J]. GENES & DEVELOPMENT, 2003, 17 (09) : 1115 - 1129