Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems

被引:30
作者
Angelini, Claudia [1 ,2 ]
Costa, Valerio [2 ,3 ]
机构
[1] CNR, Ist Applicaz Calcolo M Picone, Via Pietro Castellinol 111, I-80131 Naples, Italy
[2] Computat & Biol Open Lab ComBOlab, Naples, Italy
[3] CNR, Inst Genet & Biophys A Buzzati Traverso, Naples, Italy
来源
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY | 2014年 / 2卷
关键词
ChIP-seq; data integration; gene regulatory mechanisms; RNA-seq; statistics;
D O I
10.3389/fcell.2014.00051
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
The availability of omic data produced from international consortia, as well as from worldwide laboratories, is offering the possibility both to answer long-standing questions in biomedicine/molecular biology and to formulate novel hypotheses to test. However, the impact of such data is not fully exploited due to a limited availability of multi-omic data integration tools and methods. In this paper, we discuss the interplay between gene expression and epigenetic markers/transcription factors. We show how integrating ChIP-seq and RNA-seq data can help to elucidate gene regulatory mechanisms. In particular, we discuss the two following questions: (i) Can transcription factor occupancies or histone modification data predict gene expression? (ii) Can ChIP-seq and RNA-seq data be used to infer gene regulatory networks? We propose potential directions for statistical data integration. We discuss the importance of incorporating underestimated aspects (such as alternative splicing and long-range chromatin interactions). We also highlight the lack of data benchmarks and the need to develop tools for data integration from a statistical viewpoint, designed in the spirit of reproducible research.
引用
收藏
页数:8
相关论文
共 67 条
[1]   Predictive Models of Gene Regulation from High-Throughput Epigenomics Data [J].
Althammer, Sonja ;
Pages, Amadis ;
Eyras, Eduardo .
COMPARATIVE AND FUNCTIONAL GENOMICS, 2012,
[2]   Computational approaches for isoform detection and estimation: good and bad news [J].
Angelini, Claudia ;
De Canditiis, Daniela ;
De Feis, Italia .
BMC BIOINFORMATICS, 2014, 15
[3]   REGULATION OF GENE EXPRESSION IN THE GENOMIC CONTEXT [J].
Atkinson, Taylor J. ;
Halfon, Marc S. .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2014, 9 (13)
[4]   Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data [J].
Bailey, Timothy ;
Krajewski, Pawel ;
Ladunga, Istvan ;
Lefebvre, Celine ;
Li, Qunhua ;
Liu, Tao ;
Madrigal, Pedro ;
Taslim, Cenny ;
Zhang, Jie .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (11)
[5]   ISMARA: automated modeling of genomic signals as a democracy of regulatory motifs [J].
Balwierz, Piotr J. ;
Pachkov, Mikhail ;
Arnold, Phil ;
Gruber, Andreas J. ;
Zavolan, Mihaela ;
van Nimwegen, Erik .
GENOME RESEARCH, 2014, 24 (05) :869-884
[6]   Chromatin modifiers and remodellers: regulators of cellular differentiation [J].
Chen, Taiping ;
Dent, Sharon Y. R. .
NATURE REVIEWS GENETICS, 2014, 15 (02) :93-106
[7]  
Cheng C, 2011, GENOME BIOL, V12
[8]   Understanding transcriptional regulation by integrative analysis of transcription factor binding data [J].
Cheng, Chao ;
Alexander, Roger ;
Min, Renqiang ;
Leng, Jing ;
Yip, Kevin Y. ;
Rozowsky, Joel ;
Yan, Koon-Kiu ;
Dong, Xianjun ;
Djebali, Sarah ;
Ruan, Yijun ;
Davis, Carrie A. ;
Carninci, Piero ;
Lassman, Timo ;
Gingerasi, Thomas R. ;
Guigo, Roderic ;
Birney, Ewan ;
Weng, Zhiping ;
Snyder, Michael ;
Gerstein, Mark .
GENOME RESEARCH, 2012, 22 (09) :1658-1667
[9]   Modeling the relative relationship of transcription factor binding and histone modifications to gene expression levels in mouse embryonic stem cells [J].
Cheng, Chao ;
Gerstein, Mark .
NUCLEIC ACIDS RESEARCH, 2012, 40 (02) :553-568
[10]   Construction and Analysis of an Integrated Regulatory Network Derived from High-Throughput Sequencing Data [J].
Cheng, Chao ;
Yan, Koon-Kiu ;
Hwang, Woochang ;
Qian, Jiang ;
Bhardwaj, Nitin ;
Rozowsky, Joel ;
Lu, Zhi John ;
Niu, Wei ;
Alves, Pedro ;
Kato, Masaomi ;
Snyder, Michael ;
Gerstein, Mark .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (11)