Methods for analyzing deep sequencing expression data: constructing the human and mouse promoterome with deepCAGE data

被引:103
作者
Balwierz, Piotr J. [1 ,2 ]
Carninci, Piero [3 ]
Daub, Carsten O. [3 ]
Kawai, Jun [3 ]
Hayashizaki, Yoshihide [3 ]
Van Belle, Werner [4 ]
Beisel, Christian [4 ]
van Nimwegen, Erik [1 ,2 ]
机构
[1] Univ Basel, Biozentrum, CH-4056 Basel, Switzerland
[2] Swiss Inst Bioinformat, CH-4056 Basel, Switzerland
[3] RIKEN Yokohama Inst, RIKEN Om Sci Ctr, Tsurumi Ku, Kanagawa 2300045, Japan
[4] Eidgenoss Tech Hsch Zurich, Dept Biosyst Sci & Engn, Lab Quantitat Genom, CH-4058 Basel, Switzerland
来源
GENOME BIOLOGY | 2009年 / 10卷 / 07期
基金
瑞士国家科学基金会;
关键词
GENOME-WIDE ANALYSIS; GENE-EXPRESSION; TRANSCRIPTIONAL LANDSCAPE; CAP ANALYSIS; RNA-SEQ; REPRODUCIBILITY; ARRAYS; CAGE;
D O I
10.1186/gb-2009-10-7-r79
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
With the advent of ultra high-throughput sequencing technologies, increasingly researchers are turning to deep sequencing for gene expression studies. Here we present a set of rigorous methods for normalization, quantification of noise, and co-expression analysis of deep sequencing data. Using these methods on 122 cap analysis of gene expression (CAGE) samples of transcription start sites, we construct genome-wide 'promoteromes' in human and mouse consisting of a three-tiered hierarchy of transcription start sites, transcription start clusters, and transcription start regions.
引用
收藏
页数:21
相关论文
共 27 条
[1]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[2]   The transcriptional landscape of the mammalian genome [J].
Carninci, P ;
Kasukawa, T ;
Katayama, S ;
Gough, J ;
Frith, MC ;
Maeda, N ;
Oyama, R ;
Ravasi, T ;
Lenhard, B ;
Wells, C ;
Kodzius, R ;
Shimokawa, K ;
Bajic, VB ;
Brenner, SE ;
Batalov, S ;
Forrest, ARR ;
Zavolan, M ;
Davis, MJ ;
Wilming, LG ;
Aidinis, V ;
Allen, JE ;
Ambesi-Impiombato, X ;
Apweiler, R ;
Aturaliya, RN ;
Bailey, TL ;
Bansal, M ;
Baxter, L ;
Beisel, KW ;
Bersano, T ;
Bono, H ;
Chalk, AM ;
Chiu, KP ;
Choudhary, V ;
Christoffels, A ;
Clutterbuck, DR ;
Crowe, ML ;
Dalla, E ;
Dalrymple, BP ;
de Bono, B ;
Della Gatta, G ;
di Bernardo, D ;
Down, T ;
Engstrom, P ;
Fagiolini, M ;
Faulkner, G ;
Fletcher, CF ;
Fukushima, T ;
Furuno, M ;
Futaki, S ;
Gariboldi, M .
SCIENCE, 2005, 309 (5740) :1559-1563
[3]   Genome-wide analysis of mammalian promoter architecture and evolution [J].
Carninci, Piero ;
Sandelin, Albin ;
Lenhard, Boris ;
Katayama, Shintaro ;
Shimokawa, Kazuro ;
Ponjavic, Jasmina ;
Semple, Colin A. M. ;
Taylor, Martin S. ;
Engström, Par G. ;
Frith, Martin C. ;
Forrest, Alistair R. R. ;
Alkema, Wynand B. ;
Tan, Sin Lam ;
Plessy, Charles ;
Kodzius, Rimantas ;
Ravasi, Timothy ;
Kasukawa, Takeya ;
Fukuda, Shiro ;
Kanamori-Katayama, Mutsumi ;
Kitazume, Yayoi ;
Kawaji, Hideya ;
Kai, Chikatoshi ;
Nakamura, Mari ;
Konno, Hideaki ;
Nakano, Kenji ;
Mottagui-Tabar, Salim ;
Arner, Peter ;
Chesi, Alessandra ;
Gustincich, Stefano ;
Persichetti, Francesca ;
Suzuki, Harukazu ;
Grimmond, Sean M. ;
Wells, Christine A. ;
Orlando, Valerio ;
Wahlestedt, Claes ;
Liu, Edison T. ;
Harbers, Matthias ;
Kawai, Jun ;
Bajic, Vladimir B. ;
Hume, David A. ;
Hayashizaki, Yoshihide .
NATURE GENETICS, 2006, 38 (06) :626-635
[4]   Tagging mammalian transcription complexity [J].
Carninci, Piero .
TRENDS IN GENETICS, 2006, 22 (09) :501-510
[5]   Reliability and reproducibility issues in DNA microarray measurements [J].
Draghici, S ;
Khatri, P ;
Eklund, AC ;
Szallasi, Z .
TRENDS IN GENETICS, 2006, 22 (02) :101-109
[6]   A rescue strategy for multimapping short sequence tags refines surveys of transcriptional activity by CAGE [J].
Faulkner, Geoffrey J. ;
Forrest, Alistair R. R. ;
Chalk, Alistair M. ;
Schroder, Kate ;
Hayashizaki, Yoshihide ;
Carninci, Piero ;
Hume, David A. ;
Grimmond, Sean M. .
GENOMICS, 2008, 91 (03) :281-288
[7]   A code for transcription initiation in mammalian genomes [J].
Frith, Martin C. ;
Valen, Eivind ;
Krogh, Anders ;
Hayashizaki, Yoshihide ;
Carninci, Piero ;
Sandelin, Albin .
GENOME RESEARCH, 2008, 18 (01) :1-12
[8]  
Jaynes Edwin T, 2003, Probability Theory: The Logic of Science, DOI 10.1017/CBO9780511790423
[9]   The UCSC Genome Browser Database: 2008 update [J].
Karolchik, D. ;
Kuhn, R. M. ;
Baertsch, R. ;
Barber, G. P. ;
Clawson, H. ;
Diekhans, M. ;
Giardine, B. ;
Harte, R. A. ;
Hinrichs, A. S. ;
Hsu, F. ;
Kober, K. M. ;
Miller, W. ;
Pedersen, J. S. ;
Pohl, A. ;
Raney, B. J. ;
Rhead, B. ;
Rosenbloom, K. R. ;
Smith, K. E. ;
Stanke, M. ;
Thakkapallayil, A. ;
Trumbower, H. ;
Wang, T. ;
Zweig, A. S. ;
Haussler, D. ;
Kent, W. J. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D773-D779
[10]   CAGE: cap analysis of gene expression [J].
Kodzius, R ;
Kojima, M ;
Nishiyori, H ;
Nakamura, M ;
Fukuda, S ;
Tagami, M ;
Sasaki, D ;
Imamura, K ;
Kai, C ;
Harbers, M ;
Hayashizaki, Y ;
Carninci, P .
NATURE METHODS, 2006, 3 (03) :211-222