RNA-sequence analysis of human B-cells

Cited by: 111
Authors
Toung, Jonathan M. [2 ]
Morley, Michael [1 ]
Li, Mingyao [3 ,4 ]
Cheung, Vivian G. [1 ,5 ,6 ,7 ]
Affiliations
[1] Childrens Hosp Philadelphia, Philadelphia, PA 19104 USA
[2] Univ Penn, Genom & Computat Biol Program, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Biostat, Philadelphia, PA 19104 USA
[4] Univ Penn, Dept Epidemiol, Philadelphia, PA 19104 USA
[5] Univ Penn, Dept Pediat, Philadelphia, PA 19104 USA
[6] Univ Penn, Dept Genet, Philadelphia, PA 19104 USA
[7] Univ Penn, Howard Hughes Med Inst, Philadelphia, PA 19104 USA
Funding
National Institutes of Health (USA);
Keywords
TRANSCRIPTOME;
DOI
10.1101/gr.116335.110
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification codes
071010; 081704;
Abstract
RNA-sequencing (RNA-seq) allows quantitative measurement of expression levels of genes and their transcripts. In this study, we sequenced complementary DNA fragments of cultured human B-cells and obtained 879 million 50-bp reads comprising 44 Gb of sequence. The results allowed us to study the gene expression profile of B-cells and to determine experimental parameters for sequencing-based expression studies. We identified 20,766 genes and 67,453 of their alternatively spliced transcripts. More than 90% of the genes with multiple exons are alternatively spliced; for most genes, one isoform is predominantly expressed. We found that while chromosomes differ in gene density, the percentage of transcribed genes in each chromosome is less variable. In addition, genes involved in related biological processes are expressed at more similar levels than genes with different functions. Besides characterizing gene expression, we also used the data to investigate the effect of sequencing depth on gene expression measurements. While 100 million reads are sufficient to detect most expressed genes and transcripts, about 500 million reads are needed to measure accurately their expression levels. We provide examples in which deep sequencing is needed to determine the relative abundance of genes and their isoforms. With data from 20 individuals and about 40 million sequence reads per sample, we uncovered only 21 alternatively spliced, multi-exon genes that are not in databases; this result suggests that at this sequence coverage, we can detect most of the known genes. Results from this project are available on the UCSC Genome Browser to allow readers to study the expression and structure of genes in human B-cells.
Pages: 991-998
Page count: 8