Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs

被引:592
作者
Hangauer, Matthew J. [1 ]
Vaughn, Ian W. [1 ]
McManus, Michael T. [1 ]
机构
[1] Univ Calif San Francisco, Dept Microbiol & Immunol, Ctr Diabet, San Francisco, CA 94143 USA
来源
PLOS GENETICS | 2013年 / 9卷 / 06期
关键词
HUMAN-CELLS; GENE-EXPRESSION; REVEALS; IDENTIFICATION; ANNOTATION; CHROMATIN; ASSOCIATION; EVOLUTION; LINCRNAS; SEQUENCE;
D O I
10.1371/journal.pgen.1003569
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Known protein coding gene exons compose less than 3% of the human genome. The remaining 97% is largely uncharted territory, with only a small fraction characterized. The recent observation of transcription in this intergenic territory has stimulated debate about the extent of intergenic transcription and whether these intergenic RNAs are functional. Here we directly observed with a large set of RNA-seq data covering a wide array of human tissue types that the majority of the genome is indeed transcribed, corroborating recent observations by the ENCODE project. Furthermore, using de novo transcriptome assembly of this RNA-seq data, we found that intergenic regions encode far more long intergenic noncoding RNAs (lincRNAs) than previously described, helping to resolve the discrepancy between the vast amount of observed intergenic transcription and the limited number of previously known lincRNAs. In total, we identified tens of thousands of putative lincRNAs expressed at a minimum of one copy per cell, significantly expanding upon prior lincRNA annotation sets. These lincRNAs are specifically regulated and conserved rather than being the product of transcriptional noise. In addition, lincRNAs are strongly enriched for trait-associated SNPs suggesting a new mechanism by which intergenic trait-associated regions may function. These findings will enable the discovery and interrogation of novel intergenic functional elements.
引用
收藏
页数:13
相关论文
共 46 条
  • [1] Long noncoding RNAs are rarely translated in two human cell lines
    Banfai, Balazs
    Jia, Hui
    Khatun, Jainab
    Wood, Emily
    Risk, Brian
    Gundling, William E., Jr.
    Kundaje, Anshul
    Gunawardena, Harsha P.
    Yu, Yanbao
    Xie, Ling
    Krajewski, Krzysztof
    Strahl, Brian D.
    Chen, Xian
    Bickel, Peter
    Giddings, Morgan C.
    Brown, James B.
    Lipovich, Leonard
    [J]. GENOME RESEARCH, 2012, 22 (09) : 1646 - 1657
  • [2] Global identification of human transcribed sequences with genome tiling arrays
    Bertone, P
    Stolc, V
    Royce, TE
    Rozowsky, JS
    Urban, AE
    Zhu, XW
    Rinn, JL
    Tongprasit, W
    Samanta, M
    Weissman, S
    Gerstein, M
    Snyder, M
    [J]. SCIENCE, 2004, 306 (5705) : 2242 - 2246
  • [3] Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project
    Birney, Ewan
    Stamatoyannopoulos, John A.
    Dutta, Anindya
    Guigo, Roderic
    Gingeras, Thomas R.
    Margulies, Elliott H.
    Weng, Zhiping
    Snyder, Michael
    Dermitzakis, Emmanouil T.
    Stamatoyannopoulos, John A.
    Thurman, Robert E.
    Kuehn, Michael S.
    Taylor, Christopher M.
    Neph, Shane
    Koch, Christoph M.
    Asthana, Saurabh
    Malhotra, Ankit
    Adzhubei, Ivan
    Greenbaum, Jason A.
    Andrews, Robert M.
    Flicek, Paul
    Boyle, Patrick J.
    Cao, Hua
    Carter, Nigel P.
    Clelland, Gayle K.
    Davis, Sean
    Day, Nathan
    Dhami, Pawandeep
    Dillon, Shane C.
    Dorschner, Michael O.
    Fiegler, Heike
    Giresi, Paul G.
    Goldy, Jeff
    Hawrylycz, Michael
    Haydock, Andrew
    Humbert, Richard
    James, Keith D.
    Johnson, Brett E.
    Johnson, Ericka M.
    Frum, Tristan T.
    Rosenzweig, Elizabeth R.
    Karnani, Neerja
    Lee, Kirsten
    Lefebvre, Gregory C.
    Navas, Patrick A.
    Neri, Fidencio
    Parker, Stephen C. J.
    Sabo, Peter J.
    Sandstrom, Richard
    Shafer, Anthony
    [J]. NATURE, 2007, 447 (7146) : 799 - 816
  • [4] Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses
    Cabili, Moran N.
    Trapnell, Cole
    Goff, Loyal
    Koziol, Magdalena
    Tazon-Vega, Barbara
    Regev, Aviv
    Rinn, John L.
    [J]. GENES & DEVELOPMENT, 2011, 25 (18) : 1915 - 1927
  • [5] Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution
    Cheng, J
    Kapranov, P
    Drenkow, J
    Dike, S
    Brubaker, S
    Patel, S
    Long, J
    Stern, D
    Tammana, H
    Helt, G
    Sementchenko, V
    Piccolboni, A
    Bekiranov, S
    Bailey, DK
    Ganesh, M
    Ghosh, S
    Bell, I
    Gerhard, DS
    Gingeras, TR
    [J]. SCIENCE, 2005, 308 (5725) : 1149 - 1154
  • [6] The Reality of Pervasive Transcription
    Clark, Michael B.
    Amaral, Paulo P.
    Schlesinger, Felix J.
    Dinger, Marcel E.
    Taft, Ryan J.
    Rinn, John L.
    Ponting, Chris P.
    Stadler, Peter F.
    Morris, Kevin V.
    Morillon, Antonin
    Rozowsky, Joel S.
    Gerstein, Mark B.
    Wahlestedt, Claes
    Hayashizaki, Yoshihide
    Carninci, Piero
    Gingeras, Thomas R.
    Mattick, John S.
    [J]. PLOS BIOLOGY, 2011, 9 (07)
  • [7] The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression
    Derrien, Thomas
    Johnson, Rory
    Bussotti, Giovanni
    Tanzer, Andrea
    Djebali, Sarah
    Tilgner, Hagen
    Guernec, Gregory
    Martin, David
    Merkel, Angelika
    Knowles, David G.
    Lagarde, Julien
    Veeravalli, Lavanya
    Ruan, Xiaoan
    Ruan, Yijun
    Lassmann, Timo
    Carninci, Piero
    Brown, James B.
    Lipovich, Leonard
    Gonzalez, Jose M.
    Thomas, Mark
    Davis, Carrie A.
    Shiekhattar, Ramin
    Gingeras, Thomas R.
    Hubbard, Tim J.
    Notredame, Cedric
    Harrow, Jennifer
    Guigo, Roderic
    [J]. GENOME RESEARCH, 2012, 22 (09) : 1775 - 1789
  • [8] The expanding RNA polymerase III transcriptome
    Dieci, Giorgio
    Fiorino, Gloria
    Castelnuovo, Manuele
    Teichmann, Martin
    Pagano, Aldo
    [J]. TRENDS IN GENETICS, 2007, 23 (12) : 614 - 622
  • [9] Differentiating Protein-Coding and Noncoding RNA: Challenges and Ambiguities
    Dinger, Marcel E.
    Pang, Ken C.
    Mercer, Tim R.
    Mattick, John S.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (11)
  • [10] Landscape of transcription in human cells
    Djebali, Sarah
    Davis, Carrie A.
    Merkel, Angelika
    Dobin, Alex
    Lassmann, Timo
    Mortazavi, Ali
    Tanzer, Andrea
    Lagarde, Julien
    Lin, Wei
    Schlesinger, Felix
    Xue, Chenghai
    Marinov, Georgi K.
    Khatun, Jainab
    Williams, Brian A.
    Zaleski, Chris
    Rozowsky, Joel
    Roeder, Maik
    Kokocinski, Felix
    Abdelhamid, Rehab F.
    Alioto, Tyler
    Antoshechkin, Igor
    Baer, Michael T.
    Bar, Nadav S.
    Batut, Philippe
    Bell, Kimberly
    Bell, Ian
    Chakrabortty, Sudipto
    Chen, Xian
    Chrast, Jacqueline
    Curado, Joao
    Derrien, Thomas
    Drenkow, Jorg
    Dumais, Erica
    Dumais, Jacqueline
    Duttagupta, Radha
    Falconnet, Emilie
    Fastuca, Meagan
    Fejes-Toth, Kata
    Ferreira, Pedro
    Foissac, Sylvain
    Fullwood, Melissa J.
    Gao, Hui
    Gonzalez, David
    Gordon, Assaf
    Gunawardena, Harsha
    Howald, Cedric
    Jha, Sonali
    Johnson, Rory
    Kapranov, Philipp
    King, Brandon
    [J]. NATURE, 2012, 489 (7414) : 101 - 108