Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses

Cited by: 2731
Authors
Cabili, Moran N. [1 ,2 ,3 ,6 ]
Trapnell, Cole [1 ,3 ,6 ]
Goff, Loyal [1 ,4 ,6 ]
Koziol, Magdalena [1 ,3 ,6 ]
Tazon-Vega, Barbara [1 ,3 ,6 ]
Regev, Aviv [1 ,5 ,6 ]
Rinn, John L. [1 ,3 ,6 ]
Affiliations
[1] MIT, Broad Inst, Cambridge, MA 02142 USA
[2] Harvard Univ, Sch Med, Dept Syst Biol, Boston, MA 02115 USA
[3] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[4] MIT, Comp Sci & Artificial Intelligence Lab, Dept Elect Engn & Comp Sci, Cambridge, MA 02140 USA
[5] MIT, Howard Hughes Med Inst, Dept Biol, Cambridge, MA 02140 USA
[6] Harvard Univ, Cambridge, MA 02142 USA
Keywords
long noncoding RNAs; RNA sequencing; lincRNAs; HUMAN GENOME; CHROMATIN; GENE; TRANSCRIPTION; MOUSE; IDENTIFICATION; QUANTIFICATION; EXPRESSION; DELETION; DYNAMICS;
DOI
10.1101/gad.17446611
Chinese Library Classification (CLC) number
Q2 [Cell biology]
Discipline classification code
071009; 090102
Abstract
Large intergenic noncoding RNAs (lincRNAs) are emerging as key regulators of diverse cellular processes. Determining the function of individual lincRNAs remains a challenge. Recent advances in RNA sequencing (RNA-seq) and computational methods allow for an unprecedented analysis of such transcripts. Here, we present an integrative approach to define a reference catalog of >8000 human lincRNAs. Our catalog unifies previously existing annotation sources with transcripts we assembled from RNA-seq data collected from ~4 billion RNA-seq reads across 24 tissues and cell types. We characterize each lincRNA by a panorama of >30 properties, including sequence, structural, transcriptional, and orthology features. We found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that lincRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes. We distinguish an additional subset of transcripts that have high evolutionary conservation but may include short ORFs and may serve as either lincRNAs or small peptides. Our integrated, comprehensive, yet conservative reference catalog of human lincRNAs reveals the global properties of lincRNAs and will facilitate experimental studies and further functional classification of these genes.
Pages: 1915-1927
Page count: 13