A single-molecule long-read survey of the human transcriptome

被引:473
作者
Sharon, Donald [1 ,2 ]
Tilgner, Hagen [1 ]
Grubert, Fabian [1 ]
Snyder, Michael [1 ]
机构
[1] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
[2] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT USA
基金
美国国家卫生研究院;
关键词
NONCODING RNAS; MESSENGER-RNA; ANNOTATION; LANDSCAPE; REVEALS; GENCODE; SEQ;
D O I
10.1038/nbt.2705
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Global RNA studies have become central to understanding biological processes, but methods such as microarrays and short-read sequencing are unable to describe an entire RNA molecule from 5' to 3' end. Here we use single-molecule long-read sequencing technology from Pacific Biosciences to sequence the polyadenylated RNA complement of a pooled set of 20 human organs and tissues without the need for fragmentation or amplification. We show that full-length RNA molecules of up to 1.5 kb can readily be monitored with little sequence loss at the 5' ends. For longer RNA molecules more 5' nucleotides are missing, but complete intron structures are often preserved. In total, we identify similar to 14,000 spliced GENCODE genes. High-confidence mappings are consistent with GENCODE annotations, but >10% of the alignments represent intron structures that were not previously annotated. As a group, transcripts mapping to unannotated regions have features of long, noncoding RNAs. Our results show the feasibility of deep sequencing full-length RNA from complex eukaryotic transcriptomes on a single-molecule level.
引用
收藏
页码:1009 / +
页数:8
相关论文
共 27 条
[1]  
[Anonymous], 2012, R LANG ENV STAT COMP
[2]   Improving PacBio Long Read Accuracy by Short Read Alignment [J].
Au, Kin Fai ;
Underwood, Jason G. ;
Lee, Lawrence ;
Wong, Wing Hung .
PLOS ONE, 2012, 7 (10)
[3]   Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses [J].
Cabili, Moran N. ;
Trapnell, Cole ;
Goff, Loyal ;
Koziol, Magdalena ;
Tazon-Vega, Barbara ;
Regev, Aviv ;
Rinn, John L. .
GENES & DEVELOPMENT, 2011, 25 (18) :1915-1927
[4]   Predicting the molecular complexity of sequencing libraries [J].
Daley, Timothy ;
Smith, Andrew D. .
NATURE METHODS, 2013, 10 (04) :325-+
[5]   The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression [J].
Derrien, Thomas ;
Johnson, Rory ;
Bussotti, Giovanni ;
Tanzer, Andrea ;
Djebali, Sarah ;
Tilgner, Hagen ;
Guernec, Gregory ;
Martin, David ;
Merkel, Angelika ;
Knowles, David G. ;
Lagarde, Julien ;
Veeravalli, Lavanya ;
Ruan, Xiaoan ;
Ruan, Yijun ;
Lassmann, Timo ;
Carninci, Piero ;
Brown, James B. ;
Lipovich, Leonard ;
Gonzalez, Jose M. ;
Thomas, Mark ;
Davis, Carrie A. ;
Shiekhattar, Ramin ;
Gingeras, Thomas R. ;
Hubbard, Tim J. ;
Notredame, Cedric ;
Harrow, Jennifer ;
Guigo, Roderic .
GENOME RESEARCH, 2012, 22 (09) :1775-1789
[6]   Landscape of transcription in human cells [J].
Djebali, Sarah ;
Davis, Carrie A. ;
Merkel, Angelika ;
Dobin, Alex ;
Lassmann, Timo ;
Mortazavi, Ali ;
Tanzer, Andrea ;
Lagarde, Julien ;
Lin, Wei ;
Schlesinger, Felix ;
Xue, Chenghai ;
Marinov, Georgi K. ;
Khatun, Jainab ;
Williams, Brian A. ;
Zaleski, Chris ;
Rozowsky, Joel ;
Roeder, Maik ;
Kokocinski, Felix ;
Abdelhamid, Rehab F. ;
Alioto, Tyler ;
Antoshechkin, Igor ;
Baer, Michael T. ;
Bar, Nadav S. ;
Batut, Philippe ;
Bell, Kimberly ;
Bell, Ian ;
Chakrabortty, Sudipto ;
Chen, Xian ;
Chrast, Jacqueline ;
Curado, Joao ;
Derrien, Thomas ;
Drenkow, Jorg ;
Dumais, Erica ;
Dumais, Jacqueline ;
Duttagupta, Radha ;
Falconnet, Emilie ;
Fastuca, Meagan ;
Fejes-Toth, Kata ;
Ferreira, Pedro ;
Foissac, Sylvain ;
Fullwood, Melissa J. ;
Gao, Hui ;
Gonzalez, David ;
Gordon, Assaf ;
Gunawardena, Harsha ;
Howald, Cedric ;
Jha, Sonali ;
Johnson, Rory ;
Kapranov, Philipp ;
King, Brandon .
NATURE, 2012, 489 (7414) :101-108
[7]   Real-Time DNA Sequencing from Single Polymerase Molecules [J].
Eid, John ;
Fehr, Adrian ;
Gray, Jeremy ;
Luong, Khai ;
Lyle, John ;
Otto, Geoff ;
Peluso, Paul ;
Rank, David ;
Baybayan, Primo ;
Bettman, Brad ;
Bibillo, Arkadiusz ;
Bjornson, Keith ;
Chaudhuri, Bidhan ;
Christians, Frederick ;
Cicero, Ronald ;
Clark, Sonya ;
Dalal, Ravindra ;
deWinter, Alex ;
Dixon, John ;
Foquet, Mathieu ;
Gaertner, Alfred ;
Hardenbol, Paul ;
Heiner, Cheryl ;
Hester, Kevin ;
Holden, David ;
Kearns, Gregory ;
Kong, Xiangxu ;
Kuse, Ronald ;
Lacroix, Yves ;
Lin, Steven ;
Lundquist, Paul ;
Ma, Congcong ;
Marks, Patrick ;
Maxham, Mark ;
Murphy, Devon ;
Park, Insil ;
Pham, Thang ;
Phillips, Michael ;
Roy, Joy ;
Sebra, Robert ;
Shen, Gene ;
Sorenson, Jon ;
Tomaney, Austin ;
Travers, Kevin ;
Trulson, Mark ;
Vieceli, John ;
Wegener, Jeffrey ;
Wu, Dawn ;
Yang, Alicia ;
Zaccarin, Denis .
SCIENCE, 2009, 323 (5910) :133-138
[8]   ESTGenes: Alternative splicing from ESTs in Ensembl [J].
Eyras, E ;
Caccamo, M ;
Curwen, V ;
Clamp, M .
GENOME RESEARCH, 2004, 14 (05) :976-987
[9]   Functional coordination of alternative splicing in the mammalian central nervous system [J].
Fagnani, Matthew ;
Barash, Yoseph ;
Ip, Joanna ;
Misquitta, Christine ;
Pan, Qun ;
Saltzman, Arneet L. ;
Shai, Ofer ;
Lee, Leo ;
Rozenhek, Aviad ;
Mohammad, Naveed ;
Willaime-Morawek, Sandrine ;
Babak, Tomas ;
Zhang, Wen ;
Hughes, Timothy R. ;
van der Kooy, Derek ;
Frey, Brendan J. ;
Blencowe, Benjamin J. .
GENOME BIOLOGY, 2007, 8 (06)
[10]   Missing lincs in the transcriptome [J].
Gingeras, Thomas .
NATURE BIOTECHNOLOGY, 2009, 27 (04) :346-347