Large-scale identification of novel transcripts in the human genome

Cited by: 15
Authors
Peters, Brock A.
St. Croix, Brad
Sjoblom, Tobias
Cummins, Jordan M.
Silliman, Natalie
Ptak, Janine
Saha, Saurabh
Kinzler, Kenneth W.
Hatzis, Christos
Velculescu, Victor E. [1]
Affiliations
[1] Johns Hopkins Univ, Kimmel Canc Ctr, Ludwig Ctr Canc Genet & Therapeut, Baltimore, MD 21231 USA
[2] Johns Hopkins Univ, Dept Pharmacol & Mol Sci, Baltimore, MD 21231 USA
[3] NCI, Tumor Angiogenesis Sect, Mouse Canc Genet Program, Frederick, MD 21702 USA
[4] Nuvera Biosci, Woburn, MA 01801 USA
DOI
10.1101/gr.5486607
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remain to be fully determined. We used LongSAGE to analyze 660,357 transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated transcripts suggests that 85% were differentially expressed in various tissue types and that fewer than 20% would have been detected by ab initio gene predictions. These studies suggest that the human genome contains on the order of twice as many transcribed regions as are currently annotated and that experimental approaches will be required to fully elucidate the novel genes corresponding to these transcripts.
Pages: 287-292
Page count: 6