The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

Cited by: 86
Authors
Brentani, H
Caballero, OL
Camargo, AA
da Silva, AM
da Silva, WA
Neto, ED
Grivet, M
Gruber, A
Guimaraes, PEM
Hide, W
Iseli, C
Jongeneel, CV
Kelso, J
Nagai, MA
Ojopi, EPB
Osorio, EC
Reis, EMR
Riggins, GJ
Simpson, AJG
de Souza, S
Stevenson, BJ
Strausberg, RL
Tajara, EH
Verjovski-Almeida, S
Acencio, ML
Bengtson, MH
Bettoni, F
Bodmer, WF
Briones, MRS
Camargo, LP
Cavenee, W
Cerutti, JM
Coelho Andrade, LE
Costa dos Santos, PC
Costa, MCR
da Silva, IT
Estecio, MRH
Ferreira, KS
Furnari, FB
Faria, M
Galante, PAF
Guimaraes, GS
Holanda, AJ
Kimura, ET
Leerkes, MR
Xin, LA
Maciel, RMB
Martins, EAL
Massirer, KB
Melo, ASA
Affiliations
[1] Univ Sao Paulo, Lab Genet Mol Canc, Dept Radiol, BR-05403010 Sao Paulo, Brazil
[2] Univ Sao Paulo, Dept Bioquim, Inst Quim, BR-05508900 Sao Paulo, Brazil
[3] Univ Estadual Paulista, Dept Biol, Inst Biociencias, BR-15054 Sao Jose Do Rio Preto, SP, Brazil
[4] Ludwig Inst Canc Res, BR-01509010 Sao Paulo, Brazil
[5] Univ Sao Paulo, Dept Patol, Fac Med Vet & Zootecn, BR-05508000 Sao Paulo, Brazil
[6] Univ Sao Paulo, Fdn Hemocentro Ribeirao Preto, Fac Med Ribeirao Preto, BR-14051140 Ribeirao Preto, SP, Brazil
[7] Univ Sao Paulo, Neurosci Lab, Inst Psiquiatria, Fac Med, BR-05403010 Sao Paulo, Brazil
[8] NCI, Bethesda, MD 20892 USA
[9] Duke Univ, Med Ctr, Durham, NC 27710 USA
[10] Univ Western Cape, S African Natl Bioinformat Inst, ZA-7535 Bellville, South Africa
[11] Ludwig Inst Canc Res, Off Informat Technol, CH-1066 Epalinges, Switzerland
[12] Ctr Estudo Telecomun PUC, BR-22453900 Rio De Janeiro, Brazil
DOI
10.1073/pnas.1233632100
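Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [Natural sciences, general]
Discipline classification codes
07; 0710; 09
Abstract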
Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that <1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.
Pages: 13418-13423
Page count: 6