The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

被引:86
作者
Brentani, H
Caballero, OL
Camargo, AA
da Silva, AM
da Silva, WA
Neto, ED
Grivet, M
Gruber, A
Guimaraes, PEM
Hide, W
Iseli, C
Jongeneel, CV
Kelso, J
Nagai, MA
Ojopi, EPB
Osorio, EC
Reis, EMR
Riggins, GJ
Simpson, AJG
de Souza, S
Stevenson, BJ
Strausberg, RL
Tajara, EH
Verjovski-Almeida, S
Acencion, ML
Bengtsono, MH
Bettonip, F
Bodmerq, WF
Brionesr, MRS
Camargos, LP
Caveneet, W
Ceruttiu, JM
Coelho Andradev, LE
Costa dos Santosn, PC
Costaw, MCR
da Silvaw, IT
Esteciox, MRH
Ferreiraw, KS
Furnarit, FB
Faria, M
Galantep, PAF
Guimaraesy, GS
Holandaw, AJ
Kimuraz, ET
Leerkesp, MR
Xin, LA
Macielu, RMB
Martinsbb, EAL
Massirero, KB
Melor, ASA
机构
[1] Univ Sao Paulo, Lab Genet Mol Canc, Dept Radiol, BR-05403010 Sao Paulo, Brazil
[2] Univ Sao Paulo, Dept Bioquim, Inst Quim, BR-05508900 Sao Paulo, Brazil
[3] Univ Estadual Paulista, Dept Biol, Inst Biociencias, BR-15054 Sao Jose Do Rio Preto, SP, Brazil
[4] Ludwig Inst Canc Res, BR-01509010 Sao Paulo, Brazil
[5] Univ Sao Paulo, Dept Patol, Fac Med Vet & Zootecn, BR-05508000 Sao Paulo, Brazil
[6] Univ Sao Paulo, Fdn Hemocentro Ribeirao Preto, Fac Med Ribeirao Preto, BR-14051140 Ribeirao Preto, SP, Brazil
[7] Univ Sao Paulo, Neurosci Lab, Inst Psiquiatria, Fac Med, BR-05403010 Sao Paulo, Brazil
[8] NCI, Bethesda, MD 20892 USA
[9] Duke Univ, Med Ctr, Durham, NC 27710 USA
[10] Univ Western Cape, S African Natl Bioinformat Inst, ZA-7535 Bellville, South Africa
[11] Ludwig Inst Canc Res, Off Informat Technol, CH-1066 Epalinges, Switzerland
[12] Ctr Estudo Telecomun PUC, BR-22453900 Rio De Janeiro, Brazil
关键词
D O I
10.1073/pnas.1233632100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define approximate to23,500 genes, of which only approximate to1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that <1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.
引用
收藏
页码:13418 / 13423
页数:6
相关论文
共 33 条
[21]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[22]   Genome-wide detection of alternative splicing in expressed sequences of human genes [J].
Modrek, B ;
Resch, A ;
Grasso, C ;
Lee, C .
NUCLEIC ACIDS RESEARCH, 2001, 29 (13) :2850-2859
[23]  
Neto ED, 2000, P NATL ACAD SCI USA, V97, P3491
[24]   Nineteen additional unpredicted transcripts from human chromosome 21 [J].
Reymond, A ;
Camargo, AA ;
Deutsch, S ;
Stevenson, BJ ;
Parmigiani, RB ;
Ucla, C ;
Bettoni, F ;
Rossier, C ;
Lyle, R ;
Guipponi, M ;
de Souza, S ;
Iseli, C ;
Jongeneel, CV ;
Bucher, P ;
Simpson, AJG ;
Antonarakis, SE .
GENOMICS, 2002, 79 (06) :824-832
[25]   Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence [J].
Roest Crollius, H ;
Jaillon, O ;
Bernot, A ;
Dasilva, C ;
Bouneau, L ;
Fischer, C ;
Fizames, C ;
Wincker, P ;
Brottier, P ;
Quétier, F ;
Saurin, W ;
Weissenbach, J .
NATURE GENETICS, 2000, 25 (02) :235-238
[26]  
SAKABE NJ, 2003, IN PRESS P FR ACAD S
[27]  
SANCHEZ LM, 2001, ONCOL REP, V8, P145
[28]   An international database and integrated analysis tools for the study of cancer gene expression [J].
Strausberg R.L. ;
Camargo A.A. ;
Riggins G.J. ;
Schaefer C.F. ;
De Souza S.J. ;
Grouse L.H. ;
Lal A. ;
Buetow K.H. ;
Boon K. ;
Greenhut S.F. ;
Simpson A.J.G. .
The Pharmacogenomics Journal, 2002, 2 (3) :156-164
[29]   The cancer genome anatomy project - Building an annotated gene index [J].
Strausberg, RL ;
Buetow, KH ;
Emmert-Buck, MR ;
Klausner, RD .
TRENDS IN GENETICS, 2000, 16 (03) :103-106
[30]   New opportunities for uncovering the molecular basis of cancer [J].
Strausberg, RL ;
Dahl, CA ;
Klausner, RD .
NATURE GENETICS, 1997, 15 (Suppl 4) :415-416