Identification of novel human genes evolutionarily conserved in Caenorhabditis elegans by comparative proteomics

被引:364
作者
Lai, CH
Chou, CY
Ch'ang, LY
Liu, CS
Lin, WC [1 ]
机构
[1] Acad Sinica, Inst Biomed Sci, Taipei 115, Taiwan
[2] Chung Yuan Christian Univ, Dept Informat Engn, Chungli, Taiwan
关键词
D O I
10.1101/gr.10.5.703
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Modern biomedical research greatly benefits from large-scale genome-sequencing projects ranging from studies of viruses, bacteria, and yeast to multicellular organisms, like Caenorhabditis elegans. Comparative genomic studies offer a vast array of prospects for identification and functional annotation of human ortholog genes. We presented a novel comparative proteomic approach for assembling human gene contigs and assisting gene discovery. The C. elegans proteome was used as an alignment template to assist in novel human gene identification from human EST nucleotide databases. Among the available 18,452 C. elegans protein sequences, our results indicate that at least 83% (15,344 sequences) of C. elegans proteome has human homologous genes, with 7,954 records of C. elegans proteins matching known human gene transcripts. Only 11% or less of C. elegans proteome contains nematode-specific genes. We Found that the remaining 7,390 sequences might lead to discoveries of novel human genes, and over 150 putative Full-length human gene transcripts were assembled upon Further database analyses.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 47 条
[1]   Toward the development of a gene index to the human genome: An assessment of the nature of high-throughput EST sequence data [J].
Aaronson, JS ;
Eckman, B ;
Blevins, RA ;
Borkowski, JA ;
Myerson, J ;
Imran, S ;
Elliston, KO .
GENOME RESEARCH, 1996, 6 (09) :829-845
[2]  
ADAMS MD, 1995, NATURE, V377, P3
[3]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[4]  
Andrade MA, 1997, YEAST, V13, P1363, DOI 10.1002/(SICI)1097-0061(199711)13:14<1363::AID-YEA182>3.0.CO
[5]  
2-8
[6]   Analysis of EST-driven gene annotation in human genomic sequence [J].
Bailey, LC ;
Searls, DB ;
Overton, GC .
GENOME RESEARCH, 1998, 8 (04) :362-376
[7]   Drosophila-related expressed sequences [J].
Banfi, S ;
Borsani, G ;
Bulfone, A ;
Ballabio, A .
HUMAN MOLECULAR GENETICS, 1997, 6 (10) :1745-1753
[8]   Identification and mapping of human cDNAs homologous to Drosophila mutant genes through EST database searching [J].
Banfi, S ;
Borsani, G ;
Rossi, E ;
Bernard, L ;
Guffanti, A ;
Rubboli, F ;
Marchitiello, A ;
Giglio, S ;
Coluccia, E ;
Zollo, M ;
Zuffardi, O ;
Ballabio, A .
NATURE GENETICS, 1996, 13 (02) :167-174
[9]   ESTABLISHING A HUMAN TRANSCRIPT MAP [J].
BOGUSKI, MS ;
SCHULER, GD .
NATURE GENETICS, 1995, 10 (04) :369-371
[10]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333