Reevaluation of human cytomegalovirus coding potential

被引:137
作者
Murphy, E
Rigoutsos, I
Shibuya, T
Shenk, TE [1 ]
机构
[1] Princeton Univ, Dept Mol Biol, Princeton, NJ USA
[2] IBM Corp, Thomas J Watson Res Ctr, Deep Comp Inst, Bioinformat & Pattern Discovery Grp,Computat Biol, Yorktown Hts, NY 10598 USA
[3] IBM Tokyo Res Lab, Kanagawa 2428502, Japan
关键词
D O I
10.1073/pnas.1735466100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Bio-Dictionary-based Gene Finder was used to reassess the coding potential of the AD169 laboratory strain of human cytomegalovirus and sequences in the Toledo strain that are missing in the laboratory strain of the virus. The gene-finder algorithm assesses the potential of an ORF to encode a protein based on matches to a database of amino acid patterns derived from a large collection of proteins. The algorithm was used to score all human cytomegalovirus ORFs with the potential to encode polypeptides greater than or equal to50 aa in length. As a further test for functionality, the genomes of the chimpanzee, rhesus, and murine cytomegaloviruses were searched for orthologues of the predicted human cytomegalovirus ORFs. The analysis indicates that 37 previously annotated ORFs ought to be discarded, and at least nine previously unrecognized ORFs with relatively strong coding potential should be added. Thus, the human cytomegalovirus genome appears to contain approximate to192 unique ORFs with the potential to encode a protein. Support for several of the predictions of our in silico analysis was obtained by sequencing several domains within a clinical isolate of human cytomegalovirus.
引用
收藏
页码:13585 / 13590
页数:6
相关论文
共 44 条
[1]   The products of human cytomegalovirus genes UL23, UL24, UL43 and US22 are tegument components [J].
Adair, R ;
Douglas, ER ;
Maclean, JB ;
Graham, SY ;
Aitken, JD ;
Jamieson, FE ;
Dargan, DJ .
JOURNAL OF GENERAL VIROLOGY, 2002, 83 :1315-1324
[2]   Abundant early expression of gpUL4 from a human cytomegalovirus mutant lacking a repressive upstream open reading frame [J].
Alderete, JP ;
Child, SJ ;
Geballe, AP .
JOURNAL OF VIROLOGY, 2001, 75 (15) :7188-7192
[3]   Identification and expression of human cytomegalovirus transcription units coding for two distinct Fcγ receptor homologs [J].
Atalay, R ;
Zimmermann, Z ;
Wagner, M ;
Borst, E ;
Benz, C ;
Messerle, M ;
Hengel, H .
JOURNAL OF VIROLOGY, 2002, 76 (17) :8596-8608
[4]   DNA-SEQUENCE AND EXPRESSION OF THE B95-8 EPSTEIN-BARR VIRUS GENOME [J].
BAER, R ;
BANKIER, AT ;
BIGGIN, MD ;
DEININGER, PL ;
FARRELL, PJ ;
GIBSON, TJ ;
HATFULL, G ;
HUDSON, GS ;
SATCHWELL, SC ;
SEGUIN, C ;
TUFFNELL, PS ;
BARRELL, BG .
NATURE, 1984, 310 (5974) :207-211
[5]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[6]  
Bankier A T, 1991, DNA Seq, V2, P1, DOI 10.3109/10425179109008433
[7]   Expression and characterization of a novel structural protein of human cytomegalovirus, pUL25 [J].
Battista, MC ;
Bergamini, G ;
Boccuni, MC ;
Campanini, F ;
Ripalti, A ;
Landini, MP .
JOURNAL OF VIROLOGY, 1999, 73 (05) :3800-3809
[8]  
Benedict CA, 1999, J IMMUNOL, V162, P6967
[9]  
Britt W., 1996, Fields Virology, P2493
[10]   Human cytomegalovirus clinical isolates carry at least 19 genes not found in laboratory strains [J].
Cha, TA ;
Tom, E ;
Kemble, GW ;
Duke, GM ;
Mocarski, ES ;
Spaete, RR .
JOURNAL OF VIROLOGY, 1996, 70 (01) :78-83