Use of artificial genomes in assessing methods for atypical gene detection

被引:27
作者
Azad, RK [1 ]
Lawrence, JG [1 ]
机构
[1] Univ Pittsburgh, Dept Biol Sci, Pittsburgh, PA 15260 USA
基金
美国国家科学基金会;
关键词
D O I
10.1371/journal.pcbi.0010056
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods-as well as the evaluation and proper implementation of existing methods-relies on an arbitrary assessment of performance using real genomes, where the evolutionary histories of genes are not known. We have used the framework of a generalized hidden Markov model to create artificial genomes modeled after genuine genomes. To model a genome, "core" genes-those displaying patterns of mutational biases shared among large numbers of genes-are identified by a novel gene clustering approach based on the Akaike information criterion. Gene models derived from multiple "core" gene clusters are used to generate an artificial genome that models the properties of a genuine genome. Chimeric artificial genomes-representing those having experienced lateral gene transfer-were created by combining genes from multiple artificial genomes, and the performance of the parametric methods for identifying "atypical" genes was assessed directly. We found that a hidden Markov model that included multiple gene models, each trained on sets of genes representing the range of genotypic variability within a genome, could produce artificial genomes that mimicked the properties of genuine genomes. Moreover, different methods for detecting foreign genes performed differently-i.e., they had different sets of strengths and weaknesses-when identifying atypical genes within chimeric artificial genomes.
引用
收藏
页码:461 / 473
页数:13
相关论文
共 50 条
[31]   Asymmetric substitution patterns in the two DNA strands of bacteria [J].
Lobry, JR .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (05) :660-665
[32]   GeneMark.hmm: new solutions for gene finding [J].
Lukashin, AV ;
Borodovsky, M .
NUCLEIC ACIDS RESEARCH, 1998, 26 (04) :1107-1115
[33]   GCUA: General codon usage analysis [J].
McInerney, JO .
BIOINFORMATICS, 1998, 14 (04) :372-373
[34]   EVIDENCE FOR HORIZONTAL GENE-TRANSFER IN ESCHERICHIA-COLI SPECIATION [J].
MEDIGUE, C ;
ROUXEL, T ;
VIGIER, P ;
HENAUT, A ;
DANCHIN, A .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 222 (04) :851-856
[35]   EVIDENCE FOR A 14-GENE, PHNC TO PHNP LOCUS FOR PHOSPHONATE METABOLISM IN ESCHERICHIA-COLI [J].
METCALF, WW ;
WANNER, BL .
GENE, 1993, 129 (01) :27-32
[36]   Biased biological functions of horizontally transferred genes in prokaryotic genomes [J].
Nakamura, Y ;
Itoh, T ;
Matsuda, H ;
Gojobori, T .
NATURE GENETICS, 2004, 36 (07) :760-766
[37]   Lateral gene transfer and the nature of bacterial innovation [J].
Ochman, H ;
Lawrence, JG ;
Groisman, EA .
NATURE, 2000, 405 (6784) :299-304
[38]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[39]   Detection of lateral gene transfer among microbial genomes [J].
Ragan, MA .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2001, 11 (06) :620-626
[40]   On surrogate methods for detecting lateral gene transfer [J].
Ragan, MA .
FEMS MICROBIOLOGY LETTERS, 2001, 201 (02) :187-191