Genome-wide survey for biologically functional pseudogenes

被引:56
作者
Svensson, Orjan [1 ]
Arvestad, Lars [1 ]
Lagergren, Jens [1 ]
机构
[1] Albanova Univ Ctr, Royal Inst Technol, Stockholm Bioinformat Ctr, Stockholm, Sweden
关键词
D O I
10.1371/journal.pcbi.0020046
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human-mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i. e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human - mouse species split, and also a larger group of primate-specific ones found from human - chimpanzee searches. Two processed sequences are notable, their conservation since the human - mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7- like 3 ( ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross- species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein- coding genes, we use standard methods, utilizing in- frame disablements, as well as a probabilistic filter based on Ka/ Ks ratios.
引用
收藏
页码:358 / 369
页数:12
相关论文
共 34 条
[1]  
[Anonymous], 2009, BIOINFORMATICS, V26, P139, DOI [10.1093/bioinformatics/btp616, DOI 10.1093/BIOINFORMATICS/BTP616, DOI 10.1093/BIOINFORMATICS/17.SUPPL_1.S140]
[2]   Pseudogenes: Are they "Junk" or functional DNA? [J].
Balakirev, ES ;
Ayala, FJ .
ANNUAL REVIEW OF GENETICS, 2003, 37 :123-151
[3]   IDENTIFICATION AND CHARACTERIZATION OF THE GENE CAUSING TYPE-1 SPINOCEREBELLAR ATAXIA [J].
BANFI, S ;
SERVADIO, A ;
CHUNG, MY ;
KWIATKOWSKI, TJ ;
MCCALL, AE ;
DUVICK, LA ;
SHEN, Y ;
ROTH, EJ ;
ORR, HT ;
ZOGHBI, HY .
NATURE GENETICS, 1994, 7 (04) :513-520
[4]   TEXshade:: shading and labeling of multiple sequence alignments using LATEX 2ε [J].
Beitz, E .
BIOINFORMATICS, 2000, 16 (02) :135-139
[5]  
Bielawski Joseph P., 2003, Journal of Structural and Functional Genomics, V3, P201, DOI 10.1023/A:1022642807731
[6]   Reconstructing the genomic architecture of ancestral mammals: Lessons from human, mouse, and rat genomes [J].
Bourque, G ;
Pevzner, PA ;
Tesler, G .
GENOME RESEARCH, 2004, 14 (04) :507-516
[7]   Divergence between samples of chimpanzee and human DNA sequences is 5%, counting indels [J].
Britten, RJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (21) :13633-13635
[8]   The "Inverse relationship between evolutionary rate and age of mammalian genes" is an artifact of increased genetic distance with rate of evolution and time of divergence [J].
Elhaik, E ;
Sabath, N ;
Graur, D .
MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (01) :1-3
[9]   pANT: A method for the pairwise assessment of nonfunctionalization times of processed pseudogenes [J].
Fleishman, SJ ;
Dagan, T ;
Graur, D .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (11) :1876-1880
[10]   Likelihood-based tests of topologies in phylogenetics [J].
Goldman, N ;
Anderson, JP ;
Rodrigo, AG .
SYSTEMATIC BIOLOGY, 2000, 49 (04) :652-670