Almost all human genes resulted from ancient duplication

被引:14
作者
Britten, Roy J. [1 ]
机构
[1] CALTECH, Corona Del Mar, CA 92625 USA
关键词
open criterion; protein; relationships; sequence;
D O I
10.1073/pnas.0608796103
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Results of protein sequence comparison at open criterion show a very large number of relationships that have, up to now, gone unreported. The relationships suggest many ancient events of gene duplication. It is well known that gene duplication has been a major process in the evolution of genomes. A collection of human genes that have known functions have been examined for a history of gene duplications detected by means of amino acid sequence similarity by using BLASTp with an expectation of two or less (open criterion). Because the collection of genes in build 35 includes sets of transcript variants, all genes of known function were collected, and only the longest transcription variant was included, yielding a 13,298-member library called KGMV (for known genes maximum variant). When all lengths of matches are accepted, > 97% of human genes show significant matches to each other. Many form matches with a large number of other different proteins, showing that most genes are made up from parts of many others as a result of ancient events of duplication. To support the use of the open criterion, all of the members of the KGMV library were twice replaced with random protein sequences of the same length and average composition, and all were compared with each other with BLASTp at expectation two or less. The set of matches averaged 0.35% of that observed for the KGMV set of proteins.
引用
收藏
页码:19027 / 19032
页数:6
相关论文
共 7 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
BRITTEN RJ, 1965, CARNEGIE I YB, V64, P333
[3]   Genes on human chromosome 19 show extreme divergence from the mouse orthologs and a high GC content [J].
Castresana, J .
NUCLEIC ACIDS RESEARCH, 2002, 30 (08) :1751-1756
[4]   A genome-wide comparison of recent chimpanzee and human segmental duplications [J].
Cheng, Z ;
Ventura, M ;
She, XW ;
Khaitovich, P ;
Graves, T ;
Osoegawa, K ;
Church, D ;
DeJong, P ;
Wilson, RK ;
Pääbo, S ;
Rocchi, M ;
Eichler, EE .
NATURE, 2005, 437 (7055) :88-93
[5]   Transduction of 3′-flanking sequences is common in L1 retrotransposition [J].
Goodier, JL ;
Ostertag, EM ;
Kazazian, HH .
HUMAN MOLECULAR GENETICS, 2000, 9 (04) :653-657
[6]  
Ohno S., 1970, EVOLUTION GENE DUPLI
[7]  
ZIMMER EA, 1980, P NATL ACAD SCI USA, V77, P2156