The majority of human genes have regions repeated in other human genes

Cited by: 6
Authors
Britten, RJ [1]
Affiliations
[1] CALTECH, Corona Del Mar, CA 92625 USA
Keywords
domains; protein; relationships
DOI
10.1073/pnas.0501008102
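Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification code
07; 0710; 09
Abstract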
Amino acid sequence comparisons have been made between all of 25,193 human proteins with each of the others by using BLAST software (National Center for Biotechnology Information) and recording the results for regions that are significantly related in sequence, that is, have an expectation of < 1 × 10⁻³. The results are presented for each amino acid as the number of identical or similar amino acids matched in these aligned regions. This approach avoids summing or dealing directly with the different regions of any one protein that are often related to different numbers and types of other proteins. The results are presented graphically for a sample of 140 proteins. Relationships are not observed for 26.5% of the 12,728,866 amino acids. The average number of related amino acids is 36.5 for the majority (73.5%) that show relationships. The median number of recognized relationships is approximately 3 for all of the amino acids, and the maximum number is 718. The results demonstrate the overwhelming importance of gene regional duplication forming families of proteins with related domains and show the variety of the resulting patterns of relationship. The magnitude of the set of relationships leads to the conclusion that the principal process by which new gene functions arise has been by making use of preexisting genes.
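The per-residue tally described in the abstract can be illustrated with a minimal Python sketch. It is not the author's code. It assumes each aligned region is already parsed into a hypothetical `Hit` record holding the gapped query string and the BLAST pairwise midline, where a letter marks an identity, `+` marks a similar residue, and a space marks neither. The names `Hit`, `per_residue_counts`, and `summarize` are illustrative, and the handling of self-hits and of multiple regions per subject is an assumption.

```python
from collections import namedtuple

# Hypothetical record for one aligned region; field names are illustrative.
# query_start is 1-based; query_aln may contain '-' for gaps in the query.
Hit = namedtuple("Hit", "query_id subject_id evalue query_start query_aln midline")

E_CUTOFF = 1e-3  # expectation threshold quoted in the abstract


def per_residue_counts(query_id, query_len, hits, e_cutoff=E_CUTOFF):
    """Count, for each query residue, the identical or similar matches
    found in significant aligned regions against other proteins."""
    counts = [0] * query_len
    for h in hits:
        # Assumption: self-hits are excluded, other proteins only.
        if h.query_id != query_id or h.subject_id == query_id:
            continue
        if h.evalue >= e_cutoff:
            continue  # region is not significantly related
        pos = h.query_start - 1  # convert to a 0-based index
        for q_char, m_char in zip(h.query_aln, h.midline):
            if q_char == "-":
                continue  # gap in the query consumes no query residue
            if m_char != " ":
                counts[pos] += 1  # identity (letter) or similarity ('+')
            pos += 1
    return counts


def summarize(counts):
    """Return (fraction of residues with no relationship,
    mean count over residues that have at least one)."""
    related = [c for c in counts if c > 0]
    frac_unrelated = 1 - len(related) / len(counts)
    mean_related = sum(related) / len(related) if related else 0.0
    return frac_unrelated, mean_related
```

Applied to every protein in turn, `summarize` over the pooled counts would yield figures of the same kind as the 26.5% and 36.5 quoted above, although this sketch does not reproduce the paper's exact procedure.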
Pages: 5466-5470
Number of pages: 5