Repetitive sequence environment distinguishes housekeeping genes

Cited by: 40
Authors
Eller, C. Daniel
Regelson, Moira
Merriman, Barry
Nelson, Stan
Horvath, Steve
Marahrens, York
Affiliations
[1] Univ Calif Los Angeles, David Geffen Sch Med, Gonda Ctr, Dept Human Genet, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90095 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA);
Keywords
random forest; Alu; SINE; LINE; repeat; tissue-specific genes; isochores;
DOI
10.1016/j.gene.2006.09.018
Chinese Library Classification (CLC) number
Q3 [Genetics];
Subject classification codes
071007 ; 090102 ;
Abstract
Housekeeping genes are expressed across a wide variety of tissues. Since repetitive sequences have been reported to influence the expression of individual genes, we employed a novel approach to determine whether housekeeping genes can be distinguished from tissue-specific genes by their repetitive sequence context. We show that Alu elements are more highly concentrated around housekeeping genes while various longer (>400 bp) repetitive sequences ("repeats"), including Long Interspersed Nuclear Element-1 (LINE-1) elements, are excluded from these regions. We further show that isochore membership does not distinguish housekeeping genes from tissue-specific genes and that repetitive sequence environment distinguishes housekeeping genes from tissue-specific genes in every isochore. The distinct repetitive sequence environment, in combination with other previously published sequence properties of housekeeping genes, was used to develop a method of predicting housekeeping genes on the basis of DNA sequence alone. Using expression across tissue types as a measure of success, we demonstrate that repetitive sequence environment is by far the most important sequence feature identified to date for distinguishing housekeeping genes. © 2006 Elsevier B.V. All rights reserved.
Pages: 153-165
Page count: 13