A genome-wide analysis of common fragile sites: What features determine chromosomal instability in the human genome?

被引:122
作者
Fungtammasan, Arkarachai [1 ,2 ,3 ]
Walsh, Erin [1 ,4 ]
Chiaromonte, Francesca [1 ,5 ]
Eckert, Kristin A. [1 ,6 ]
Makova, Kateryna D. [1 ,3 ]
机构
[1] Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
[2] Penn State Univ, Integrat Biosci Grad Program, University Pk, PA 16802 USA
[3] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[4] Penn State Univ, Coll Med, Cellular & Mol Biol Grad Program, Hershey, PA 17033 USA
[5] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[6] Penn State Univ, Coll Med, Dept Pathol, Jake Gittlen Canc Res Fdn, Hershey, PA 17033 USA
关键词
REPLICATION INITIATION; GENE; SEQUENCE; MOUSE; APHIDICOLIN; FRA16D; INTEGRATION; EXPRESSION; FRA3B; MAP;
D O I
10.1101/gr.134395.111
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
Chromosomal common fragile sites (CFSs) are unstable genomic regions that break under replication stress and are involved in structural variation. They frequently are sites of chromosomal rearrangements in cancer and of viral integration. However, CFSs are undercharacterized at the molecular level and thus difficult to predict computationally. Newly available genome-wide profiling studies provide us with an unprecedented opportunity to associate CFSs with features of their local genomic contexts. Here, we contrasted the genomic landscape of cytogenetically defined aphidicolin-induced CFSs (aCFSs) to that of nonfragile sites, using multiple logistic regression. We also analyzed aCFS breakage frequencies as a function of their genomic landscape, using standard multiple regression. We show that local genomic features are effective predictors both of regions harboring aCFSs (explaining similar to 77% of the deviance in logistic regression models) and of aCFS breakage frequencies (explaining similar to 45% of the variance in standard regression models). In our optimal models (having highest explanatory power), aCFSs are predominantly located in G-negative chromosomal bands and away from centromeres, are enriched in Alu repeats, and have high DNA flexibility. In alternative models, CpG island density, transcription start site density, H3K4me1 coverage, and mononucleotide microsatellite coverage are significant predictors. Also, aCFSs have high fragility when colocated with evolutionarily conserved chromosomal breakpoints. Our models are predictive of the fragility of aCFSs mapped at a higher resolution. Importantly, the genomic features we identified here as significant predictors of fragility allow us to draw valuable inferences on the molecular mechanisms underlying aCFSs.
引用
收藏
页码:993 / 1005
页数:13
相关论文
共 98 条
[1]
A genome-wide view of mutation rate co-variation using multivariate analyses [J].
Ananda, Guruprasad ;
Chiaromonte, Francesca ;
Makova, Kateryna D. .
GENOME BIOLOGY, 2011, 12 (03)
[2]
[Anonymous], 2011, R: A Language and Environment for Statistical Computing
[3]
ALU REPEATS - A SOURCE FOR THE GENESIS OF PRIMATE MICROSATELLITES [J].
ARCOT, SS ;
WANG, ZY ;
WEBER, JL ;
DEININGER, PL ;
BATZER, MA .
GENOMICS, 1995, 29 (01) :136-144
[4]
Common fragile sites as targets for chromosome rearrangements [J].
Arlt, Martin F. ;
Durkin, Sandra G. ;
Ragland, Ryan L. ;
Glover, Thomas W. .
DNA REPAIR, 2006, 5 (9-10) :1126-1135
[5]
Fragile sites are preferential targets for integrations of MLV vectors in gene therapy [J].
Bester, A. C. ;
Schwartz, M. ;
Schmidt, M. ;
Garrigue, A. ;
Hacein-Bey-Abina, S. ;
Cavazzana-Calvo, M. ;
Ben-Porat, N. ;
Von Kalle, C. ;
Fischer, A. ;
Kerem, B. .
GENE THERAPY, 2006, 13 (13) :1057-1059
[6]
Mutational Dynamics of Microsatellites [J].
Bhargava, Atul ;
Fuentes, F. F. .
MOLECULAR BIOTECHNOLOGY, 2010, 44 (03) :250-266
[7]
Integrating diverse databases into an unified analysis framework: a Galaxy approach [J].
Blankenberg, Daniel ;
Coraor, Nathan ;
Von Kuster, Gregory ;
Taylor, James ;
Nekrutenko, Anton .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
[8]
The FRA2C common fragile site maps to the borders of MYCN amplicons in neuroblastoma and is associated with gross chromosomal rearrangements in different cancers [J].
Blumrich, Anne ;
Zapatka, Marc ;
Brueckner, Lena M. ;
Zheglo, Diana ;
Schwab, Manfred ;
Savelyeva, Larissa .
HUMAN MOLECULAR GENETICS, 2011, 20 (08) :1488-1501
[9]
Borgaonkar DS, 1994, CHROMOSONIAL VARIATI
[10]
Characterization of FRA7B, a human common fragile site mapped at the 7p chromosome terminal region [J].
Bosco, Nazario ;
Pelliccia, Franca ;
Rocchi, Angela .
CANCER GENETICS AND CYTOGENETICS, 2010, 202 (01) :47-52