Wavelet-based method to disentangle transcription- and replication-associated strand asymmetries in mammalian genomes

被引:17
作者
Baker, Antoine [1 ,2 ,3 ]
Nicolay, Samuel [4 ]
Zaghloul, Lamia [1 ,2 ,3 ]
d'Aubenton-Carafa, Yves [5 ]
Thermes, Claude [5 ]
Audit, Benjamin [1 ,2 ,3 ]
Arneodo, Alain [1 ,2 ,3 ]
机构
[1] Univ Lyon, F-69007 Lyon, France
[2] Ecole Normale Super Lyon, CNRS, Lab Joliot Curie, F-69007 Lyon, France
[3] Ecole Normale Super Lyon, CNRS, Phys Lab, F-69007 Lyon, France
[4] Univ Liege, Inst Math, B-4000 Liege, Belgium
[5] CNRS, Ctr Genet Mol, F-91198 Gif Sur Yvette, France
关键词
Continuous wavelet transform; DNA sequences; Strand compositional asymmetry; Transcription; Replication; Mammalian genomes; LONG-RANGE CORRELATIONS; MULTIFRACTAL IMAGE-ANALYSIS; RESOLUTION SATELLITE IMAGES; EUKARYOTIC DNA-REPLICATION; LOG-NORMAL STATISTICS; FRACTAL CORRELATIONS; SCALING BEHAVIOR; SEQUENCES; FORMALISM; ORIGINS;
D O I
10.1016/j.acha.2009.10.003
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
During genome evolution, the two strands of the DNA double helix are not subjected to the same mutation patterns. This mutation bias is considered as a by-product of replicative and transcriptional activities. In this paper, we develop a wavelet-based methodology to analyze the DNA strand asymmetry profiles with the specific goal to extract the contributions associated with replication and transcription respectively. In a first step, we use an adapted N-shaped analyzing wavelet to perform a multi-scale pattern recognition analysis of the sum of the TA and GC skews along human chromosomes. This method provides an objective segmentation of the human genome in skew domains of similar or equal to 1 Mbp characteristic size, bordered by two putative replication origins recognized as large amplitude upward jumps in the noisy skew profile. In a second step, we use a least-square fitting procedure to disentangle, in these skew domains, the small-scale (the mean human gene size similar or equal to 30 kbp) square-like transcription component from the global N-shaped component induced by replication. When applying this procedure to the 22 human autosomes, we delineate 678 replication domains of mean length (L) over bar = 1.2 +/- 0.6 Mbp spanning 33.8% of the human genome and we predict 1062 replication origins. When investigating the distribution of transcription-associated skew inside the replication N-domains, we reveal some dependence upon the distance to the putative replication origins located at N-domain extremities, the closer the genes to the origin, the larger their transcription bias as the signature of a higher transcriptional activity in the germ-line. As a comparative analysis, we further apply Our wavelet-based methodology to skew profiles along the mouse chromosomes. The striking similarity of the results in human and mouse indicates that the remarkable gene organization observed inside the human replication N-domains is likely to be a general feature of mammalian genomes. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:150 / 170
页数:21
相关论文
共 148 条
[1]   Dynamics of DNA replication in mammalian somatic cells: Nucleotide pool modulates origin choice and interorigin spacing [J].
Anglana, M ;
Apiou, F ;
Bensimon, A ;
Debatisse, M .
CELL, 2003, 114 (03) :385-394
[2]  
[Anonymous], 1992, CBMS-NSF Reg. Conf. Ser. in Appl. Math
[3]  
[Anonymous], 2006, Wavelet Anal. Appl.
[4]  
[Anonymous], 2001, Image Anal Stereol
[5]  
[Anonymous], 1989, WAVELETS
[6]   A wavelet-based method for multifractal image analysis.: I.: Methodology and test applications on isotropic and anisotropic random rough surfaces [J].
Arnédo, A ;
Decoster, N ;
Roux, SG .
EUROPEAN PHYSICAL JOURNAL B, 2000, 15 (03) :567-600
[7]   Intermittency, log-normal statistics, and multifractal cascade process in high-resolution satellite images of cloud structure [J].
Arneodo, A ;
Decoster, N ;
Roux, SG .
PHYSICAL REVIEW LETTERS, 1999, 83 (06) :1255-1258
[8]   Revealing a lognormal cascading process in turbulent velocity statistics with wavelet analysis [J].
Arneodo, A ;
Manneville, S ;
Muzy, JF ;
Roux, SG .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1999, 357 (1760) :2415-2438
[9]   Towards log-normal statistics in high Reynolds number turbulence [J].
Arneodo, A ;
Manneville, S ;
Muzy, JF .
EUROPEAN PHYSICAL JOURNAL B, 1998, 1 (01) :129-140
[10]   Wavelet based fractal analysis of DNA sequences [J].
Arneodo, A ;
dAubentonCarafa, Y ;
Bacry, E ;
Graves, PV ;
Muzy, JF ;
Thermes, C .
PHYSICA D-NONLINEAR PHENOMENA, 1996, 96 (1-4) :291-320