Genome-wide DNA polymorphism analyses using VariScan

被引:95
作者
Hutter, Stephan
Vilella, Albert J.
Rozas, Julio
机构
[1] Univ Barcelona, Fac Biol, Dept Genet, E-08028 Barcelona, Spain
[2] Univ Munich, Dept Biol Evolutionary Biol 2, Munich, Germany
关键词
D O I
10.1186/1471-2105-7-409
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
引用
收藏
页数:10
相关论文
共 44 条
[1]   Adaptive evolution of non-coding DNA in Drosophila [J].
Andolfatto, P .
NATURE, 2005, 437 (7062) :1149-1152
[2]  
[Anonymous], CBMS NSF REGIONAL C
[3]   CHARACTERIZING LONG-RANGE CORRELATIONS IN DNA-SEQUENCES FROM WAVELET ANALYSIS [J].
ARNEODO, A ;
BACRY, E ;
GRAVES, PV ;
MUZY, JF .
PHYSICAL REVIEW LETTERS, 1995, 74 (16) :3293-3296
[4]   Neutrality tests based on the distribution of haplotypes under an infinite-site model [J].
Depaulis, F ;
Veuille, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (12) :1788-1790
[5]   Arlequin (version 3.0): An integrated software package for population genetics data analysis [J].
Excoffier, Laurent ;
Laval, Guillaume ;
Schneider, Stefan .
EVOLUTIONARY BIOINFORMATICS, 2005, 1 :47-50
[6]   A sliding window-based method to detect selective constraints in protein-coding genes and its application to RNA viruses [J].
Fares, MA ;
Elena, SF ;
Ortiz, J ;
Moya, A ;
Barrio, E .
JOURNAL OF MOLECULAR EVOLUTION, 2002, 55 (05) :509-521
[7]  
Fay JC, 2000, GENETICS, V155, P1405
[8]   PROSEQ: A software for preparation and evolutionary analysis of DNA sequence data sets [J].
Filatov, DA .
MOLECULAR ECOLOGY NOTES, 2002, 2 (04) :621-624
[9]  
FU YX, 1993, GENETICS, V133, P693
[10]  
Fu YX, 1997, GENETICS, V147, P915