PIQA: pipeline for Illumina G1 genome analyzer data quality assessment

Cited by: 27
Authors
Martinez-Alcantara, A. [1]
Ballesteros, E. [1, 2]
Feng, C. [1]
Rojas, M. [1]
Koshinsky, H. [3]
Fofanov, V. Y. [3]
Havlak, P. [1]
Fofanov, Y. [1]
Affiliations
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
[2] Univ Guadalajara, Dept Phys, Guadalajara, Jalisco, Mexico
[3] Eureka Genom Corp, Houston, TX USA
DOI
10.1093/bioinformatics/btp429
CLC number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
PIQA is a quality analysis pipeline designed to examine genomic reads produced by Next Generation Sequencing technology (Illumina G1 Genome Analyzer). A short statistical summary, together with tile-by-tile and cycle-by-cycle graphical representations of cluster density, quality scores and nucleotide frequencies, allows easy identification of various technical problems, including defective tiles, mistakes in sample/library preparation and abnormalities in the frequencies of appearance of sequenced genomic reads. PIQA is written in the R statistical programming language and is compatible with the bustard, fastq and scarf Illumina G1 Genome Analyzer data formats.
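The cycle-by-cycle summaries mentioned in the abstract can be illustrated with a minimal sketch. This is not PIQA's code (PIQA is written in R); it is a Python illustration under stated assumptions: reads arrive as plain four-line FASTQ records, quality characters are decoded with an offset of 64, and the function names `parse_fastq` and `per_cycle_summary` are hypothetical. The correct offset and score scale depend on the Illumina pipeline version that produced the data.

```python
from collections import Counter


def parse_fastq(lines):
    """Yield (sequence, quality_string) pairs from four-line FASTQ records."""
    lines = [ln.rstrip("\n") for ln in lines]
    for i in range(0, len(lines) - 3, 4):
        yield lines[i + 1], lines[i + 3]


def per_cycle_summary(records, phred_offset=64):
    """Return (per-cycle base frequencies, per-cycle mean quality).

    phred_offset=64 is an assumption; adjust it to match the data.
    """
    base_counts = []  # one Counter of bases per sequencing cycle
    qual_sums = []    # sum of decoded quality scores per cycle
    qual_n = []       # number of bases observed per cycle

    for seq, qual in records:
        for cycle, (base, q) in enumerate(zip(seq, qual)):
            if cycle == len(base_counts):
                # First time this cycle is seen: allocate its accumulators.
                base_counts.append(Counter())
                qual_sums.append(0)
                qual_n.append(0)
            base_counts[cycle][base] += 1
            qual_sums[cycle] += ord(q) - phred_offset
            qual_n[cycle] += 1

    freqs = [
        {base: count / sum(counts.values()) for base, count in counts.items()}
        for counts in base_counts
    ]
    mean_quality = [s / n for s, n in zip(qual_sums, qual_n)]
    return freqs, mean_quality
```

Calling `per_cycle_summary(parse_fastq(open_file))` on one tile's reads gives the two series that a cycle-by-cycle plot would display. A cycle whose base frequencies or mean quality deviate sharply from its neighbours is the kind of abnormality the abstract describes.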
Pages: 2438-2439
Number of pages: 2