Selecting signature oligonucleotides to identify organisms using DNA arrays

被引:74
作者
Kaderali, L
Schliep, A
机构
[1] Univ Cologne, Ctr Appl Comp Sci Cologne, D-50931 Cologne, Germany
[2] Max Planck Inst Mol Genet, Dept Computat Mol Biol, D-14195 Berlin, Germany
关键词
D O I
10.1093/bioinformatics/18.10.1340
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: DNA arrays are a very useful tool to quickly identify biological agents present in some given sample, e.g. to identify viruses causing disease, for quality control in the food industry, or to determine bacteria contaminating drinking water. The selection of specific oligos to attach to the array surface is a relevant problem in the experiment design process. Given a set S of genomic sequences (the target sequences), the task is to find at least one oligonucleotide, called probe, for each sequence in S. This probe will be attached to the array surface, and must be chosen in a way that it will not hybridize to any other sequence but the intended target. Furthermore, all probes on the array must hybridize to their intended targets under the same reaction conditions, most importantly at the temperature T at which the experiment is conducted. Results: We present an efficient algorithm for the probe design problem. Melting temperatures are calculated for all possible probe-target interactions using an extended nearest-neighbor model, allowing for both non-Watson-Crick base-pairing and unpaired bases within a duplex. To compute temperatures efficiently, a combination of suffix trees and dynamic programming based alignment algorithms is introduced. Additional filtering steps during preprocessing increase the speed of the computation. The practicability of the algorithms is demonstrated by two case studies: The identification of HIV-1 subtypes, and of 28S rDNA sequences from greater than or equal to400 organisms.
引用
收藏
页码:1340 / 1349
页数:10
相关论文
共 39 条
[1]   Nearest-neighbor thermodynamics of internal A•C mismatches in DNA:: Sequence dependence and pH effects [J].
Allawi, HT ;
SantaLucia, J .
BIOCHEMISTRY, 1998, 37 (26) :9435-9444
[2]   Nearest neighbor thermodynamic parameters for internal G•A mismatches in DNA [J].
Allawi, HT ;
SantaLucia, J .
BIOCHEMISTRY, 1998, 37 (08) :2170-2179
[3]   Thermodynamics and NMR of internal GT mismatches in DNA [J].
Allawi, HT ;
SantaLucia, J .
BIOCHEMISTRY, 1997, 36 (34) :10581-10594
[4]   Thermodynamics of internal C•T mismatches in DNA [J].
Allawi, HT ;
Santalucia, J .
NUCLEIC ACIDS RESEARCH, 1998, 26 (11) :2694-2701
[5]   Thermodynamic parameters for DNA sequences with dangling ends [J].
Bommarito, S ;
Peyret, N ;
SantaLucia, J .
NUCLEIC ACIDS RESEARCH, 2000, 28 (09) :1929-1934
[6]   PREDICTING DNA DUPLEX STABILITY FROM THE BASE SEQUENCE [J].
BRESLAUER, KJ ;
FRANK, R ;
BLOCKER, H ;
MARKY, LA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (11) :3746-3750
[7]  
CUPAL J, 1997, THESIS U VIENNA
[8]  
Delpech M, 2000, ANN BIOL CLIN-PARIS, V58, P29
[9]  
Durbin R., 1998, BIOL SEQUENCE ANAL
[10]  
FOX GE, 2000, RAPID IDENTIFICATION