Information theoretical probe selection for hybridisation experiments

被引:33
作者
Herwig, R
Schmitt, AO
Steinfath, M
O'Brien, J
Seidel, H
Meier-Ewert, S
Lehrach, H
Radelof, U
机构
[1] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
[2] metaGen Gesell Genomforsch, D-14195 Berlin, Germany
[3] Schering AG, D-13342 Berlin, Germany
[4] GPC AG, D-82152 Martinsried, Germany
关键词
D O I
10.1093/bioinformatics/16.10.890
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The choice of probes is an important feature of hybridisation experiments. In this paper we present an algorithm that optimises probes with respect to a training set of sequences based on Shannon entropy as a quality criterion. The practical motivation for our algorithm is oligonucleotide fingerprinting, a method for the simultaneous identification of sequences (cDNA or genomic DNA) by their hybridisation tags according to a set of shot? probes such as octamers, although the algorithm is of course not restricted to that application. Results: We can show that our method is superior to the selection of probes according to their frequencies, which is a widely, used strategy, and to randomly chosen probe sets. The quality of probe sets is assessed by a simulation pipeline that entails the set of probes as a simulation parameter: The performance of probe sets trained on sequences from different organisms shows additionally that probes should be chosen with regard to the organism under analysis. Case studies are presented on how constraints (G + C-content, complexity of the individual probes) influence the selection process.
引用
收藏
页码:890 / 898
页数:9
相关论文
共 23 条
[1]  
[Anonymous], [No title captured]
[2]  
Clark MD, 1999, METHOD ENZYMOL, V303, P205
[3]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[4]  
CUTICCHIA AJ, 1993, COMPUT APPL BIOSCI, V9, P201
[5]   Gene-representing cDNA clusters defined by hybridization of 57,419 clones from infant brain libraries with short oligonucleotide probes [J].
Drmanac, S ;
Stavropoulos, NA ;
Labat, I ;
Vonau, J ;
Hauser, B ;
Soares, MB ;
Drmanac, R .
GENOMICS, 1996, 37 (01) :29-40
[6]   PNA HYBRIDIZES TO COMPLEMENTARY OLIGONUCLEOTIDES OBEYING THE WATSON-CRICK HYDROGEN-BONDING RULES [J].
EGHOLM, M ;
BUCHARDT, O ;
CHRISTENSEN, L ;
BEHRENS, C ;
FREIER, SM ;
DRIVER, DA ;
BERG, RH ;
KIM, SK ;
NORDEN, B ;
NIELSEN, PE .
NATURE, 1993, 365 (6446) :566-568
[7]   ON THE DESIGN OF GENOME MAPPING EXPERIMENTS USING SHORT SYNTHETIC OLIGONUCLEOTIDES [J].
FU, YX ;
TIMBERLAKE, WE ;
ARNOLD, J .
BIOMETRICS, 1992, 48 (02) :337-359
[8]  
Grillo G, 1996, COMPUT APPL BIOSCI, V12, P1
[9]   Large-scale clustering of cDNA-fingerprinting data [J].
Herwig, R ;
Poustka, AJ ;
Müller, C ;
Bull, C ;
Lehrach, H ;
O'Brien, J .
GENOME RESEARCH, 1999, 9 (11) :1093-1105
[10]   HYBRIDIZATION ANALYSES OF ARRAYED CDNA LIBRARIES [J].
LENNON, GG ;
LEHRACH, H .
TRENDS IN GENETICS, 1991, 7 (10) :314-317