Xlandscape: the graphical display of word frequencies in sequences

被引:11
作者
Levy, S
Compagnoni, L
Myers, EW [1 ]
Stormo, GD
机构
[1] Univ Colorado, Boulder, CO 80309 USA
[2] Univ Arizona, Dept Comp Sci, Tucson, AZ 85721 USA
关键词
D O I
10.1093/bioinformatics/14.1.74
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: To provide a graphical interface for two generation display and manipulation of a sequence landscape that will run on all X-windows-based Unix workstations. Results: The sequence landscape approach enables the representation of the frequency of occurrence of all query sequence sub-words within a database. The landscape approach can detect tandem and other repeating word motifs, specific sub-words that are over-represented words in a particular database using Markov probability and the preference for sub-words belonging to either one of two databases. All these features aid in the classification of a query sequence. Given the open-text format for sequences and databases, the Xlandscape tool can be applied to a wide range of problems. Availability: Xlandscape is freely available by anonymous ftp to beagle.colorado.edu, directory, pub/Landscape/xland.vl. Contact: Samuel.Levy@colorado.edu or Gary.Stormo@colorado.edu
引用
收藏
页码:74 / 80
页数:7
相关论文
共 27 条
[1]  
BUCHER P, 1996, EUKARYOTIC PROMOTER
[2]  
Chen QK, 1997, COMPUT APPL BIOSCI, V13, P29
[3]   HEURISTIC INFORMATIONAL ANALYSIS OF SEQUENCES [J].
CLAVERIE, JM ;
BOUGUELERET, L .
NUCLEIC ACIDS RESEARCH, 1986, 14 (01) :179-196
[4]   SEQUENCE LANDSCAPES [J].
CLIFT, B ;
HAUSSLER, D ;
MCCONNELL, R ;
SCHNEIDER, TD ;
STORMO, GD .
NUCLEIC ACIDS RESEARCH, 1986, 14 (01) :141-158
[5]   Characteristic enrichment of DNA repeats in different genomes [J].
Cox, R ;
Mirkin, SM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (10) :5237-5242
[6]  
Hertz GZ, 1996, METHOD ENZYMOL, V273, P30
[7]  
Hutchinson GB, 1996, COMPUT APPL BIOSCI, V12, P391
[8]   HETEROGENEITY OF GENOMES - MEASURES AND VALUES [J].
KARLIN, S ;
LADUNGA, I ;
BLAISDELL, BE .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (26) :12837-12841
[9]   Trinucleotide repeats and long homopeptides in genes and proteins associated with nervous system disease and development [J].
Karlin, S ;
Burge, C .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (04) :1560-1565
[10]  
KARLIN S, 1995, TRENDS GENET, V11, P283