IMEx: Imperfect microsatellite extractor

被引:161
作者
Mudunuri, Suresh B. [1 ]
Nagarajaram, Hampapathalu A. [1 ]
机构
[1] CDFD, Lab Computat Biol, Hyderabad 500076, Andhra Pradesh, India
关键词
D O I
10.1093/bioinformatics/btm097
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Microsatellites, also known as simple sequence repeats, are the tandem repeats of nucleoti de motifs of the size 1-6bp found in every genome known so far. Their importance in genomes is well known. Microsatellites are associated with various disease genes, have been used as molecular markers in linkage analysis and DNA fingerprinting studies, and also seem to play an important role in the genome evolution. Therefore, it is of importance to study distribution, enrichment and polymorphism of microsatellites in the genomes of interest. For this, the prerequisite is the availability of a computational tool for extraction of microsatellites (perfect as well as imperfect) and their related information from whole genome sequences. Examination of available tools revealed certain lacunae in them and prompted us to develop a new tool. Results: In order to efficiently screen genome sequences for microsatellites (perfect as well as imperfect), we developed a new tool called IMEx (Imperfect Microsatellite Extractor). IMEx uses simple string-matching algorithm with sliding window approach to screen DNA sequences for microsatellites and reports the motif, copy number, genomic location, nearby genes, mutational events and many other features useful for in-depth studies. IMEx is more sensitive, efficient and useful than the available widely used tools. IMEx is available in the form of a stand-alone program as well as in the form of a web-server.
引用
收藏
页码:1181 / 1187
页数:7
相关论文
共 32 条
[1]  
ABAJIAN C, SPUTNIK
[2]   SSRscanner: a program for reporting distribution and exact location of simple sequence repeats [J].
Anwar, Tamanna ;
Khan, Asad U. .
BIOINFORMATION, 2006, 1 (03) :89-91
[3]   InSatDb: a microsatellite database of fully sequenced insect genomes [J].
Archak, Sunil ;
Meduri, Eshwar ;
Kumar, P. Sravana ;
Nagaraju, J. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D36-D39
[4]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[5]   Poly: a quantitative analysis tool for simple sequence repeat (SSR) tracts in DNA [J].
Bizzaro, JW ;
Marx, KA .
BMC BIOINFORMATICS, 2003, 4 (1)
[6]   TRbase: a database relating tandem repeats to disease genes for the human genome [J].
Boby, T ;
Patch, AM ;
Aves, SJ .
BIOINFORMATICS, 2005, 21 (06) :811-816
[7]   Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression [J].
Boeva, V ;
Regnier, M ;
Papatsenko, D ;
Makeev, V .
BIOINFORMATICS, 2006, 22 (06) :676-684
[8]   Mutation rate in human microsatellites:: Influence of the structure and length of the tandem repeat [J].
Brinkmann, B ;
Klintschar, M ;
Neuhuber, F ;
Hühne, J ;
Rolf, B .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 62 (06) :1408-1415
[9]   The rise, fall and renaissance of microsatellites in eukaryotic genomes [J].
Buschiazzo, Emmanuel ;
Gemmell, Neil J. .
BIOESSAYS, 2006, 28 (10) :1040-1050
[10]   TROLL-Tandem Repeat Occurrence Locator [J].
Castelo, AT ;
Martins, W ;
Gao, GR .
BIOINFORMATICS, 2002, 18 (04) :634-636