Algorithms and software for support of gene identification experiments

被引:11
作者
Sze, SH [1 ]
Roytberg, MA
Gelfand, MS
Mironov, AA
Astakhova, TV
Pevzner, PA
机构
[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[2] Univ So Calif, Dept Math, Los Angeles, CA 90089 USA
[3] Russian Acad Sci, Inst Math Problems Biol, Moscow, Russia
[4] Russian Acad Sci, Inst Prot Res, Moscow, Russia
[5] NIIGENETIKA, Natl Biotechnol Ctr, Lab Math Methods, Moscow, Russia
[6] ArkaGene Inc, San Mateo, CA 94403 USA
关键词
D O I
10.1093/bioinformatics/14.1.14
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene annotation is the final goal of gene prediction algorithms?. However these algorithms frequently make mistakes and therefore the use of gene predictions for sequence annotation is hardly possible. As a result, biologists ma forced to conduct time-consuming gene identification experiments by designing appropriate PCR primers to test cDNA libraries or applying RT-PCR, exon trapping/amplification, or other techniques. This process frequently amounts to 'guessing' PCR primers on top of unreliable gene predictions and frequently leads to wasting of experimental efforts. Results: The present paper proposes a simple and reliable algorithm for experimental gene identification which by passes the unreliable gene prediction step. Studies of the performance of the algorithm on a sample of human genes indicate that an experimental protocol based on the algorithms predictions achieves at? accurate gene identification with relatively few PCR primers. Predictions of PCR primers may be used for exon amplification in preliminary! mutation analysis during ai? attempt to identify a gene responsible for a disease. We propose a simple approach to find a short region from a genomic sequence that with high probability overlaps with some exon of the gene. The algorithm is enhanced to find one or more segments that al-e probably contained hilt the translated region of the gene and can be used as PCR primers to select appropriate clones in cDNA libraries by selective amplification. The algorithm is further extended to locate a set of PCR primers that uniformly: cover ail translated regions anti can be used Soi RT-PCR and further sequencing of (unknown) mRNA. Availability: The programs are implemented as Web servers (GenePrimer and CASSANDRA) and can be reached at http://www-hto.usc.edu/software/procrustes/ Contact: ssze@hto.usc.edu.
引用
收藏
页码:14 / 19
页数:6
相关论文
共 24 条
[1]   The Caenorhabditis elegans gene unc-89, required for muscle M-line assembly, encodes a giant modular protein composed of Ig and signal transduction domains [J].
Benian, GM ;
Tinley, TL ;
Tang, XX ;
Borodovsky, M .
JOURNAL OF CELL BIOLOGY, 1996, 132 (05) :835-848
[2]   CONSTRUCTION OF A TRANSCRIPTION MAP SURROUNDING THE BRCA1 LOCUS OF HUMAN-CHROMOSOME-17 [J].
BRODY, LC ;
ABEL, KJ ;
CASTILLA, LH ;
COUCH, FJ ;
MCKINLEY, DR ;
YIN, GY ;
HO, PP ;
MERAJVER, S ;
CHANDRASEKHARAPPA, SC ;
XU, JZ ;
COLE, JL ;
STRUEWING, JP ;
VALDES, JM ;
COLLINS, FS ;
WEBER, BL .
GENOMICS, 1995, 25 (01) :238-247
[3]   CLONING A BALANCED TRANSLOCATION ASSOCIATED WITH DIGEORGE-SYNDROME AND IDENTIFICATION OF A DISRUPTED CANDIDATE GENE [J].
BUDARF, ML ;
COLLINS, J ;
GONG, WL ;
ROE, B ;
WANG, ZL ;
BAILEY, LC ;
SELLINGER, B ;
MICHAUD, D ;
DRISCOLL, DA ;
EMANUEL, BS .
NATURE GENETICS, 1995, 10 (03) :269-278
[4]   Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[5]   IDENTIFICATION OF HUMAN CHROMOSOME-9 SPECIFIC GENES USING EXON AMPLIFICATION [J].
CHURCH, DM ;
BANKS, LT ;
ROGERS, AC ;
GRAW, SL ;
HOUSMAN, DE ;
GUSELLA, JF ;
BUCKLER, AJ .
HUMAN MOLECULAR GENETICS, 1993, 2 (11) :1915-1920
[6]  
FICKETT JW, 1992, NUCLEIC ACIDS RES, V21, P2837
[7]  
Gelfand M S, 1995, J Comput Biol, V2, P87, DOI 10.1089/cmb.1995.2.87
[8]   Gene recognition via spliced sequence alignment [J].
Gelfand, MS ;
Mironov, AA ;
Pevzner, PA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (17) :9061-9066
[9]  
GELFAND MS, 1996, GENOME INFORMATICS 1, P82
[10]   Cloning, genomic organization, and chromosomal localization of human citrate transport protein to the DiGeorge velocardiofacial syndrome minimal critical region [J].
Goldmuntz, E ;
Wang, ZL ;
Roe, BA ;
Budarf, ML .
GENOMICS, 1996, 33 (02) :271-276