FEATURE EXPRESSIONS - CREATING AND MANIPULATING SEQUENCE DATASETS

被引:7
作者
FRISTENSKY, B
机构
[1] Department of Plant Science, University of Manitoba, Winnipeg
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1093/nar/21.25.5997
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Annotation of features, such as introns, exons and protein coding regions in GenBank/EMBL/DDBJ entries is now standardized through use of the Features Table (FT) language. The essence of the FT language is described by the relation 'expression --> sequence', meaning that each FT expression evaluates to a sequence. For example, the expression M74750:1..50 evaluates to the first 50 bases of the sequence with accession number M74750. Because FT is intrinsic to the database definition, it can serve as a software- and platform-independent lingua franca for sequence manipulation. The XYLEM package makes it possible to create and manipulate sequence datasets using FT expressions. FEATURES is a program that resolves FT expressions into their corresponding sequences. Annotated features can be retrieved either by feature key or by expression. Even unannotated portions of a sequence can be retrieved by user-generated FT expressions. Applications of the FT language include retrieval of subsequences from large sequence entries, generation of chromosome models or artificial DNA constructs, and representation of restriction maps or mutants.
引用
收藏
页码:5997 / 6003
页数:7
相关论文
共 16 条
[1]   GENBANK [J].
BURKS, C ;
CASSIDY, M ;
CINKOSKY, MJ ;
CUMELLA, KE ;
GILNA, P ;
HAYDEN, JED ;
KEEN, GM ;
KELLEY, TA ;
KELLY, M ;
KRISTOFFERSON, D ;
RYALS, J .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2221-2225
[2]   THE EMBL DATA LIBRARY [J].
CAMERON, GN .
NUCLEIC ACIDS RESEARCH, 1988, 16 (05) :1865-1867
[3]  
Jensen K., 1974, PASCAL USER MANUAL R
[4]   RECONSTRUCTION AND ANALYSIS OF HUMAN ALU GENES [J].
JURKA, J ;
MILOSAVLJEVIC, A .
JOURNAL OF MOLECULAR EVOLUTION, 1991, 32 (02) :105-121
[5]   THE PHYSICAL MAP OF THE WHOLE ESCHERICHIA-COLI CHROMOSOME - APPLICATION OF A NEW STRATEGY FOR RAPID ANALYSIS AND SORTING OF A LARGE GENOMIC LIBRARY [J].
KOHARA, Y ;
AKIYAMA, K ;
ISONO, K .
CELL, 1987, 50 (03) :495-508
[6]  
OSTELL J, 1990, TECH REP, V1
[7]  
PEARSON WR, 1990, METHOD ENZYMOL, V183, P63
[8]  
Pfeiffer F, 1988, Protein Seq Data Anal, V1, P269
[9]  
READ RL, 1992, COMPUT APPL BIOSCI, V8, P407
[10]   ALIGNMENT OF ESCHERICHIA-COLI K12 DNA-SEQUENCES TO A GENOMIC RESTRICTION MAP [J].
RUDD, KE ;
MILLER, W ;
OSTELL, J ;
BENSON, DA .
NUCLEIC ACIDS RESEARCH, 1990, 18 (02) :313-321