INFERRING GENES FROM OPEN READING FRAMES

被引:6
作者
FICKETT, JW
机构
[1] Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos
来源
COMPUTERS & CHEMISTRY | 1994年 / 18卷 / 03期
关键词
D O I
10.1016/0097-8485(94)85014-3
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
One expects that in DNA without protein coding function, stop codons (which constitute three of the 64 possible codons) should occur frequently in all reading frames, and that a long open reading frame (ORF) can be interpreted as a sign for the existence of a gene. We make a beginning on introducing quantitative measures of confidence into this inference-taking Saccharomyces cerevisiae as a sample case-and show that some common assumptions can reasonably be questioned. In particular we show that statistical support for the biological function of shorter ORFs listed as putative genes in recent papers is in fact very weak. This is an issue of practical as well as theoretical interest, since researching the function of a putative gene is difficult and expensive.
引用
收藏
页码:203 / 205
页数:3
相关论文
共 24 条
[21]   REGIONAL BASE COMPOSITION VARIATION ALONG YEAST CHROMOSOME-III - EVOLUTION OF CHROMOSOME PRIMARY STRUCTURE [J].
SHARP, PM ;
LLOYD, AT .
NUCLEIC ACIDS RESEARCH, 1993, 21 (02) :179-183
[22]   IDENTIFICATION OF CODING REGIONS IN GENOMIC DNA-SEQUENCES - AN APPLICATION OF DYNAMIC-PROGRAMMING AND NEURAL NETWORKS [J].
SNYDER, EE ;
STORMO, GD .
NUCLEIC ACIDS RESEARCH, 1993, 21 (03) :607-613
[23]   CORRELATION BETWEEN OBSERVED TRANSCRIPTS AND SEQUENCED ORFS OF CHROMOSOME-III OF SACCHAROMYCES-CEREVISIAE [J].
TANAKA, S ;
ISONO, K .
NUCLEIC ACIDS RESEARCH, 1993, 21 (05) :1149-1153
[24]   LOCATING PROTEIN-CODING REGIONS IN HUMAN DNA-SEQUENCES BY A MULTIPLE SENSOR NEURAL NETWORK APPROACH [J].
UBERBACHER, EC ;
MURAL, RJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (24) :11261-11265