Association of nucleotide patterns with gene function classes: application to human 3′ untranslated sequences

Cited by: 20
Authors
Conklin, D
Jonassen, I
Aasland, R
Taylor, WR
Affiliations
[1] Zymogenet Inc, Seattle, WA 98102 USA
[2] Univ Bergen, Dept Informat, N-5020 Bergen, Norway
[3] Univ Bergen, Dept Mol Biol, N-5020 Bergen, Norway
[4] Natl Inst Med Res, Div Math Biol, London NW7 1AA, England
DOI
10.1093/bioinformatics/18.1.182
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Gene expression depends on two main types of signals: one involves transcription factors and initiates gene transcription, and the other regulates the translation of a nascent mRNA. These posttranscriptional events play an important yet incompletely understood role in regulating gene expression and cellular behavior. Many of the identified cis-acting elements for translational regulation occur within the 3' untranslated region (3' UTR), and some have been observed to occur with surprising regularity within certain protein function classes.
Results: In this study, we present a new association rule mining method for discovering nucleotide sequence patterns that appear in more sequences than expected within protein function classes. The method is applied to a database of human 3' UTR sequences, and some significant associations between nucleotide patterns and protein function classes are discovered. Among previously identified patterns, the AU-rich element (ARE) is found here to occur within the 3' UTRs of cytokines, providing statistical validation of an association often reported in the literature. The method has also identified some GC-rich patterns, found to occur within the 3' UTRs of homeodomain transcription factors and nuclear proteins. The method should be applicable to many types of regulatory element discovery.
Contact: conklin@zgi.com
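The abstract's phrase "appear in more sequences than expected" can be illustrated with a small sketch. This is not the authors' association rule mining method, whose details are not given in this record. It assumes a plain one-sided hypergeometric test, exact substring matching, and invented toy sequences. The names `enrichment_p_value`, `pattern_class_association`, and `toy_utrs` are hypothetical.

```python
from math import comb


def enrichment_p_value(N: int, K: int, n: int, k: int) -> float:
    """Upper-tail hypergeometric probability P(X >= k).

    N: total sequences, K: sequences containing the pattern,
    n: sequences in the function class, k: class sequences containing it.
    """
    upper = min(K, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / comb(N, n)


def pattern_class_association(utrs, pattern, target_class):
    """Count pattern occurrences per sequence (presence/absence only)
    and test for over-representation within target_class.

    utrs is a list of (sequence, class_label) pairs.
    """
    N = len(utrs)
    K = sum(1 for seq, _ in utrs if pattern in seq)
    n = sum(1 for _, label in utrs if label == target_class)
    k = sum(1 for seq, label in utrs if label == target_class and pattern in seq)
    return k, n, K, N, enrichment_p_value(N, K, n, k)


# Toy data, invented for illustration only (not from the paper's database).
# "ATTTA" is the DNA spelling of the AUUUA pentamer found in AU-rich elements.
toy_utrs = [
    ("GCATTTATTTAGG", "cytokine"),
    ("TTATTTACC", "cytokine"),
    ("CCATTTAGG", "cytokine"),
    ("GGCCGGCC", "nuclear"),
    ("GCGCATTTAGC", "nuclear"),
    ("CCGGCCGG", "nuclear"),
    ("GGGCCC", "nuclear"),
    ("CGCGCG", "nuclear"),
]

k, n, K, N, p = pattern_class_association(toy_utrs, "ATTTA", "cytokine")
print(f"{k}/{n} cytokine UTRs vs {K}/{N} overall, p = {p:.4f}")
```

With these toy counts (3 of 3 cytokine sequences against 4 of 8 overall), the tail probability is 4/56, about 0.071. A real analysis would also need a pattern search over many candidates and a correction for multiple testing, neither of which is attempted here.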
Pages: 182-189
Number of pages: 8
Related papers
37 in total
[11] Curtis, D; Lehmann, R; Zamore, PD. Translational regulation in development [J]. Cell, 1995, 81(2): 171-178
[12] Fan, XHC; Myer, VE; Steitz, JA. AU-rich elements target small nuclear RNAs as well as mRNAs for rapid degradation [J]. Genes & Development, 1997, 11(19): 2557-2568
[13] Fan, XHC; Steitz, JA. HNS, a nuclear-cytoplasmic shuttling sequence in HuR [J]. Proceedings of the National Academy of Sciences of the United States of America, 1998, 95(26): 15293-15298
[14] Ferrandon, D; Koch, I; Westhof, E; Nüsslein-Volhard, C. RNA-RNA interaction is required for the formation of specific bicoid mRNA 3' UTR-STAUFEN ribonucleoprotein particles [J]. EMBO Journal, 1997, 16(7): 1751-1758
[15] Gautheret, D; Poirot, O; Lopez, F; Audic, S; Claverie, JM. Alternate polyadenylation in human mRNAs: a large-scale analysis by EST clustering [J]. Genome Research, 1998, 8(5): 524-530
[16] Gorodkin, J; Stricklin, SL; Stormo, GD. Discovering common stem-loop motifs in unaligned RNA sequences [J]. Nucleic Acids Research, 2001, 29(10): 2135-2144
[17] Henikoff, S; Henikoff, JG. Position-based sequence weights [J]. Journal of Molecular Biology, 1994, 243(4): 574-578
[18] Hofmann, K; Bucher, P; Falquet, L; Bairoch, A. The PROSITE database, its status in 1999 [J]. Nucleic Acids Research, 1999, 27(1): 215-219
[19] Kruys, V; Huez, G. Translational control of cytokine expression by 3' UA-rich sequences [J]. Biochimie, 1994, 76(9): 862-866
[20] Masys, DR; Welsh, JB; Fink, JL; Gribskov, M; Klacansky, I; Corbeil, J. Use of keyword hierarchies to interpret gene expression patterns [J]. Bioinformatics, 2001, 17(4): 319-326