Fuzzy association rules for biological data analysis: A case study on yeast

被引:29
作者
Lopez, Francisco J. [1 ]
Blanco, Armando [1 ]
Garcia, Fernando [1 ]
Cano, Carlos [1 ]
Marin, Antonio [2 ]
机构
[1] Univ Granada, Dept Comp Sci & AI, E-18071 Granada, Spain
[2] Univ Seville, Dept Genet, E-41012 Seville, Spain
关键词
D O I
10.1186/1471-2105-9-107
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Last years' mapping of diverse genomes has generated huge amounts of biological data which are currently dispersed through many databases. Integration of the information available in the various databases is required to unveil possible associations relating already known data. Biological data are often imprecise and noisy. Fuzzy set theory is specially suitable to model imprecise data while association rules are very appropriate to integrate heterogeneous data. Results: In this work we propose a novel fuzzy methodology based on a fuzzy association rule mining method for biological knowledge extraction. We apply this methodology over a yeast genome dataset containing heterogeneous information regarding structural and functional genome features. A number of association rules have been found, many of them agreeing with previous research in the area. In addition, a comparison between crisp and fuzzy results proves the fuzzy associations to be more reliable than crisp ones. Conclusion: An integrative approach as the one carried out in this work can unveil significant knowledge which is currently hidden and dispersed through the existing biological databases. It is shown that fuzzy association rules can model this knowledge in an intuitive way by using linguistic labels and few easy-understandable parameters.
引用
收藏
页数:18
相关论文
共 59 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] FatiGO+:: a functional profiling tool for genomic data.: Integration of functional annotation, regulatory motifs and interaction data with microarray experiments
    Al-Shahrour, Fatima
    Minguez, Pablo
    Tarraga, Joaquin
    Medina, Ignacio
    Alloza, Eva
    Montaner, David
    Dopazo, Joaquin
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : W91 - W96
  • [3] Machine learning in bioinformatics: A brief survey and recommendations for practitioners
    Bhaskar, Harish
    Hoyle, David C.
    Singh, Sameer
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (10) : 1104 - 1125
  • [4] Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution
    Birdsell, JA
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (07) : 1181 - 1197
  • [5] SMALL-SIZE MESSENGER-RNAS CODE FOR RIBOSOMAL-PROTEINS IN YEAST
    BOLLEN, GHPM
    MAGER, WH
    JENNESKENS, LW
    PLANTA, RJ
    [J]. EUROPEAN JOURNAL OF BIOCHEMISTRY, 1980, 105 (01): : 75 - 80
  • [6] TRANSCRIPTION-DEPENDENT DNA SUPERCOILING IN YEAST DNA TOPOISOMERASE MUTANTS
    BRILL, SJ
    STERNGLANZ, R
    [J]. CELL, 1988, 54 (03) : 403 - 411
  • [7] Integrated analysis of gene expression by association rules discovery
    Carmona-Saez, P
    Chagoyen, M
    Rodriguez, A
    Trelles, O
    Carazo, JM
    Pascual-Montano, A
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [8] CONFORMATIONAL INFORMATION IN DNA - ITS ROLE IN THE INTERACTION WITH DNA TOPOISOMERASE-I AND NUCLEOSOMES
    CASERTA, M
    CAMILLONI, G
    VENDITTI, S
    VENDITTI, P
    DIMAURO, E
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 1994, 55 (01) : 93 - 97
  • [9] CASTRILLO JI, 1996, J BIOCHEM MOL BIOL, V37, P93
  • [10] A genome-wide transcriptional analysis of the mitotic cell cycle
    Cho, RJ
    Campbell, MJ
    Winzeler, EA
    Steinmetz, L
    Conway, A
    Wodicka, L
    Wolfsberg, TG
    Gabrielian, AE
    Landsman, D
    Lockhart, DJ
    Davis, RW
    [J]. MOLECULAR CELL, 1998, 2 (01) : 65 - 73