Sequential patterns mining and gene sequence visualization to discover novelty from microarray data

被引:23
作者
Sallaberry, A. [4 ]
Pecheur, N. [3 ]
Bringay, S. [2 ,3 ]
Roche, M. [3 ]
Teisseire, M. [1 ]
机构
[1] Irstea, UMR TETIS, Maison Teledetect, F-34093 Montpellier, France
[2] Univ Montpellier 3, MIAp Dept, F-34199 Montpellier 5, France
[3] Univ Montpellier 2, CNRS, LIRMM, F-34095 Montpellier 5, France
[4] INRIA Bordeaux Sud Ouest, LaBRI, F-33405 Talence, France
关键词
Visualization; Data mining; Bioinformatics; Sequential patterns; Microarray data; Gene data; METHODOLOGY; SEARCH; MODEL;
D O I
10.1016/j.jbi.2011.04.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data mining allow users to discover novelty in huge amounts of data. Frequent pattern methods have proved to be efficient, but the extracted patterns are often too numerous and thus difficult to analyze by end users. In this paper, we focus on sequential pattern mining and propose a new visualization system to help end users analyze the extracted knowledge and to highlight novelty according to databases of referenced biological documents. Our system is based on three visualization techniques: clouds, solar systems, and treemaps. We show that these techniques are very helpful for identifying associations and hierarchical relationships between patterns among related documents. Sequential patterns extracted from gene data using our system were successfully evaluated by two biology laboratories working on Alzheimer's disease and cancer. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:760 / 774
页数:15
相关论文
共 40 条
  • [21] A novel visualization model for web search results
    Nguyen, Tien N.
    Zhang, Jin
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (05) : 981 - 988
  • [22] NIN J, 2009, 22 IEEE INT S COMP B
  • [23] Pensa RG, 2004, LECT NOTES COMPUT SC, V3245, P230
  • [24] PERLIN K, 1993, SIGGRAPH 93, P57
  • [25] iMotifs: an integrated sequence motif visualization and analysis environment
    Piipari, Matias
    Down, Thomas A.
    Saini, Harpreet
    Enright, Anton
    Hubbard, Tim J. P.
    [J]. BIOINFORMATICS, 2010, 26 (06) : 843 - 844
  • [26] Priyantha N.B., 2003, P 1 INT C EMBEDDED N, P340, DOI DOI 10.1145/958491.958550
  • [27] TIDIER DRAWINGS OF TREES
    REINGOLD, EM
    TILFORD, JS
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1981, 7 (02) : 223 - 228
  • [28] State of the art: Coordinated & multiple views in exploratory visualization
    Roberts, Jonathan C.
    [J]. CMV 2007: FIFTH INTERNATIONAL CONFERENCE ON COORDINATED & MULTIPLE VIEWS IN EXPLORATORY VISUALIZATION, PROCEEDINGS, 2007, : 61 - 71
  • [29] SALLABERRY A, 2010, ISVC, V3, P534
  • [30] GeneMining: Identification, Visualization, and Interpretation of Brain Ageing Signatures
    Salle, Paola
    Bringay, Sandra
    Teisseire, Maguelonne
    Chakkour, Feirouz
    Roche, Mathieu
    Rassoul, Ronza Abdel
    Verdier, Jean-Michel
    Devau, Gina
    [J]. MEDICAL INFORMATICS IN A UNITED AND HEALTHY EUROPE, 2009, 150 : 767 - 771