InfraPhenoGrid: A scientific workflow infrastructure for plant phenomics on the Grid

被引:15
作者
Pradal, Christophe [1 ,2 ]
Artzet, Simon [2 ,3 ]
Chopard, Jerome [2 ,4 ]
Dupuis, Dimitri [5 ]
Fournier, Christian [2 ,3 ]
Mielewczik, Michael [3 ,6 ]
Negre, Vincent [3 ]
Neveu, Pascal [4 ]
Parigot, Didier [5 ]
Valduriez, Patrick [5 ]
Cohen-Boulakia, Sarah [2 ,5 ,7 ]
机构
[1] CIRAD, UMR AGAP, Montpellier, France
[2] Inria, VirtualPlants, Montpellier, France
[3] INRA, UMR459, LEPSE, F-34060 Montpellier, France
[4] INRA, UMR729, MISTEA, F-34060 Montpellier, France
[5] Inria, Zenith, Montpellier, France
[6] Imperial Coll London, NHLI, ICCH, London, England
[7] Univ Paris Saclay, CNRS UMR 8623, Univ Paris Sud, Lab Rech Informat, Orsay, France
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2017年 / 67卷
关键词
Phenomics; Scientific workflows; Provenance; Grid computing; PROVENANCE; COMPONENT; FUTURE; ROBUST;
D O I
10.1016/j.future.2016.06.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Plant phenotyping consists in the observation of physical and biochemical traits of plant genotypes in response to environmental conditions. Challenges, in particular in context of climate change and food security, are numerous. High-throughput platforms have been introduced to observe the dynamic growth of a large number of plants in different environmental conditions. Instead of considering a few genotypes at a time (as it is the case when phenomic traits are measured manually), such platforms make it possible to use completely new kinds of approaches. However, the datasets produced by such widely instrumented platforms are huge, constantly augmenting and produced by increasingly complex experiments, reaching a point where distributed computation is mandatory to extract knowledge from data. In this paper, we introduce InfraPhenoGrid, the infrastructure we designed and deploy to efficiently manage datasets produced by the PhenoArch plant phenomics platform in the context of the French Phenome Project. Our solution consists in deploying scientific workflows on a Grid using a middleware to pilot workflow executions. Our approach is user-friendly in the sense that despite the intrinsic complexity of the infrastructure, running scientific workflows and understanding results obtained (using provenance information) is kept as simple as possible for end-users. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:341 / 353
页数:13
相关论文
共 45 条
  • [11] Bao ZW, 2009, PROC INT CONF DATA, P808, DOI 10.1109/ICDE.2009.103
  • [12] Querying and managing provenance through user views in scientific workflows
    Biton, Olivier
    Cohen-Boulakia, Sarah
    Davidson, Susan B.
    Hara, Carmern S.
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1072 - +
  • [13] Chapman A., 2007, IEEE DATA ENG B, V30, P38
  • [14] Chirigati FernandoSeabra., 2013, TaPP
  • [15] Distilling structure in Taverna scientific workflows: a refactoring approach
    Cohen-Boulakia, Sarah
    Chen, Jiuqiang
    Missier, Paolo
    Goble, Carole
    Williams, Alan R.
    Froidevaux, Christine
    [J]. BMC BIOINFORMATICS, 2014, 15 : 1 - 14
  • [16] Search, Adapt, and Reuse: The Future of Scientific Workflows
    Cohen-Boulakia, Sarah
    Leser, Ulf
    [J]. SIGMOD RECORD, 2011, 40 (02) : 6 - 16
  • [17] Cohen-Boulakia S, 2011, PROC INT CONF DATA, P1366, DOI 10.1109/ICDE.2011.5767957
  • [18] Mean shift: A robust approach toward feature space analysis
    Comaniciu, D
    Meer, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 603 - 619
  • [19] Cell to whole-plant phenotyping: the best is yet to come
    Dhondt, Stijn
    Wuyts, Nathalie
    Inze, Dirk
    [J]. TRENDS IN PLANT SCIENCE, 2013, 18 (08) : 433 - 444
  • [20] Future Scenarios for Plant Phenotyping
    Fiorani, Fabio
    Schurr, Ulrich
    [J]. ANNUAL REVIEW OF PLANT BIOLOGY, VOL 64, 2013, 64 : 267 - 291