Opening practice: supporting reproducibility and critical spatial data science

被引:63
作者
Brunsdon, Chris [1 ]
Comber, Alexis [2 ]
机构
[1] Maynooth Univ, Natl Ctr Geocomputat, Maynooth, Kildare, Ireland
[2] Univ Leeds, Sch Geog, Leeds, W Yorkshire, England
基金
英国自然环境研究理事会;
关键词
Critical data science; Open source; GIScience; Geocomputation; LAND-COVER; SOFTWARE;
D O I
10.1007/s10109-020-00334-2
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
This paper reflects on a number of trends towards a more open and reproducible approach to geographic and spatial data science over recent years. In particular, it considers trends towards Big Data, and the impacts this is having onspatialdata analysis and modelling. It identifies a turn in academia towards coding as a core analytic tool, and away from proprietary software tools offering 'black boxes' where the internal workings of the analysis are not revealed. It is argued that this closed form software is problematic and considers a number of ways in which issues identified in spatial data analysis (such as the MAUP) could be overlooked when working with closed tools, leading to problems of interpretation and possibly inappropriate actions and policies based on these. In addition, this paper considers the role that reproducible and open spatial science may play in such an approach, taking into account the issues raised. It highlights the dangers of failing to account for the geographical properties of data, now that all data are spatial (they are collected somewhere), the problems of a desire for n = all observations in data science and it identifies the need for a critical approach. This is one in which openness, transparency, sharing and reproducibility provide a mantra for defensible and robust spatial data science.
引用
收藏
页码:477 / 496
页数:20
相关论文
共 71 条
  • [31] Big Data, new epistemologies and paradigm shifts
    Kitchin, Rob
    [J]. BIG DATA & SOCIETY, 2014, 1 (01):
  • [32] Kitchin R, 2018, THINKING BIG DATA IN GEOGRAPHY: NEW REGIMES, NEW RESEARCH, P3
  • [33] Kuhn M., 2021, caret: Classification and Regression Training
  • [34] Laney D, 2001, META GROUP RES NOTE, V6, P1
  • [35] Leisch Friedrich., 2002, Sweave. Dynamic Generation of Statistical Reports Using Literate Data Analysis
  • [36] Crime mapping and the CrimeStat program
    Levine, N
    [J]. GEOGRAPHICAL ANALYSIS, 2006, 38 (01) : 41 - 56
  • [37] When open data is a Trojan Horse: The weaponization of transparency in science and governance
    Levy, Karen E. C.
    Johns, David Merritt
    [J]. BIG DATA & SOCIETY, 2016, 3 (01): : 1 - 6
  • [38] Li Z, 2018, 13 INT S SPAT ACC
  • [39] Lovelace R, 2020, Geocomputation with R, VFirst, DOI [10.1201/9780203730058, DOI 10.1201/9780203730058]
  • [40] Marr B., 2014, Big Data: The 5 Vs Everyone Must Know