Exploration and Explanation in Computational Notebooks

被引:188
作者
Rule, Adam [1 ]
Tabard, Aurelien [2 ]
Hollan, James D. [1 ]
机构
[1] Univ Calif San Diego, Design Lab, La Jolla, CA 92093 USA
[2] Univ Lyon, CNRS, LIRIS UMR5205, Lyon, France
来源
PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018) | 2018年
基金
美国国家科学基金会;
关键词
Computational notebook; Jupyter Notebook; data science; data analysis; narrative; VISUALIZATION;
D O I
10.1145/3173574.3173606
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Computational notebooks combine code, visualizations, and text in a single document. Researchers, data analysts, and even journalists are rapidly adopting this new medium. We present three studies of how they are using notebooks to document and share exploratory data analyses. In the first, we analyzed over 1 million computational notebooks on GitHub, finding that one in four had no explanatoiy text but consisted entirely of visualizations or code. In a second study, we examined over 200 academic computational notebooks, finding that although the vast majority described methods, only a minority discussed reasoning or results. In a third study, we interviewed 15 academic data analysts, finding that most considered computational notebooks personal, exploratory, and messy. Importantly, they typically used other media to share analyses. These studies demonstrate a tension between exploration and explanation in constructing and sharing computational notebooks. We conclude with opportunities to encourage explanation in computational media without hindering exploration.
引用
收藏
页数:12
相关论文
共 37 条
[1]   Managing Technical Debt [J].
Allman, Eric .
COMMUNICATIONS OF THE ACM, 2012, 55 (05) :50-55
[3]  
[Anonymous], 2015, PROJECT JUPYTER COMP
[4]  
Bostock Mike, 2017, A Better Way to Code
[5]  
Brown M, 2010, SMART SCIENCE FOR EXPLORATION AND MINING, VOL 1 AND 2, P47
[6]  
Cunningham W., 1993, OOPS Messenger, V4, P29
[7]   Dynamics and associations of microbial community types across the human body [J].
Ding, Tao ;
Schloss, Patrick D. .
NATURE, 2014, 509 (7500) :357-+
[8]  
Dunbar Brian., 2010, Nasa - shuttle computers navigate record of reliability
[9]  
Fowler M., 1999, Refactoring: Improving the design of existing code
[10]   What storytelling can do for information visualization [J].
Gershon, N ;
Page, W .
COMMUNICATIONS OF THE ACM, 2001, 44 (08) :31-37