Variolite: Supporting Exploratory Programming by Data Scientists

被引:73
作者
Kery, Mary Beth [1 ]
Horvath, Amber [2 ]
Myers, Brad [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Oregon State Univ, Corvallis, OR 97331 USA
来源
PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17) | 2017年
基金
美国国家科学基金会;
关键词
End-User Programming; Version Control Systems (VCS); Exploratory Data Analysis; Variants; Variations;
D O I
10.1145/3025453.3025626
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
How do people ideate through code? Using semi-structured interviews and a survey, we studied data scientists who program, often with small scripts, to experiment with data. These studies show that data scientists frequently code new analysis ideas by building off of their code from a previous idea. They often rely on informal versioning interactions like copying code, keeping unused code, and commenting out code to repurpose older analysis code while attempting to keep those older analyses intact. Unlike conventional version control, these informal practices allow for fast versioning of any size code snippet, and quick comparisons by interchanging which versions are run. However, data scientists must maintain a strong mental map of their code in order to distinguish versions, leading to errors and confusion. We explore the needs for improving version control tools for exploratory tasks, and demonstrate a tool for lightweight local versioning, called Variolite, which programmers found usable and desirable in a preliminary usability study.
引用
收藏
页码:1265 / 1276
页数:12
相关论文
共 32 条
[1]   Computational discovery of gene modules and regulatory networks [J].
Bar-Joseph, Z ;
Gerber, GK ;
Lee, TI ;
Rinaldi, NJ ;
Yoo, JY ;
Robert, F ;
Gordon, DB ;
Fraenkel, E ;
Jaakkola, TS ;
Young, RA ;
Gifford, DK .
NATURE BIOTECHNOLOGY, 2003, 21 (11) :1337-1342
[2]  
Carroll J.M., 1987, PARADOX ACTIVE USER
[3]  
Clements P., 2001, SOFTWARE PRODUCT LIN
[4]  
Davenport TH, 2012, HARVARD BUS REV, V90, P70
[5]  
DELINE R, 2015, VIS LANG HUM CENTR C, P111
[6]  
Guo Philip J., 2012, THESIS STANFORD U
[7]  
Guo Philip J, 2012, P 12 USENIX WORKSH T, P7
[8]  
HARTMANN B, 2008, P 21 ANN ACM S US IN, P91
[9]  
Hawley Robert, 1987, ARTIFICIAL INTELLIGE
[10]  
Hill C, 2016, S VIS LANG HUM CEN C, P162, DOI 10.1109/VLHCC.2016.7739680