Git can facilitate greater reproducibility and increased transparency in science

Cited by: 109
Author
Ram, Karthik [1]
Affiliation
[1] Univ Calif Berkeley, Environm Sci Policy & Management, Berkeley, CA 94720 USA
Source
SOURCE CODE FOR BIOLOGY AND MEDICINE | 2013 / Vol. 8 / Issue 01
Funding
U.S. National Science Foundation;
Keywords
Reproducible research; Version control; Open science;
DOI
10.1186/1751-0473-8-7
Chinese Library Classification
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Background: Reproducibility is the hallmark of good science. Maintaining a high degree of transparency in scientific reporting is essential not just for gaining trust and credibility within the scientific community but also for facilitating the development of new ideas. Sharing data and computer code associated with publications is becoming increasingly common, motivated partly by data deposition requirements from journals and mandates from funders. Despite this increase in transparency, it is still difficult to reproduce or build upon the findings of most scientific publications without access to a more complete workflow.
Findings: Version control systems (VCS), which have long been used to maintain code repositories in the software industry, are now finding new applications in science. One such open source VCS, Git, provides a lightweight yet robust framework that is ideal for managing the full suite of research outputs such as datasets, statistical code, figures, lab notes, and manuscripts. For individual researchers, Git provides a powerful way to track and compare versions, retrace errors, and explore new approaches in a structured manner, all while maintaining a full audit trail. For larger collaborative efforts, Git and Git hosting services make it possible for everyone to work asynchronously and merge their contributions at any time, all the while maintaining a complete authorship trail. In this paper I provide an overview of Git along with use-cases that highlight how this tool can be leveraged to make science more reproducible and transparent, foster new collaborations, and support novel uses.
Pages: 8