Shedding Light on the Dark Data in the Long Tail of Science

被引:29
作者
Heidorn, P. Bryan [1 ]
机构
[1] Natl Sci Fdn, Div Biol Infrastruct, Arlington, VA USA
关键词
D O I
暂无
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
One of the primary outputs of the scientific enterprise is data, but many institutions such as libraries that are charged with preserving and disseminating scholarly output have largely ignored this form of documentation of scholarly activity. This paper focuses on it particularly troublesome Class Of data, termed dark data. "Dark data" is not carefully indexed and stored so it becomes nearly invisible to scientists and other potential users and therefore is more likely to remain underutilized and eventually lost. The article discusses the Concepts from long-tail economics Cart be used to understand potential solutions for better curation of this data. The paper describes why this data is critical to scientific progress, some of the Properties of this data, as well as some social and technical barriers to proper management of this class of data. Many potentially, useful institutional, social, and technical solutions are under development and are introduced in the last sections of the paper, but these solutions are largely unprove and require additional research and development.
引用
收藏
页码:280 / 299
页数:20
相关论文
共 21 条
  • [1] ALTMAN M, 2007, D LIB MAGAZINE MAR, V13
  • [2] Anderson C., 2004, WIRED MAGAZINE, V12
  • [3] [Anonymous], 2006, CHRON HIGHER EDUC
  • [4] [Anonymous], 2006, LIB LONG TAIL SOME T
  • [5] [Anonymous], 2008, NATURE
  • [6] An e-science environment for Service Crystallography - from submission to dissemination
    Coles, Simon J.
    Frey, Jeremy G.
    Hursthouse, Michel B.
    Light, Mark E.
    Milsted, Andrew J.
    Carr, Leslie A.
    DeRoure, David
    Gutteridge, Christopher J.
    Mills, Hugo R.
    Meacham, Ken E.
    Surridge, Michael
    Lyon, Elizabeth
    Heery, Rachel
    Duke, Monica
    Day, Michael
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (03) : 1006 - 1016
  • [7] Datasets, a shift in the currency of scholarly communication: Implications for library collections and acquisitions
    Davis, Hilary M.
    Vickery, John N.
    [J]. SERIALS REVIEW, 2007, 33 (01) : 26 - 32
  • [8] GOETZ T, 2007, WIRED MAGAZINE, V15
  • [9] Free online availability substantially increases a paper's impact
    Lawrence, S
    [J]. NATURE, 2001, 411 (6837) : 521 - 521
  • [10] Growth dynamics of scholarly and scientific journals
    Mabe, M
    Amin, M
    [J]. SCIENTOMETRICS, 2001, 51 (01) : 147 - 162