SEMI-SUPERVISED TEXTUAL ANALYSIS AND HISTORICAL RESEARCH HELPING EACH OTHER: SOME THOUGHTS AND OBSERVATIONS

被引:6
作者
Nanni, Federico [1 ,2 ]
Kumper, Hiram [3 ]
Ponzetto, Simone Paolo [4 ]
机构
[1] Univ Bologna, Sci Technol & Soc, I-40126 Bologna, Italy
[2] Univ Mannheim, Mannheim, Germany
[3] Univ Mannheim, Late Medieval & Early Modern Hist, Mannheim, Germany
[4] Univ Mannheim, Semant Web Technol, Nat Language Proc & Informat Retrieval Grp, Mannheim, Germany
来源
INTERNATIONAL JOURNAL OF HUMANITIES AND ARTS COMPUTING-A JOURNAL OF DIGITAL HUMANITIES | 2016年 / 10卷 / 01期
关键词
semi-supervised methods; historical studies; data analysis; borndigital archives; HUMANITIES;
D O I
10.3366/ijhac.2016.0160
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Future historians will describe the rise of the World Wide Web as the turning point of their academic profession. As a matter of fact, thanks to an unprecedented amount of digitization projects and to the preservation of born-digital sources, for the first time they have at their disposal a gigantic collection of traces of our past. However, to understand trends and obtain useful insights from these very large amounts of data, historians will need more and more fine-grained techniques. This will be especially true if their objective will turn to hypothesis-testing studies, in order to build arguments by employing their deep in-domain expertise. For this reason, we focus our paper on a set of computational techniques, namely semi-supervised computational methods, which could potentially provide us with a methodological turning point for this change. As a matter of fact these approaches, due to their potential of affirming themselves as both knowledge and data driven at the same time, could become a solid alternative to some of the today most employed unsupervised techniques. However, historians who intend to employ them as evidences for supporting a claim, have to use computational methods not anymore as black boxes but as a series of well known methodological approaches. For this reason, we believe that if developing computational skills will be important for them, a solid background knowledge on the most important data analysis and results evaluation procedures will become far more capital.
引用
收藏
页码:63 / 77
页数:15
相关论文
共 32 条
  • [1] [Anonymous], P C EMP METH NAT LAN
  • [2] [Anonymous], 2009, NIPS
  • [3] [Anonymous], 2006, BOOK REV IEEE T NEUR
  • [4] [Anonymous], 2014, DIFFERENCES DIGITAL
  • [5] [Anonymous], 2008, UAI
  • [6] [Anonymous], PERPETUAL SUNRISE ME
  • [7] [Anonymous], 2012, Discovery and Justification are Different: Notes on Science-ing the Humanities
  • [8] [Anonymous], 2000, KDD WORKSH TEXT MIN
  • [9] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022