Data Science and Prediction

Cited by: 460
Author
Dhar, Vasant [1]
Affiliation
[1] NYU, Stern Sch Business, Ctr Business Analyt, New York, NY 10012 USA
DOI
10.1145/2500499
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Vasant Dhar argues that data science, or big data, is gaining significance because of its potential to automate the creation of actionable knowledge and to produce predictive models for use by both humans and computers. Data science is the systematic study of the organization, properties, and analysis of data and of its role in inference, including confidence in the inference. It differs from statistics and other existing disciplines in several important ways. The emphasis on prediction is particularly strong in the machine learning and knowledge discovery in databases (KDD) communities. This emphasis on predictive accuracy implicitly favors 'simple' theories over more complex ones, because the accuracy of sparser models tends to be more robust on future data. Predictive accuracy on observations that will occur in the future is a key requirement in data science.
Pages: 64-73
Page count: 10