From machine learning to machine reasoning: An essay

Cited by: 107
Authors
Bottou, Leon [1]
Affiliation
[1] Microsoft Res, Redmond, WA 98052 USA
Keywords
Machine learning; Reasoning; Recursive networks
DOI
10.1007/s10994-013-5335-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A plausible definition of "reasoning" could be "algebraically manipulating previously acquired knowledge in order to answer a new question". This definition covers first-order logical inference or probabilistic inference. It also includes much simpler manipulations commonly used to build large learning systems. For instance, we can build an optical character recognition system by first training a character segmenter, an isolated character recognizer, and a language model, using appropriate labelled training sets. Adequately concatenating these modules and fine tuning the resulting system can be viewed as an algebraic operation in a space of models. The resulting model answers a new question, that is, converting the image of a text page into a computer readable text. This observation suggests a conceptual continuity between algebraically rich inference systems, such as logical or probabilistic inference, and simple manipulations, such as the mere concatenation of trainable learning systems. Therefore, instead of trying to bridge the gap between machine learning systems and sophisticated "all-purpose" inference mechanisms, we can instead algebraically enrich the set of manipulations applicable to training systems, and build reasoning capabilities from the ground up.
Pages: 133-149
Number of pages: 17