Ordinal extreme learning machine

被引:77
作者
Deng, Wan-Yu [1 ,2 ,3 ]
Zheng, Qing-Hua [1 ,2 ]
Lian, Shiguo [4 ]
Chen, Lin [3 ]
Wang, Xin [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci, MOE KLINNS Lab, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Dept Comp Sci, SKLMS Lab, Xian 710049, Peoples R China
[3] Xian Inst Post & Telecommun, Xian 710121, Peoples R China
[4] France Telecom R&D, Orange Labs, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Ordinal regression; Extreme learning machine; Error correcting output codes; CLASSIFICATION;
D O I
10.1016/j.neucom.2010.08.022
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Recently a new fast learning algorithm called Extreme Learning Machine (ELM) has been developed for Single-Hidden Layer Feedforward Networks (SLFNs) in G -B Huang Q-Y Zhu and C -K Slew [Extreme learning machine theory and applications Neurocomputing 70 (2006) 489-501] And ELM has been successfully applied to many classification and regression problems In this paper the ELM algorithm is further studied for ordinal regression problems (named ORELM) We firstly proposed an encoding-based framework for ordinal regression which Includes three encoding schemes single multi-output classifier multiple binary-classifications with one-against-all (OAA) decomposition method and one-against-one (OAO) method Then the SLFN was redesigned for ordinal egression problems based on the proposed framework and the algorithms are trained by the extreme learning machine in which input weights are assigned randomly and output weights can be decided analytically lastly widely experiments on three kinds of datasets were carried to test the proposed algorithm The comparative results with such traditional methods as Gaussian Process for Ordinal Regression (ORGP) and Support Vector for Ordinal Regression (ORSVM) show that ORELM can obtain extremely rapid training speed and good generalization ability Especially when the data set s scalability increases the advantage of ORELM will become more apparent Additionally ORELM has the following advantages including the capabilities of learning in both online and batch modes and handling non-linear data (C) 2010 Elsevier B V All rights reserved
引用
收藏
页码:447 / 456
页数:10
相关论文
共 36 条
[1]
Learnability of bipartite ranking functions [J].
Agarwal, S ;
Roth, D .
LEARNING THEORY, PROCEEDINGS, 2005, 3559 :16-31
[2]
Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[3]
[Anonymous], 2005, Advances in neural information processing systems
[4]
[Anonymous], 2004, P 21 INT C MACHINE L
[5]
[Anonymous], 2003, Journal of machine learning research
[6]
[Anonymous], 2002, Matrices: Theory and Applications
[7]
[Anonymous], 2002, P ACM SIGKDD KDD 200, DOI 10.1145/775047.775067
[8]
Burges C. J. C., 2006, P ADV NEUR INF PROC, V19, P193
[9]
Burges Chris, 2005, P 22 INT C MACHINE L, P89, DOI DOI 10.1145/1102351.1102363
[10]
A Neural Network Approach to Ordinal Regression [J].
Cheng, Jianlin ;
Wang, Zheng ;
Pollastri, Gianluca .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1279-1284