Supervised tensor learning

被引:268
作者
Dacheng Tao [1 ]
Xuelong Li
Xindong Wu
Weiming Hu
Stephen J. Maybank
机构
[1] Univ London, Sch Comp Sci & Informat Syst, London, England
[2] Univ Vermont, Dept Comp Sci, Burlington, VT USA
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Regnit, Beijing, Peoples R China
关键词
convex optimization; supervised learning; tensor; alternating projection;
D O I
10.1007/s10115-006-0050-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tensor representation is helpful to reduce the small sample size problem in discriminative subspace selection. As pointed by this paper, this is mainly because the structure information of objects in computer vision research is a reasonable constraint to reduce the number of unknown parameters used to represent a learning model. Therefore, we apply this information to the vector-based learning and generalize the vector-based learning to the tensor-based learning as the supervised tensor learning (STL) framework, which accepts tensors as input. To obtain the solution of STL, the alternating projection optimization procedure is developed. The STL framework is a combination of the convex optimization and the operations in multilinear algebra. The tensor representation helps reduce the overfitting problem in vector-based learning. Based on STL and its alternating projection optimization procedure, we generalize support vector machines, minimax probability machine, Fisher discriminant analysis, and distance metric learning, to support tensor machines, tensor minimax probability machine, tensor Fisher discriminant analysis, and the multiple distance metrics learning, respectively. We also study the iterative procedure for feature extraction within STL. To examine the effectiveness of STL, we implement the tensor minimax probability machine for image classification. By comparing with minimax probability machine, the tensor version reduces the overfitting problem.
引用
收藏
页码:1 / 42
页数:42
相关论文
共 55 条
[1]   Semi-supervised learning with an imperfect supervisor [J].
Amini, MR ;
Gallinari, P .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (04) :385-413
[2]  
[Anonymous], 1998, ADV KERNEL METHODS S
[3]  
[Anonymous], P KDD 2001 KNOWL DIS
[4]  
[Anonymous], 2002, Least Squares Support Vector Machines
[5]  
BOYD D, 2006, TUTORIAL GEOMETRIC P
[6]  
Boyd D.L., 2004, CONVEX OPTIMIZATION
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]  
Duda RO, 2006, PATTERN CLASSIFICATI
[9]   Discriminant analysis for recognition of human face images [J].
Etemad, K ;
Chellappa, R .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1997, 14 (08) :1724-1733
[10]   The statistical utilization of multiple measurements [J].
Fisher, RA .
ANNALS OF EUGENICS, 1938, 8 :376-386