Evaluation of machine learning interpolation techniques for prediction of physical properties

被引:44
作者
Belisle, Eve [1 ]
Huang, Zi [1 ]
Le Digabel, Sebastien [3 ,4 ]
Gheribi, Aimen E. [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Ecole Polytech, Dept Chem Engn, CRCT Ctr Res Computat Thermochem, Montreal, PQ H3C 3A7, Canada
[3] Ecole Polytech, Gerad, Montreal, PQ H3C 3A7, Canada
[4] Ecole Polytech, Dept Math & Ind Engn, Montreal, PQ H3C 3A7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Superalloys; Database; Gaussian process; Neural network; Quadratic regression; Physical properties; Computational dependence; HETEROGENEOUS MARTENSITIC NUCLEATION; LINEAR INTERPOLATION; START TEMPERATURE; NEURAL-NETWORKS; REGRESSION; MODELS; SOFTWARE; KINETICS; DESIGN;
D O I
10.1016/j.commatsci.2014.10.032
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A knowledge of the physical properties of materials as a function of temperature, composition, applied external stresses, etc. is an important consideration in materials and process design. For new systems, such properties may be unknown and hard to measure or estimate from numerical simulations such as molecular dynamics. Engineers rely on machine learning to employ existing data in order to predict properties for new systems. Several techniques are currently used for such purposes. These include neural network, polynomial interpolation and Gaussian processes as well as the more recent dynamic trees and scalable Gaussian processes. In this paper we compare these approaches for three sets of materials sciences data: molar volume, electrical conductivity and Martensite start temperature. We make recommendations depending on the nature of the data. We demonstrate that a thorough knowledge of the problem beforehand is critical in selecting the most successful machine learning technique. Our findings show that the Gaussian process regression technique gives very good predictions for all three sets of tested data. Typically, Gaussian process is very slow with a computational complexity of typically n(3) where n is the number of data points. In this paper, we found that the scalable Gaussian process approach was able to maintain the high accuracy of the predictions while improving speed considerably, make on-line learning possible. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:170 / 177
页数:8
相关论文
共 49 条
[1]  
[Anonymous], 2001, J. Am. Stat. Assoc.
[2]  
[Anonymous], PREDICTING MARTENSIT
[3]  
BAILERJONES C, 1999, MAT SCI TECHNOL, V15
[4]   FactSage thermochemical software and databases [J].
Bale, C ;
Chartrand, P ;
Degterov, SA ;
Eriksson, G ;
Hack, K ;
Ben Mahfoud, R ;
Melançon, J ;
Pelton, AD ;
Petersen, S .
CALPHAD-COMPUTER COUPLING OF PHASE DIAGRAMS AND THERMOCHEMISTRY, 2002, 26 (02) :189-228
[5]   FactSage thermochemical software and databases - recent developments [J].
Bale, C. W. ;
Belisle, E. ;
Chartrand, P. ;
Decterov, S. A. ;
Eriksson, G. ;
Hack, K. ;
Jung, I. -H. ;
Kang, Y. -B. ;
Melancon, J. ;
Pelton, A. D. ;
Robelin, C. ;
Petersen, S. .
CALPHAD-COMPUTER COUPLING OF PHASE DIAGRAMS AND THERMOCHEMISTRY, 2009, 33 (02) :295-311
[6]  
BELISLE E, SCALABLE GAUSSIAN PR
[7]  
Bélisle E, 2014, LECT NOTES COMPUT SC, V8506, P38
[8]   DETERMINATION OF HISTAMINE-PC20 - COMPARISON OF LINEAR AND LOGARITHMIC INTERPOLATION [J].
COCKCROFT, DW ;
MURDOCK, KY ;
MINK, JT .
CHEST, 1983, 84 (04) :505-506
[9]  
Dai TS, 2007, LECT NOTES COMPUT SC, V4508, P262
[10]   Quadratic interpolation for image resampling [J].
Dodgson, NA .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1997, 6 (09) :1322-1326