Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters

被引:215
作者
Agrawal A. [1 ]
Deshpande P.D. [2 ]
Cecen A. [3 ]
Basavarsu G.P. [2 ]
Choudhary A.N. [1 ]
Kalidindi S.R. [3 ,4 ]
机构
[1] Department of Electrical Engineering and Computer Science, Northwestern University, Evanston, IL
[2] Tata Research Development and Design Centre, Tata Consultancy Services, Pune, Maharashtra
[3] School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA
[4] Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, GA
基金
美国国家科学基金会;
关键词
Data mining; Materials informatics; Processing-property linkages; Regression analysis;
D O I
10.1186/2193-9772-3-8
中图分类号
学科分类号
摘要
This paper describes the use of data analytics tools for predicting the fatigue strength of steels. Several physics-based as well as data-driven approaches have been used to arrive at correlations between various properties of alloys and their compositions and manufacturing process parameters. Data-driven approaches are of significant interest to materials engineers especially in arriving at extreme value properties such as cyclic fatigue, where the current state-of-the-art physics based models have severe limitations. Unfortunately, there is limited amount of documented success in these efforts. In this paper, we explore the application of different data science techniques, including feature selection and predictive modeling, to the fatigue properties of steels, utilizing the data from the National Institute for Material Science (NIMS) public domain database, and present a systematic end-to-end framework for exploring materials informatics. Results demonstrate that several advanced data analytics techniques such as neural networks, decision trees, and multivariate polynomial regression can achieve significant improvement in the prediction accuracy over previous efforts, with R2 values over 0.97. The results have successfully demonstrated the utility of such data mining tools for ranking the composition and process parameters in the order of their potential for predicting fatigue strength of steels, and actually develop predictive models for the same. © 2014, Agrawal et al.; licensee Springer.
引用
收藏
页码:90 / 108
页数:18
相关论文
共 44 条
  • [1] Committee on Integrated Computational Materials Engineering N.R.C., Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security, (2008)
  • [2] Materials genome initiative for global competitiveness. Technical report, National Science and Technology Council, (2011)
  • [3] Kalidindi S.R., Niezgoda S.R., Salem A.A., Microstructure informatics using higher-order statistics and efficient data-mining protocols, JOM - J Minerals, Met Mater Soc, 63, 4, pp. 40-41, (2011)
  • [4] Rajan K., Materials informatics, Materials Today, 8, 10, pp. 38-45, (2005)
  • [5] Hey T., Tansley S., Tolle K., The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, 1st edition, (2009)
  • [6] Linden G., Smith B., York J., Amazon.com recommendations: item-to-item collaborative filtering, Internet Comput IEEE, 7, 1, pp. 76-80, (2003)
  • [7] Mobasher B., Data mining for web personalization, Brusilovsky P, Kobsa A, Nejdl W (eds) The adaptive web. Lecture Notes in Computer Science, vol. 4321, pp. 90-135, (2007)
  • [8] Zhou Y., Wilkinson D., Schreiber R., Pan R., Large-scale parallel collaborative filtering for the netflix prize, Proceedings of the 4th International Conference on Algorithmic Aspects in Information and Management, pp. 337-348, (2008)
  • [9] Das A.S., Datar M., Garg A., Rajaram S., Google news personalization: Scalable online collaborative filtering, Proceedings of the 16th International Conference on World Wide Web, pp. 271-280, (2007)
  • [10] URL: Walmart is Making Big Data Part of Its DNA. Bigdata Startups, (2013)