49 entries in total
[1]
Sahoo R.K., Oliner A.J., Rish I., et al., Critical event prediction for proactive management in large-scale computer clusters, Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), (2003)
[2]
Oliner A.J., Sahoo R.K., Moreira J.E., et al., Fault-aware job scheduling for BlueGene/L systems, Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS), (2004)
[3]
Salfner F., Lenk M., Malek M., A survey of online failure prediction methods, ACM Computing Surveys, 42, (2010)
[4]
Mickens J.W., Noble B.D., Exploiting availability prediction in distributed systems, Proceedings of USENIX Symposium on Networked Systems Design and Implementation (NSDI), (2006)
[5]
Fu S., Xu C., Quantifying event correlations for proactive failure management in networked computing systems, Journal of Parallel and Distributed Computing, 70, 11, pp. 1100-1109, (2010)
[6]
Gu J., Zheng Z., Lan Z., White J., Hocks E., Park B.-H., Dynamic meta-learning for failure prediction in large-scale systems: A case study, Proceedings of IEEE International Conference on Parallel Processing (ICPP), (2008)
[7]
Song H., Leangsuksun C., Nassar R., Availability modeling and analysis on high performance cluster computing systems, Proceedings of IEEE International Conference on Availability, Reliability and Security (ARES), (2006)
[8]
Han J., Data Mining: Concepts and Techniques, (2005)
[9]
Cover T., Thomas J., Elements of Information Theory, (1991)
[10]
Duda R.O., Hart P.E., Stork D.G., Pattern Classification, (2001)