Distributed reinforcement learning control for batch sequencing and sizing in Just-In-Time manufacturing systems

Cited by: 28
Authors
Hong, JK [1]
Prabhu, VV [1]
Affiliations
[1] Penn State Univ, Dept Ind & Mfg Engn, University Pk, PA 16802 USA
Funding
National Science Foundation (USA);
Keywords
machine learning; scheduling; Just-In-Time production;
DOI
10.1023/B:APIN.0000011143.95085.74
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents an approach to multi-objective scheduling for Just-In-Time (JIT) production in a dynamically changing shop-floor environment. The proposed distributed learning and control (DLC) approach integrates part-driven distributed arrival time control (DATC) with machine-driven control based on distributed reinforcement learning. With DATC, part controllers adjust the arrival times of their associated parts to minimize due-date deviation. Within the resulting restricted pattern of arrivals, machine controllers concurrently search for optimal dispatching policies. The machine control problem is modeled as a semi-Markov decision process (SMDP) and solved using Q-learning. The DLC algorithms are evaluated by simulation for two types of manufacturing systems: family scheduling and dynamic batch sizing. The results show that the DLC algorithms achieve significant performance improvements over conventional dispatching rules in complex real-time shop-floor control problems for JIT production.
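To make the phrase "modeled as an SMDP and solved using Q-learning" concrete, the following is a minimal sketch of one common discounted Q-learning update for a semi-Markov decision process, in which the discount depends on the time elapsed between decisions. It is not taken from the paper: the function name `smdp_q_update`, the state and action labels, the lump-sum treatment of the reward, and the parameters `alpha` (learning rate) and `beta` (continuous-time discount rate) are all assumptions chosen for illustration.

```python
import math
from collections import defaultdict


def smdp_q_update(q, state, action, reward, sojourn_time, next_state,
                  next_actions, alpha=0.1, beta=0.05):
    """Apply one tabular Q-learning update for an SMDP transition.

    Hypothetical helper, not the authors' algorithm. The reward is treated
    as a lump sum received for the transition, and the value of the next
    state is discounted by exp(-beta * sojourn_time), so longer gaps
    between decisions are discounted more heavily.
    """
    discount = math.exp(-beta * sojourn_time)
    # Greedy value of the next state; 0.0 if no action is available there.
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + discount * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# Illustrative use: a machine controller picks which part family to dispatch.
q_table = defaultdict(float)  # unseen (state, action) pairs start at 0.0
updated = smdp_q_update(
    q_table,
    state="queue_A",                 # assumed state label
    action="dispatch_family_1",      # assumed action label
    reward=-2.0,                     # e.g. a penalty for due-date deviation
    sojourn_time=3.0,                # time until the next decision epoch
    next_state="queue_B",
    next_actions=["dispatch_family_1", "dispatch_family_2"],
)
```

In a dispatching setting, such an update would typically run each time a machine becomes free and must choose the next job or batch; how states, actions, and rewards are actually defined in the DLC approach is specified in the paper itself, not here.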
Pages: 71-87
Number of pages: 17