Distributed reinforcement learning control for batch sequencing and sizing in Just-In-Time manufacturing systems

Cited by: 28

Authors:

Hong, JK [1]

Prabhu, VV [1]

Affiliation:

[1] Penn State Univ, Dept Ind & Mfg Engn, University Pk, PA 16802 USA

Source:

APPLIED INTELLIGENCE | 2004 / Vol. 20 / No. 01

Funding:

U.S. National Science Foundation;

Keywords:

machine learning; scheduling; Just-In-Time production;

DOI:

10.1023/B:APIN.0000011143.95085.74

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents an approach suited to Just-In-Time (JIT) production for multi-objective scheduling problems in dynamically changing shop floor environments. The proposed distributed learning and control (DLC) approach integrates part-driven distributed arrival time control (DATC) with machine-driven, distributed reinforcement-learning-based control. With DATC, part controllers adjust their associated parts' arrival times to minimize due-date deviation. Within the resulting restricted pattern of arrivals, machine controllers concurrently search for optimal dispatching policies. The machine control problem is modeled as a Semi-Markov Decision Process (SMDP) and solved using Q-learning. The DLC algorithms are evaluated by simulation for two types of manufacturing systems: family scheduling and dynamic batch sizing. Results show that DLC algorithms achieve significant performance improvement over usual dispatching rules in complex real-time shop floor control problems for JIT production.
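The abstract states that the machine control problem is modeled as an SMDP and solved with Q-learning, but it does not give the update rule. As a minimal sketch only, the block below shows one common textbook form of a Q-learning update for an SMDP, in which the discount is raised to the power of the sojourn time between decisions. The function name `smdp_q_update`, the state and action labels, the reward definition, and the parameter values are all illustrative assumptions, not the authors' formulation.

```python
from collections import defaultdict


def smdp_q_update(q, state, action, reward, sojourn_time,
                  next_state, next_actions, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update for an SMDP transition.

    Assumption: `reward` is the lump reward collected over the transition,
    and the future value is discounted by gamma ** sojourn_time, where
    sojourn_time is the elapsed time between two decision epochs.
    """
    # Greedy value of the next state (0.0 if no actions are available).
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + (gamma ** sojourn_time) * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# Hypothetical dispatching decision: the machine picks a part family,
# is busy for 1.5 time units, and receives a penalty of 2.0.
q = defaultdict(float)
new_value = smdp_q_update(
    q, "queue=3", "dispatch_family_A",
    reward=-2.0, sojourn_time=1.5,
    next_state="queue=2",
    next_actions=["dispatch_family_A", "dispatch_family_B"],
)
```

In a dispatching setting of the kind the abstract describes, the state might encode queue contents and the action a choice of the next part or family to process; those encodings are design choices that the abstract leaves unspecified.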

References

Pages: 71-87

Page count: 17

38 entries in total

[31] TSITSIKLIS JN, 1994, MACH LEARN, V16, P185, DOI 10.1007/BF00993306

[32] WATKINS CJC, 1992, MACH LEARN, V8, P279

[33] WATKINS CJC, 1989, THESIS CAMBRIDGE U C

[34] WANG G, 1999, INT C MACHINE LEARNI

[35] WANG G, 1998, NIPS 98 WORKSH ABSTR

[36] SCHEDULING GROUPS OF JOBS ON A SINGLE-MACHINE [J]. WEBSTER, S; BAKER, KR. OPERATIONS RESEARCH, 1995, 43 (04): 692-703

[37] SEMI-MARKOV DECISION-MODELS FOR REAL-TIME SCHEDULING [J]. YIH, Y; THESEN, A. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1991, 29 (11): 2331-2346

[38] Zhang W, 1996, ADV NEUR IN, V8, P1024
