Full Content is available to subscribers

Subscribe/Learn More  >

Reinforcement Learning Based Real-Time Control Policy for Two-Machine-One-Buffer Production System

[+] Author Affiliations
Wei Zheng, Yong Lei

Zhejiang University, Hanzhou, China

Qing Chang

SUNY – Stony Brook University, Stony Brook, NY

Paper No. MSEC2017-2771, pp. V003T04A005; 9 pages
  • ASME 2017 12th International Manufacturing Science and Engineering Conference collocated with the JSME/ASME 2017 6th International Conference on Materials and Processing
  • Volume 3: Manufacturing Equipment and Systems
  • Los Angeles, California, USA, June 4–8, 2017
  • Conference Sponsors: Manufacturing Engineering Division
  • ISBN: 978-0-7918-5074-9
  • Copyright © 2017 by ASME


It is attractive to reduce the total cost of a manufacture system with real-time control of the production. The total cost mainly consists of the production cost, the penalty of the permanent production loss, and the Work-In-Process (WIP) inventory level cost. However, it is difficult to derive an analytical model of manufacture system due to the complexity of starved and blocked phenomena, the random failure and maintenance processes. Therefore, finding a real-time control policy for the manufacture system without exact analytical model is dearly needed. In this paper, a novel reinforcement learning based control decision policy is proposed based on the action of switching the machines on or off at the start of each time slot. Firstly, a simulation model is developed with MTBF and MTTR evaluated from the history data to collect samples. Then, a reinforcement learning method, specifically, Least-Square-Policy-Iteration method, is applied to obtain a sub-optimal policy. The simulation results show that the proposed method performs well in reducing the total cost.

Copyright © 2017 by ASME



Interactive Graphics


Country-Specific Mortality and Growth Failure in Infancy and Yound Children and Association With Material Stature

Use interactive graphics and maps to view and sort country-specific infant and early dhildhood mortality and growth failure data and their association with maternal

Citing articles are presented as examples only. In non-demo SCM6 implementation, integration with CrossRef’s "Cited By" API will populate this tab (http://www.crossref.org/citedby.html).

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging and repositioning the boxes below.

Related eBook Content
Topic Collections

Sorry! You do not have access to this content. For assistance or to subscribe, please contact us:

  • TELEPHONE: 1-800-843-2763 (Toll-free in the USA)
  • EMAIL: asmedigitalcollection@asme.org
Sign In