
تعداد نشریات | 8 |
تعداد شمارهها | 427 |
تعداد مقالات | 5,564 |
تعداد مشاهده مقاله | 6,727,740 |
تعداد دریافت فایل اصل مقاله | 5,687,276 |
Towards Reliable Deep Reinforcement Learning for Industrial Applications: A DDPG-based Algorithm with Improved Performance | ||
AUT Journal of Mechanical Engineering | ||
مقالات آماده انتشار، پذیرفته شده، انتشار آنلاین از تاریخ 27 شهریور 1404 | ||
نوع مقاله: Research Article | ||
شناسه دیجیتال (DOI): 10.22060/ajme.2025.24180.6181 | ||
نویسندگان | ||
Mahdi Dolati؛ Negin Sayyaf* | ||
Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran | ||
چکیده | ||
This paper proposes Improved Model-Based Deep Deterministic Policy Gradient (IMB-DDPG), a novel reinforcement learning algorithm designed to overcome three critical challenges in industrial DRL applications: (1) poor sample efficiency requiring excessive real-world trials, (2) safety risks from unstable policies during training, and (3) difficulty scaling to high-dimensional continuous control spaces. Building on DDPG's strengths for continuous control, IMB-DDPG introduces four key innovations: (i) a Virtual Environment (VE) for data-efficient learning, (ii) a Simulation Rate (SR) mechanism adapting model reliance dynamically, (iii) a Simulated Experience Buffer (SEB) preventing divergence, and (iv) a Performance Threshold (PT) for fail-safe operation. Evaluated on Cart-Pole benchmark via OpenAI Gym python library, IMB-DDPG demonstrates faster convergence than standard DDPG while maintaining performance degradation under sensor malfunctions or communication losses. These improvements derive from the algorithm's unique ability to simultaneously leverage real-world data and model-generated experiences, reducing physical trial costs while ensuring operational safety. The results establish IMB-DDPG as a practical solution for industrial control systems where reliability and data efficiency are paramount, particularly in applications like chemical process control and precision robotics that demand stable operation amid sensor/communication failures. | ||
کلیدواژهها | ||
Deep Reinforcement Learning؛ Model-Based Method؛ Deep Deterministic Policy Gradient؛ Industrial Applications؛ System Identification | ||
آمار تعداد مشاهده مقاله: 1 |