| تعداد نشریات | 8 |
| تعداد شمارهها | 430 |
| تعداد مقالات | 5,597 |
| تعداد مشاهده مقاله | 7,094,450 |
| تعداد دریافت فایل اصل مقاله | 6,041,625 |
Towards Reliable Deep Reinforcement Learning for Industrial Applications: A DDPG-based Algorithm with Improved Performance | ||
| AUT Journal of Mechanical Engineering | ||
| مقالات آماده انتشار، پذیرفته شده، انتشار آنلاین از تاریخ 27 شهریور 1404 اصل مقاله (1.13 M) | ||
| نوع مقاله: Research Article | ||
| شناسه دیجیتال (DOI): 10.22060/ajme.2025.24180.6181 | ||
| نویسندگان | ||
| Mahdi Dolati؛ Negin Sayyaf* | ||
| Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran | ||
| چکیده | ||
| This paper proposes Improved Model-Based Deep Deterministic Policy Gradient (IMB-DDPG), a novel reinforcement learning algorithm designed to overcome three critical challenges in industrial DRL applications: (1) poor sample efficiency requiring excessive real-world trials, (2) safety risks from unstable policies during training, and (3) difficulty scaling to high-dimensional continuous control spaces. Building on DDPG's strengths for continuous control, IMB-DDPG introduces four key innovations: (i) a Virtual Environment (VE) for data-efficient learning, (ii) a Simulation Rate (SR) mechanism adapting model reliance dynamically, (iii) a Simulated Experience Buffer (SEB) preventing divergence, and (iv) a Performance Threshold (PT) for fail-safe operation. Evaluated on Cart-Pole benchmark via OpenAI Gym python library, IMB-DDPG demonstrates faster convergence than standard DDPG while maintaining performance degradation under sensor malfunctions or communication losses. These improvements derive from the algorithm's unique ability to simultaneously leverage real-world data and model-generated experiences, reducing physical trial costs while ensuring operational safety. The results establish IMB-DDPG as a practical solution for industrial control systems where reliability and data efficiency are paramount, particularly in applications like chemical process control and precision robotics that demand stable operation amid sensor/communication failures. | ||
| کلیدواژهها | ||
| Deep Reinforcement Learning؛ Model-Based Method؛ Deep Deterministic Policy Gradient؛ Industrial Applications؛ System Identification | ||
|
آمار تعداد مشاهده مقاله: 163 تعداد دریافت فایل اصل مقاله: 39 |
||