Towards Reliable Deep Reinforcement Learning for Industrial Applications: A DDPG-based Algorithm with Improved Performance | ||
| AUT Journal of Mechanical Engineering | ||
| مقاله 4، دوره 10، شماره 1، بهار 2026، صفحه 61-74 اصل مقاله (1.32 M) | ||
| نوع مقاله: Research Article | ||
| شناسه دیجیتال (DOI): 10.22060/ajme.2025.24180.6181 | ||
| نویسندگان | ||
| Mahdi Dolati؛ Negin Sayyaf* | ||
| Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran | ||
| چکیده | ||
| This paper proposes an Improved Model-Based Deep Deterministic Policy Gradient, a novel reinforcement learning algorithm designed to overcome three critical challenges in industrial deep reinforcement learning applications: (1) poor sample efficiency requiring excessive real-world trials, (2) safety risks from unstable policies during training, and (3) difficulty scaling to high-dimensional continuous control spaces. Building on DDPG's strengths for continuous control, the proposed algorithm introduces four key innovations: (i) a virtual environment for data-efficient learning, (ii) a simulation rate mechanism adapting model reliance dynamically, (iii) a simulated experience buffer preventing divergence, and (iv) a performance threshold for fail-safe operation. Evaluated on the Cart-Pole benchmark via the OpenAI Gym Python library, the suggested method demonstrates faster convergence than standard DDPG while maintaining performance degradation under sensor malfunctions or communication losses. These improvements derive from the algorithm's unique ability to simultaneously leverage real-world data and model-generated experiences, reducing physical trial costs while ensuring operational safety. The results establish the novel framework as a practical solution for industrial control systems where reliability and data efficiency are paramount, particularly in applications like chemical process control and precision robotics that demand stable operation amid sensor/communication failures. | ||
| کلیدواژهها | ||
| Deep Reinforcement Learning؛ Model-Based Method؛ Deep Deterministic Policy Gradient؛ Industrial Applications؛ System Identification | ||
| مراجع | ||
|
| ||
|
آمار تعداد مشاهده مقاله: 420 تعداد دریافت فایل اصل مقاله: 225 |
||
| تعداد نشریات | 9 |
| تعداد شمارهها | 455 |
| تعداد مقالات | 5,771 |
| تعداد مشاهده مقاله | 8,375,706 |
| تعداد دریافت فایل اصل مقاله | 6,934,637 |