Towards Reliable Deep Reinforcement Learning for Industrial Applications: A DDPG-based Algorithm with Improved Performance

Dolati, Mahdi; Sayyaf, Negin

doi:10.22060/ajme.2025.24180.6181

	Towards Reliable Deep Reinforcement Learning for Industrial Applications: A DDPG-based Algorithm with Improved Performance
AUT Journal of Mechanical Engineering
مقاله 4، دوره 10، شماره 1، بهار 2026، صفحه 61-74 اصل مقاله (1.32 M)
نوع مقاله: Research Article
شناسه دیجیتال (DOI): 10.22060/ajme.2025.24180.6181
نویسندگان
Mahdi Dolati؛ Negin Sayyaf^*
Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran
چکیده
This paper proposes an Improved Model-Based Deep Deterministic Policy Gradient, a novel reinforcement learning algorithm designed to overcome three critical challenges in industrial deep reinforcement learning applications: (1) poor sample efficiency requiring excessive real-world trials, (2) safety risks from unstable policies during training, and (3) difficulty scaling to high-dimensional continuous control spaces. Building on DDPG's strengths for continuous control, the proposed algorithm introduces four key innovations: (i) a virtual environment for data-efficient learning, (ii) a simulation rate mechanism adapting model reliance dynamically, (iii) a simulated experience buffer preventing divergence, and (iv) a performance threshold for fail-safe operation. Evaluated on the Cart-Pole benchmark via the OpenAI Gym Python library, the suggested method demonstrates faster convergence than standard DDPG while maintaining performance degradation under sensor malfunctions or communication losses. These improvements derive from the algorithm's unique ability to simultaneously leverage real-world data and model-generated experiences, reducing physical trial costs while ensuring operational safety. The results establish the novel framework as a practical solution for industrial control systems where reliability and data efficiency are paramount, particularly in applications like chemical process control and precision robotics that demand stable operation amid sensor/communication failures.
کلیدواژه‌ها
Deep Reinforcement Learning؛ Model-Based Method؛ Deep Deterministic Policy Gradient؛ Industrial Applications؛ System Identification
مراجع

آمار تعداد مشاهده مقاله: 449 تعداد دریافت فایل اصل مقاله: 281

پیوندهای مفید

دانشگاه صنعتی امیرکبیر

آمار

تعداد نشریات	9
تعداد شماره‌ها	461
تعداد مقالات	5,795
تعداد مشاهده مقاله	8,580,630
تعداد دریافت فایل اصل مقاله	7,117,164