TY - GEN
T1 - Model Predictive Control-Based Reinforcement Learning
AU - Han, Qiang
AU - Boussaid, Farid
AU - Bennamoun, Mohammed
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/7/2
Y1 - 2024/7/2
N2 - Reinforcement Learning (RL) has garnered much attention in the field of control due to its capacity to learn from interactions and adapt to complex and dynamic environments. However, RL is challenging because it needs to balance exploration, seeking new strategies, and exploitation, leveraging known strategies for maximum gain. To address these challenges, this paper proposes a Model Predictive Control (MPC) based RL approach, where the state value function in RL is utilized as the cost function in MPC, and the system dynamic model is represented by neural networks (NNs). This eliminates the need for human intervention and addresses inaccuracies in the system model. Additionally, MPC-guided RL accelerates convergence during RL training, thereby enhancing sample efficiency. Reported results demonstrate that the proposed method outperforms traditional RL algorithms and does not require prior knowledge of the system.
AB - Reinforcement Learning (RL) has garnered much attention in the field of control due to its capacity to learn from interactions and adapt to complex and dynamic environments. However, RL is challenging because it needs to balance exploration, seeking new strategies, and exploitation, leveraging known strategies for maximum gain. To address these challenges, this paper proposes a Model Predictive Control (MPC) based RL approach, where the state value function in RL is utilized as the cost function in MPC, and the system dynamic model is represented by neural networks (NNs). This eliminates the need for human intervention and addresses inaccuracies in the system model. Additionally, MPC-guided RL accelerates convergence during RL training, thereby enhancing sample efficiency. Reported results demonstrate that the proposed method outperforms traditional RL algorithms and does not require prior knowledge of the system.
KW - Deep reinforcement learning
KW - Model predictive control
UR - http://www.scopus.com/inward/record.url?scp=85198543571&partnerID=8YFLogxK
U2 - 10.1109/ISCAS58744.2024.10558623
DO - 10.1109/ISCAS58744.2024.10558623
M3 - Conference paper
AN - SCOPUS:85198543571
T3 - Proceedings - IEEE International Symposium on Circuits and Systems
BT - ISCAS 2024 - IEEE International Symposium on Circuits and Systems
PB - IEEE, Institute of Electrical and Electronics Engineers
CY - Piscataway
T2 - 2024 IEEE International Symposium on Circuits and Systems, ISCAS 2024
Y2 - 19 May 2024 through 22 May 2024
ER -