TD3 Algorithm Based Reinforcement Learning Control for Multiple-Input Multiple-Output DC-DC Converters

Jian Ye, Huanyu Guo, Di Zhao, Benfei Wang, Xinan Zhang

Research output: Contribution to journalArticlepeer-review

Abstract

This article presents a reinforcement learning (RL) controller based on the twin delayed deep deterministic (TD3) policy gradient algorithm for single-inductor multiple-input multiple-output (SI-MIMO) dc-dc converters. The controller aims to address the power allocation challenges arising from parallel input sources and mitigate cross-regulation among multiple output channels. The objective is to enable the converter to exhibit outstanding performance in both steady-state and dynamic regulation during operation. The proposed RL controller is trained using the TD3 algorithm. It directly generates duty cycle control signals based on the observed states from the controlled converter. After applying input-side power allocation modulation and output-side time-multiplexing modulation, the controller generates the switching signals for each switch, completing the closed-loop control of the SI-MIMO converter. To validate the effectiveness of the proposed controller, stability analysis of the RL controller is conducted, and an experimental platform is established. Experimental results demonstrate that the proposed controller exhibits superior control performance. It effectively addresses the challenges associated with input-side power allocation and output voltage cross-regulation in SI-MIMO dc-dc converters.

Original languageEnglish
Pages (from-to)12729-12742
Number of pages14
JournalIEEE Transactions on Power Electronics
Volume39
Issue number10
DOIs
Publication statusPublished - Oct 2024

Fingerprint

Dive into the research topics of 'TD3 Algorithm Based Reinforcement Learning Control for Multiple-Input Multiple-Output DC-DC Converters'. Together they form a unique fingerprint.

Cite this