Safe Reinforcement Learning for Strategic Bidding of Virtual Power Plants in Day-Ahead Markets
Safe Reinforcement Learning for Strategic Bidding of Virtual Power Plants in Day-Ahead Markets
This paper presents a novel safe reinforcement learning algorithm for strategic bidding of Virtual Power Plants (VPPs) in day-ahead electricity markets. The proposed algorithm utilizes the Deep Deterministic Policy Gradient (DDPG) method to learn competitive bidding policies without requiring an accurate market model. Furthermore, to account for the complex internal …