Ask a Question

Prefer a chat interface with context about you and your work?

Bregman Gradient Policy Optimization

Bregman Gradient Policy Optimization

In the paper, we design a novel Bregman gradient policy optimization framework for reinforcement learning based on Bregman divergences and momentum techniques. Specifically, we propose a Bregman gradient policy optimization (BGPO) algorithm based on the basic momentum technique and mirror descent iteration. Meanwhile, we further propose an accelerated Bregman gradient …