Efficient Contextual Bandits with Uninformed Feedback Graphs
Efficient Contextual Bandits with Uninformed Feedback Graphs
Bandits with feedback graphs are powerful online learning models that interpolate between the full information and classic bandit problems, capturing many real-life applications. A recent work by Zhang et al. (2023) studies the contextual version of this problem and proposes an efficient and optimal algorithm via a reduction to online …