Ask a Question

Prefer a chat interface with context about you and your work?

Stochastic bandits with arm-dependent delays

Stochastic bandits with arm-dependent delays

Significant work has been recently dedicated to the stochastic delayed bandit setting because of its relevance in applications. The applicability of existing algorithms is however restricted by the fact that strong assumptions are often made on the delay distributions, such as full observability, restrictive shape constraints, or uniformity over arms. …