Ask a Question

Prefer a chat interface with context about you and your work?

Best Arm Identification with Minimal Regret

Best Arm Identification with Minimal Regret

Motivated by real-world applications that necessitate responsible experimentation, we introduce the problem of best arm identification (BAI) with minimal regret. This innovative variant of the multi-armed bandit problem elegantly amalgamates two of its most ubiquitous objectives: regret minimization and BAI. More precisely, the agent's goal is to identify the best …