Active clustering with bandit feedback

We investigate the Active Clustering Problem (ACP). A learner interacts with an $N$-armed stochastic bandit with $d$-dimensional subGaussian feedback. There exists a hidden partition of the arms into $K$ groups such that arms within the same group share the same mean vector. The learner's task is to uncover this hidden …
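To make the setup concrete, here is a minimal simulation sketch of the environment described above: $N$ arms, a hidden assignment of arms to $K$ groups, and $d$-dimensional noisy feedback centred on the group's mean vector. It is only an illustration under stated assumptions. Gaussian noise is used as one particular instance of subGaussian noise, the group means are drawn at random, and the names `make_environment`, `pull`, and `noise_std` are hypothetical and do not come from the paper. The sketch covers the feedback model only, not any clustering strategy.

```python
import numpy as np


def make_environment(n_arms, n_groups, dim, seed=0):
    """Build a hidden partition of n_arms arms into n_groups groups.

    Returns (labels, group_means). Both are hidden from the learner.
    Assumes n_arms >= n_groups so that every group is non-empty.
    """
    rng = np.random.default_rng(seed)
    # Give each group one arm first, then assign the remaining arms at random.
    extra = rng.integers(0, n_groups, size=n_arms - n_groups)
    labels = np.concatenate([np.arange(n_groups), extra])
    rng.shuffle(labels)
    # Assumption: group mean vectors are drawn from a standard normal.
    group_means = rng.normal(size=(n_groups, dim))
    return labels, group_means


def pull(arm, labels, group_means, rng, noise_std=1.0):
    """Return one d-dimensional noisy observation for the chosen arm.

    Assumption: Gaussian noise, which is a special case of subGaussian noise.
    """
    mean = group_means[labels[arm]]
    return mean + noise_std * rng.normal(size=mean.shape[0])


# Usage: the learner only sees the outputs of pull(), never labels or group_means.
labels, group_means = make_environment(n_arms=12, n_groups=3, dim=2, seed=1)
rng = np.random.default_rng(2)
observation = pull(0, labels, group_means, rng)
```

Arms that share a label return observations with the same mean vector, which is exactly the structure the learner has to recover from the noisy pulls.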