A General Framework for Clustering and Distribution Matching with Bandit
Feedback
A General Framework for Clustering and Distribution Matching with Bandit
Feedback
We develop a general framework for clustering and distribution matching problems with bandit feedback. We consider a $K$-armed bandit model where some subset of $K$ arms is partitioned into $M$ groups. Within each group, the random variable associated to each arm follows the same distribution on a finite alphabet. At …