Ask a Question

Prefer a chat interface with context about you and your work?

A General Framework for Clustering and Distribution Matching with Bandit Feedback

A General Framework for Clustering and Distribution Matching with Bandit Feedback

We develop a general framework for clustering and distribution matching problems with bandit feedback. We consider a $K$-armed bandit model where some subset of $K$ arms is partitioned into $M$ groups. Within each group, the random variable associated to each arm follows the same distribution on a finite alphabet. At …