Ask a Question

Prefer a chat interface with context about you and your work?

Fast Rates for Bandit PAC Multiclass Classification

Fast Rates for Bandit PAC Multiclass Classification

We study multiclass PAC learning with bandit feedback, where inputs are classified into one of $K$ possible labels and feedback is limited to whether or not the predicted labels are correct. Our main contribution is in designing a novel learning algorithm for the agnostic $(\varepsilon,\delta)$-PAC version of the problem, with …