SU\G(𝔸)/K·U

Ask a Question

Related Paper

p-Mean Regret for Stochastic Bandits

In this work, we extend the concept of the $p$-mean welfare objective from social choice theory (Moulin 2004) to study $p$-mean regret in stochastic multi-armed bandit problems. The $p$-mean regret, defined as the difference between the optimal mean among the arms and the $p$-mean of the expected rewards, offers a …

AI Backends

Gemini 2 Flash

GPT-4o

o3-mini

o1-mini

o1

Gemini 2 Pro

Sky-T1

DeepSeek R1

Claude 3 Opus

Claude 3.5 Sonnet

Claude 3.5 Haiku

Sugaku, Inc. Copyright 2024
