Preprint arXiv Bandits and online learning

Adaptive Mirror Descent for Heavy-Tailed Bandits

A mirror descent variant whose potential adapts to the heaviness of the loss tail, achieving instance-optimal regret without prior tail knowledge.