Preprint arXiv Bandits and online learning

Adaptive Mirror Descent for Heavy-Tailed Bandits

Leila Aydın, Maya Singh, Renat Ostrovsky · OOAARG · Department of Computer Science

arXiv:2604.09812

A mirror descent variant whose potential adapts to the heaviness of the loss tail, achieving instance-optimal regret without prior tail knowledge.