🏢 Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK
Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits
1492 words · 8 mins
Machine Learning
Reinforcement Learning
Novel covariance-adaptive algorithms achieve optimal gap-free regret bounds for combinatorial semi-bandits, improving efficiency with sampling-based approaches.