↓Skip to main content

🏢 University of Arizona

Beyond task diversity: provable representation transfer for sequential multitask linear bandits

26 September 2024·1405 words·7 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Arizona

Lifelong learning in linear bandits gets a boost! A new algorithm, BOSS, achieves low regret without the usual ‘task diversity’ assumption, opening doors for more realistic sequential multi-task lear…

Adaptive Experimentation When You Can't Experiment

26 September 2024·1383 words·7 mins· loading · loading

AI Theory Causality 🏢 University of Arizona

Adaptive experimentation tackles confounding in online A/B tests using encouragement designs and a novel linear bandit approach, achieving near-optimal sample complexity.