🏢 University of Arizona

Beyond task diversity: provable representation transfer for sequential multitask linear bandits
·1405 words·7 mins
Machine Learning · Reinforcement Learning · 🏢 University of Arizona
Lifelong learning in linear bandits gets a boost! A new algorithm, BOSS, achieves low regret without the usual ‘task diversity’ assumption, opening doors for more realistic sequential multi-task lear…
Adaptive Experimentation When You Can't Experiment
·1383 words·7 mins
AI Theory · Causality · 🏢 University of Arizona
Adaptive experimentation tackles confounding in online A/B tests using encouragement designs and a novel linear bandit approach, achieving near-optimal sample complexity.