🏢 Department of Electrical and Computer Engineering University of Central Florida
A Unified Principle of Pessimism for Offline Reinforcement Learning under Model Mismatch
·1838 words·9 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Department of Electrical and Computer Engineering University of Central Florida
Unified pessimism principle in offline RL conquers data sparsity & model mismatch, achieving near-optimal performance across various divergence models.