Skip to main content

🏢 Department of Electrical and Computer Engineering University of Central Florida

A Unified Principle of Pessimism for Offline Reinforcement Learning under Model Mismatch
·1838 words·9 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Department of Electrical and Computer Engineering University of Central Florida
Unified pessimism principle in offline RL conquers data sparsity & model mismatch, achieving near-optimal performance across various divergence models.