🏢 University of Southern Denmark
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
2460 words · 12 mins
AI Generated
Machine Learning
Reinforcement Learning
MOMBO: a novel offline reinforcement learning algorithm that uses deterministic uncertainty propagation for faster convergence and tighter suboptimality bounds.