Skip to main content

🏢 University of Southern Denmark

Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
·2460 words·12 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 University of Southern Denmark
MOMBO: a novel offline reinforcement learning algorithm that uses deterministic uncertainty propagation for faster convergence and tighter suboptimality bounds.