Skip to main content

🏢 National Key Laboratory of Novel Software Technology

Multi-Agent Domain Calibration with a Handful of Offline Data
·2343 words·11 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 National Key Laboratory of Novel Software Technology
Madoc: A novel multi-agent framework calibrates RL policies for new environments using limited offline data, achieving superior performance in various locomotion tasks.