↓Skip to main content

🏢 National Key Laboratory of Novel Software Technology

Multi-Agent Domain Calibration with a Handful of Offline Data

26 September 2024·2343 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 National Key Laboratory of Novel Software Technology

Madoc: A novel multi-agent framework calibrates RL policies for new environments using limited offline data, achieving superior performance in various locomotion tasks.