🏢 National Key Laboratory of Novel Software Technology
Multi-Agent Domain Calibration with a Handful of Offline Data
Machine Learning
Reinforcement Learning
Madoc is a novel multi-agent framework that calibrates RL policies for new environments using only a handful of offline data, achieving superior performance across a range of locomotion tasks.