↓Skip to main content

🏢 Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences

TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration

26 September 2024·1922 words·10 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences

TransAgent empowers vision-language models by collaborating with diverse expert agents, achieving state-of-the-art performance in low-shot visual recognition.