🏢 Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
1922 words · 10 mins
Multimodal Learning
Vision-Language Models
TransAgent improves vision-language foundation models through collaboration with diverse expert agents, achieving state-of-the-art performance in low-shot visual recognition.