🏢 Alibaba International Digital Commerce
Wings: Learning Multimodal LLMs without Text-only Forgetting
1958 words · 10 mins
Multimodal Learning
Vision-Language Models
Wings is a novel multimodal LLM that combats "text-only forgetting" by using complementary visual and textual learners, achieving superior performance on both text-only and visual tasks.