🏢 SHI Labs @ Georgia Tech & UIUC

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
2211 words · 11 mins
Multimodal Learning · Vision-Language Models
CuMo improves multimodal LLMs by integrating co-upcycled Mixture-of-Experts blocks, achieving state-of-the-art performance while adding only minimal extra parameters at inference time.