
🏢 State Key Laboratory of General Artificial Intelligence, BIGAI

On Domain-Specific Post-Training for Multimodal Large Language Models

29 November 2024·4939 words·24 mins

AI Generated · 🤗 Daily Papers · Multimodal Learning · Vision-Language Models

AdaMLLM adapts multimodal LLMs to specific domains using a visual instruction synthesizer and a single-stage post-training pipeline, and reports better performance than existing methods.