↓Skip to main content

🏢 Integrated Vision and Language Lab, KAIST, South Korea

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

26 September 2024·3116 words·15 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Integrated Vision and Language Lab, KAIST, South Korea

CODE combats LMM hallucinations by contrasting self-generated descriptions with visual content during decoding, enhancing response accuracy without retraining.