🏢 Integrated Vision and Language Lab, KAIST, South Korea
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
·3116 words·15 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Integrated Vision and Language Lab, KAIST, South Korea
CODE combats LMM hallucinations by contrasting self-generated descriptions with visual content during decoding, enhancing response accuracy without retraining.