↓Skip to main content

🏢 Courant Institute of Mathematical Sciences

Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning

26 September 2024·1862 words·9 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Courant Institute of Mathematical Sciences

I2M2: A novel framework revolutionizes multi-modal learning by jointly modeling inter- and intra-modality dependencies, achieving superior performance across diverse real-world datasets.