↓Skip to main content

🏢 National Engineering Research Center for Multimedia Software,School of Computer Science,Wuhan University

Empowering Visible-Infrared Person Re-Identification with Large Foundation Models

26 September 2024·2429 words·12 mins· loading · loading

AI Generated Multimodal Learning Cross-Modal Retrieval 🏢 National Engineering Research Center for Multimedia Software,School of Computer Science,Wuhan University

Large foundation models empower visible-infrared person re-identification by enriching infrared image representations with automatically generated textual descriptions, significantly improving cross-m…