Skip to main content

🏢 National Engineering Research Center for Multimedia Software,School of Computer Science,Wuhan University

Empowering Visible-Infrared Person Re-Identification with Large Foundation Models
·2429 words·12 mins· loading · loading
AI Generated Multimodal Learning Cross-Modal Retrieval 🏢 National Engineering Research Center for Multimedia Software,School of Computer Science,Wuhan University
Large foundation models empower visible-infrared person re-identification by enriching infrared image representations with automatically generated textual descriptions, significantly improving cross-m…