🏢 National Engineering Research Center for Multimedia Software,School of Computer Science,Wuhan University
Empowering Visible-Infrared Person Re-Identification with Large Foundation Models
·2429 words·12 mins·
loading
·
loading
AI Generated
Multimodal Learning
Cross-Modal Retrieval
🏢 National Engineering Research Center for Multimedia Software,School of Computer Science,Wuhan University
Large foundation models empower visible-infrared person re-identification by enriching infrared image representations with automatically generated textual descriptions, significantly improving cross-m…