🏢 ETS Montréal, Canada
WATT: Weight Average Test Time Adaptation of CLIP
3263 words · 16 mins
Multimodal Learning
Vision-Language Models
WATT: a novel test-time adaptation method that boosts CLIP’s performance on domain-shifted images by averaging weights obtained from multiple text prompts, achieving state-of-the-art results without extra …