🏢 ETS Montréal, Canada

WATT: Weight Average Test Time Adaptation of CLIP
Multimodal Learning · Vision-Language Models
WATT, a novel test-time adaptation method, boosts CLIP’s performance on domain-shifted images by averaging weights obtained from multiple text prompts, achieving state-of-the-art results without extra …