🏢 AIM Lab, University of Amsterdam
IPO: Interpretable Prompt Optimization for Vision-Language Models
3712 words · 18 mins
Multimodal Learning
Vision-Language Models
This paper introduces IPO, a novel interpretable prompt optimizer for vision-language models. IPO uses large language models (LLMs) to dynamically generate human-understandable prompts, improving acc…