Skip to main content

🏢 UC Santa Cruz

ViLBench: A Suite for Vision-Language Process Reward Modeling
·373 words·2 mins· loading · loading
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 UC Santa Cruz
VILBENCH: Vision-Language Process Reward Modeling Suite