Skip to main content

🏢 University of California San Diego

Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
·3943 words·19 mins· loading · loading
AI Generated Multimodal Learning Vision-Language Models 🏢 University of California San Diego
This paper presents Text-to-Video Human Evaluation (T2VHE), a new protocol for evaluating text-to-video models, improving reliability, reproducibility, and practicality.