🏢 Bytedance Research

How Far is Video Generation from World Model: A Physical Law Perspective
·3657 words·18 mins
AI Generated · 🤗 Daily Papers · Computer Vision · Video Understanding
Scaling video generation models doesn’t guarantee they’ll learn physics; this study reveals they prioritize visual cues over true physical understanding.