🏢 Step-Video Team

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

14 February 2025·4393 words·21 mins

AI Generated · 🤗 Daily Papers · Computer Vision · Video Understanding

Step-Video-T2V: a 30B-parameter text-to-video model that generates high-quality videos of up to 204 frames.