🏢 ShanghaiTech University
Learning Video Representations without Natural Videos
·3154 words·15 mins
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 ShanghaiTech University
High-performing video representation models can be trained using only synthetic videos and images, eliminating the need for large natural video datasets.