
$^R$FLAV: Rolling Flow matching for infinite Audio Video generation
2128 words · 10 mins
AI Generated · 🤗 Daily Papers · Multimodal Learning · Audio-Visual Learning · 🏢 University of Parma
RFLAV: A rolling flow matching model that generates audio-video sequences of unbounded length while maintaining output quality, audio-video synchronization, and temporal coherence.