🏢 University of Parma
$^R$FLAV: Rolling Flow matching for infinite Audio Video generation
·2128 words·10 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Audio-Visual Learning
🏢 University of Parma
RFLAV: A novel rolling flow matching model for infinite audio-video generation with high quality, synchronization, and temporal coherence.