🏢 Seoul National University
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
2816 words · 14 mins
Multimodal Learning
Audio-Visual Learning
🏢 Seoul National University
A single model tackles diverse audiovisual generation tasks using a novel Mixture of Noise Levels approach, resulting in temporally consistent and high-quality outputs.
A Gradient Accumulation Method for Dense Retriever under Memory Constraint
1813 words · 9 mins
Natural Language Processing
Question Answering
🏢 Seoul National University
CONTACCUM: Stable, efficient memory reduction for dense retrievers using dual memory banks, surpassing high-resource baselines.
4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization
1909 words · 9 mins
Computer Vision
3D Vision
🏢 Seoul National University
Uncertainty-aware 4D Gaussian Splatting enhances dynamic scene reconstruction from monocular videos by selectively applying regularization to uncertain regions, improving both novel view synthesis and…