Skip to main content

🏢 Seoul National University

A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
·2816 words·14 mins· loading · loading
Multimodal Learning Audio-Visual Learning 🏢 Seoul National University
A single model tackles diverse audiovisual generation tasks using a novel Mixture of Noise Levels approach, resulting in temporally consistent and high-quality outputs.
A Gradient Accumulation Method for Dense Retriever under Memory Constraint
·1813 words·9 mins· loading · loading
Natural Language Processing Question Answering 🏢 Seoul National University
CONTACCUM: Stable, efficient memory reduction for dense retrievers using dual memory banks, surpassing high-resource baselines.
4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization
·1909 words·9 mins· loading · loading
Computer Vision 3D Vision 🏢 Seoul National University
Uncertainty-aware 4D Gaussian Splatting enhances dynamic scene reconstruction from monocular videos by selectively applying regularization to uncertain regions, improving both novel view synthesis and…