🏢 National University of Singapore

Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
·3600 words·17 mins
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 National University of Singapore
Collaborative Decoding (CoDe) dramatically boosts visual auto-regressive model efficiency.
OminiControl: Minimal and Universal Control for Diffusion Transformer
·3446 words·17 mins
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 National University of Singapore
OminiControl: A minimal, universal framework that efficiently integrates image conditions into diffusion transformers, enabling diverse and precise control over image generation.
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training
·4011 words·19 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 National University of Singapore
AnchorAttention enhances long-context LLMs by mitigating BFloat16’s disruptive effects on RoPE, improving performance and speeding up training.
Balancing Pipeline Parallelism with Vocabulary Parallelism
·3226 words·16 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 National University of Singapore
Boost large language model training speed by 51% with Vocabulary Parallelism, a novel technique that balances computation and memory usage across pipeline stages.
GenXD: Generating Any 3D and 4D Scenes
·2731 words·13 mins
AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 National University of Singapore
GenXD: A unified model that generates high-quality 3D and 4D scenes from any number of images, advancing the field of dynamic scene generation.