🏢 Huazhong University of Science and Technology

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
3436 words · 17 mins
AI Generated · 🤗 Daily Papers · Computer Vision · Image Generation
LightningDiT resolves the optimization dilemma in latent diffusion models by aligning latent space with pre-trained vision models, achieving state-of-the-art ImageNet 256x256 generation with over 21x …