Skip to main content

Image Generation

Learning from Pattern Completion: Self-supervised Controllable Generation
·3650 words·18 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Peking University
Self-Supervised Controllable Generation (SCG) framework achieves brain-like associative generation by using a modular autoencoder with equivariance constraints and a self-supervised pattern completion…
Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks
·3093 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 Artificial and Natural Intelligence Toulouse Institute
AI now draws almost as well as humans, thanks to novel latent diffusion model regularizations that mimic human cognitive biases.
Latent Intrinsics Emerge from Training to Relight
·1743 words·9 mins· loading · loading
Image Generation 🏢 University of Chicago
A novel data-driven relighting model achieves state-of-the-art accuracy by learning latent intrinsic and extrinsic scene properties, even recovering albedo without explicit supervision.
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
·5238 words·25 mins· loading · loading
Computer Vision Image Generation 🏢 Electronics and Telecommunications Research Institute
KOALA: New efficient text-to-image diffusion models achieving 4x speed and 69% size reduction of SDXL, generating 1024px images on consumer GPUs with 8GB VRAM.
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models.
·3743 words·18 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 University of Glasgow
Pixelsmith: Generate gigapixel images with a single GPU, surpassing limitations of existing methods through a cascading approach and innovative guidance mechanism.
IR-CM: The Fast and Universal Image Restoration Method Based on Consistency Model
·2449 words·12 mins· loading · loading
Computer Vision Image Generation 🏢 Huazhong University of Science and Technology
IR-CM: One-step image restoration using a novel consistency model for fast and universal performance.
IODA: Instance-Guided One-shot Domain Adaptation for Super-Resolution
·2808 words·14 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Nanjing University
IODA achieves efficient one-shot domain adaptation for super-resolution using a novel instance-guided strategy and image-level domain alignment, significantly improving performance with limited target…
Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps
·4066 words·20 mins· loading · loading
Computer Vision Image Generation 🏢 HSE University
Invertible Consistency Distillation (iCD) achieves high-quality image editing in ~7 steps by enabling both fast editing and strong generation using a generalized distillation framework and dynamic cla…
Interpreting the Weight Space of Customized Diffusion Models
·3822 words·18 mins· loading · loading
Computer Vision Image Generation 🏢 UC Berkeley
Researchers model a manifold of customized diffusion models as a subspace of weights, enabling controllable creation of new models via sampling, editing, and inversion from a single image.
Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness Priors
·1664 words·8 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 York University
Interpretable lightweight transformers are built by unrolling graph smoothness priors, achieving high performance with significantly fewer parameters than conventional transformers.
Improving the Training of Rectified Flows
·4681 words·22 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Carnegie Mellon University
Researchers significantly boosted the efficiency and quality of rectified flow, a method for generating samples from diffusion models, by introducing novel training techniques that surpass state-of-th…
Improved Distribution Matching Distillation for Fast Image Synthesis
·2390 words·12 mins· loading · loading
Image Generation 🏢 Massachusetts Institute of Technology
DMD2 dramatically speeds up image generation by cleverly distilling expensive diffusion models, achieving state-of-the-art results without sacrificing quality.
Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment
·3506 words·17 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 UC Berkeley
Immiscible Diffusion boosts diffusion model training efficiency up to 3x by cleverly assigning noise to images, preventing the mixing of data in noise space and thus improving optimization.
IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation
·2991 words·15 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Nanjing University of Science and Technology
IMAGPose: A unified framework generating high-fidelity person images from single or multiple source images & poses, addressing existing methods’ limitations.
Image Understanding Makes for A Good Tokenizer for Image Generation
·2230 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 ByteDance
Leveraging image understanding models for image tokenizer training dramatically boosts image generation quality, achieving state-of-the-art results.
Image Reconstruction Via Autoencoding Sequential Deep Image Prior
·2498 words·12 mins· loading · loading
Computer Vision Image Generation 🏢 University of Michigan
aSeqDIP: A new unsupervised image reconstruction method using sequential deep image priors, achieving competitive performance with fewer data needs and faster runtimes.
Image Copy Detection for Diffusion Models
·3883 words·19 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 University of Technology Sydney
ICDiff, a novel Image Copy Detection system, tackles the unique challenge of identifying replicated content in diffusion model outputs, introducing a specialized dataset and deep embedding method for …
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
·3521 words·17 mins· loading · loading
Computer Vision Image Generation 🏢 KAIST
MuDI: a novel framework for multi-subject image personalization, effectively decoupling identities to prevent mixing using segmented subjects and a new evaluation metric.
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
·3058 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 Tsinghua University
Researchers solve the conditional image leakage problem in image-to-video diffusion models by proposing a new inference strategy and a time-dependent noise distribution for training. This yields video…
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
·2714 words·13 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 ByteDance
Hyper-SD boosts diffusion model speed by using trajectory segmented consistency distillation and human feedback, achieving state-of-the-art performance.