Image Generation

Learning from Pattern Completion: Self-supervised Controllable Generation

26 September 2024·3650 words·18 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Peking University

Self-Supervised Controllable Generation (SCG) framework achieves brain-like associative generation by using a modular autoencoder with equivariance constraints and a self-supervised pattern completion…

Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks

26 September 2024·3093 words·15 mins· loading · loading

Computer Vision Image Generation 🏢 Artificial and Natural Intelligence Toulouse Institute

AI now draws almost as well as humans, thanks to novel latent diffusion model regularizations that mimic human cognitive biases.

Latent Intrinsics Emerge from Training to Relight

26 September 2024·1743 words·9 mins· loading · loading

Image Generation 🏢 University of Chicago

A novel data-driven relighting model achieves state-of-the-art accuracy by learning latent intrinsic and extrinsic scene properties, even recovering albedo without explicit supervision.

KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

26 September 2024·5238 words·25 mins· loading · loading

Computer Vision Image Generation 🏢 Electronics and Telecommunications Research Institute

KOALA: New efficient text-to-image diffusion models achieving 4x speed and 69% size reduction of SDXL, generating 1024px images on consumer GPUs with 8GB VRAM.

Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models.

26 September 2024·3743 words·18 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 University of Glasgow

Pixelsmith: Generate gigapixel images with a single GPU, surpassing limitations of existing methods through a cascading approach and innovative guidance mechanism.

IR-CM: The Fast and Universal Image Restoration Method Based on Consistency Model

26 September 2024·2449 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Huazhong University of Science and Technology

IR-CM: One-step image restoration using a novel consistency model for fast and universal performance.

IODA: Instance-Guided One-shot Domain Adaptation for Super-Resolution

26 September 2024·2808 words·14 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Nanjing University

IODA achieves efficient one-shot domain adaptation for super-resolution using a novel instance-guided strategy and image-level domain alignment, significantly improving performance with limited target…

Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps

26 September 2024·4066 words·20 mins· loading · loading

Computer Vision Image Generation 🏢 HSE University

Invertible Consistency Distillation (iCD) achieves high-quality image editing in ~7 steps by enabling both fast editing and strong generation using a generalized distillation framework and dynamic cla…

Interpreting the Weight Space of Customized Diffusion Models

26 September 2024·3822 words·18 mins· loading · loading

Computer Vision Image Generation 🏢 UC Berkeley

Researchers model a manifold of customized diffusion models as a subspace of weights, enabling controllable creation of new models via sampling, editing, and inversion from a single image.

Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness Priors

26 September 2024·1664 words·8 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 York University

Interpretable lightweight transformers are built by unrolling graph smoothness priors, achieving high performance with significantly fewer parameters than conventional transformers.

Improving the Training of Rectified Flows

26 September 2024·4681 words·22 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Carnegie Mellon University

Researchers significantly boosted the efficiency and quality of rectified flow, a method for generating samples from diffusion models, by introducing novel training techniques that surpass state-of-th…

Improved Distribution Matching Distillation for Fast Image Synthesis

26 September 2024·2390 words·12 mins· loading · loading

Image Generation 🏢 Massachusetts Institute of Technology

DMD2 dramatically speeds up image generation by cleverly distilling expensive diffusion models, achieving state-of-the-art results without sacrificing quality.

Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment

26 September 2024·3506 words·17 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 UC Berkeley

Immiscible Diffusion boosts diffusion model training efficiency up to 3x by cleverly assigning noise to images, preventing the mixing of data in noise space and thus improving optimization.

IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation

26 September 2024·2991 words·15 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Nanjing University of Science and Technology

IMAGPose: A unified framework generating high-fidelity person images from single or multiple source images & poses, addressing existing methods’ limitations.

Image Understanding Makes for A Good Tokenizer for Image Generation

26 September 2024·2230 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 ByteDance

Leveraging image understanding models for image tokenizer training dramatically boosts image generation quality, achieving state-of-the-art results.

Image Reconstruction Via Autoencoding Sequential Deep Image Prior

26 September 2024·2498 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 University of Michigan

aSeqDIP: A new unsupervised image reconstruction method using sequential deep image priors, achieving competitive performance with fewer data needs and faster runtimes.

Image Copy Detection for Diffusion Models

26 September 2024·3883 words·19 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 University of Technology Sydney

ICDiff, a novel Image Copy Detection system, tackles the unique challenge of identifying replicated content in diffusion model outputs, introducing a specialized dataset and deep embedding method for …

Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models

26 September 2024·3521 words·17 mins· loading · loading

Computer Vision Image Generation 🏢 KAIST

MuDI: a novel framework for multi-subject image personalization, effectively decoupling identities to prevent mixing using segmented subjects and a new evaluation metric.

Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model

26 September 2024·3058 words·15 mins· loading · loading

Computer Vision Image Generation 🏢 Tsinghua University

Researchers solve the conditional image leakage problem in image-to-video diffusion models by proposing a new inference strategy and a time-dependent noise distribution for training. This yields video…

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

26 September 2024·2714 words·13 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 ByteDance

Hyper-SD boosts diffusion model speed by using trajectory segmented consistency distillation and human feedback, achieving state-of-the-art performance.