Image Generation

FouRA: Fourier Low-Rank Adaptation

26 September 2024·5441 words·26 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Qualcomm AI Research

FouRA: a novel low-rank adaptation method improves text-to-image generation by learning projections in the Fourier domain and using an adaptive rank selection strategy, addressing LoRA’s limitations o…

FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner

26 September 2024·1980 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 Tsinghua University

FlowTurbo: Blazing-fast, high-quality flow-based image generation via a velocity refiner!

Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching

26 September 2024·2171 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 UC Los Angeles

ICTM efficiently solves linear inverse problems using flow priors by iteratively optimizing local MAP objectives, outperforming other flow-based methods.

Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models

26 September 2024·3454 words·17 mins· loading · loading

Computer Vision Image Generation 🏢 German Research Center for Artificial Intelligence

NEMO pinpoints & deactivates neurons memorizing training data in diffusion models, boosting privacy & image diversity.

FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification

26 September 2024·2727 words·13 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Huazhong University of Science and Technology

FasterDiT accelerates Diffusion Transformers training 7x without architecture modification by analyzing SNR probability density functions and implementing a new supervision method.

Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

26 September 2024·5119 words·25 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Nankai University

Faster Diffusion achieves significant speedups in diffusion model inference by cleverly reusing encoder features and enabling parallel processing, eliminating the need for computationally expensive di…

FastDrag: Manipulate Anything in One Step

26 September 2024·2454 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 College of Computer Science and Technology, Harbin Engineering University

FastDrag: One-step image manipulation using generative models, drastically improving editing speed without sacrificing quality.

Fast samplers for Inverse Problems in Iterative Refinement models

26 September 2024·3647 words·18 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 UC Irvine

Conditional Conjugate Integrators (CCI) drastically accelerate sampling in iterative refinement models for inverse problems, achieving high-quality results with only a few steps.

FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models

26 September 2024·2387 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Zhejiang University

FashionR2R leverages diffusion models to realistically translate rendered fashion images into photorealistic counterparts, enhancing realism and preserving fine-grained clothing textures.

FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation

26 September 2024·4446 words·21 mins· loading · loading

Computer Vision Image Generation 🏢 Singapore University of Technology and Design

FairQueue improves fair text-to-image generation by addressing prompt learning’s quality issues through prompt queuing and attention amplification.

Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation

26 September 2024·4968 words·24 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Google

This paper presents a novel neural network architecture that simultaneously learns to generate and segment images in an unsupervised manner, achieving accurate results across multiple datasets without…

Face2QR: A Unified Framework for Aesthetic, Face-Preserving, and Scannable QR Code Generation

26 September 2024·2451 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Shanghai Jiao Tong University

Face2QR: A unified framework generates aesthetically pleasing, scannable QR codes that faithfully preserve facial features, solving the conflict between aesthetics, identity, and scannability.

Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image Editing

26 September 2024·2111 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 University of Michigan

LOCO Edit achieves precise, localized image editing in diffusion models via a single-step, training-free method leveraging low-dimensional semantic subspaces.

Exploring Fixed Point in Image Editing: Theoretical Support and Convergence Optimization

26 September 2024·2322 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 East China Normal University

This paper theoretically proves the existence and uniqueness of fixed points in DDIM inversion, optimizing the loss function for improved image editing and extending this approach to unsupervised imag…

Exploring DCN-like architecture for fast image generation with arbitrary resolution

26 September 2024·1909 words·9 mins· loading · loading

Computer Vision Image Generation 🏢 Nanjing University

FlowDCN: A purely convolutional generative model achieves state-of-the-art image generation speed and quality at arbitrary resolutions, surpassing transformer-based models.

Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation

26 September 2024·4177 words·20 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Monash University

This research introduces adversarial concept preservation, a novel method for safely erasing undesirable concepts from diffusion models, outperforming existing techniques by preserving related sensiti…

Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination

26 September 2024·2128 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 Technion

This paper introduces a novel post-processing technique that significantly boosts the perceptual quality of images generated by consistency models using a joint classifier-discriminator adversarially …

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

26 September 2024·2466 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Tsinghua University

EfficientNAT: a novel approach to token-based image synthesis boosts performance and slashes computational costs by cleverly disentangling and optimizing spatial-temporal interactions between image to…

EM Distillation for One-step Diffusion Models

26 September 2024·3404 words·16 mins· loading · loading

Computer Vision Image Generation 🏢 Google DeepMind

EM Distillation (EMD) efficiently trains one-step diffusion models by using an Expectation-Maximization approach, achieving state-of-the-art image generation quality and outperforming existing methods…

EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching

26 September 2024·3842 words·19 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Midea Group

The Efficient Diffusion Transformer (EDT) framework significantly speeds up and improves image generation by leveraging a lightweight architecture, human-like sketching-inspired Attention Modulation M…