Image Generation
FouRA: Fourier Low-Rank Adaptation
·5441 words·26 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Qualcomm AI Research
FouRA: a novel low-rank adaptation method improves text-to-image generation by learning projections in the Fourier domain and using an adaptive rank selection strategy, addressing LoRA’s limitations o…
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner
·1980 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Tsinghua University
FlowTurbo: Blazing-fast, high-quality flow-based image generation via a velocity refiner!
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
·2171 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 UC Los Angeles
ICTM efficiently solves linear inverse problems using flow priors by iteratively optimizing local MAP objectives, outperforming other flow-based methods.
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
·3454 words·17 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 German Research Center for Artificial Intelligence
NEMO pinpoints & deactivates neurons memorizing training data in diffusion models, boosting privacy & image diversity.
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
·2727 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Huazhong University of Science and Technology
FasterDiT accelerates Diffusion Transformers training 7x without architecture modification by analyzing SNR probability density functions and implementing a new supervision method.
Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference
·5119 words·25 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Nankai University
Faster Diffusion achieves significant speedups in diffusion model inference by cleverly reusing encoder features and enabling parallel processing, eliminating the need for computationally expensive di…
FastDrag: Manipulate Anything in One Step
·2454 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 College of Computer Science and Technology, Harbin Engineering University
FastDrag: One-step image manipulation using generative models, drastically improving editing speed without sacrificing quality.
Fast samplers for Inverse Problems in Iterative Refinement models
·3647 words·18 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 UC Irvine
Conditional Conjugate Integrators (CCI) drastically accelerate sampling in iterative refinement models for inverse problems, achieving high-quality results with only a few steps.
FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models
·2387 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Zhejiang University
FashionR2R leverages diffusion models to realistically translate rendered fashion images into photorealistic counterparts, enhancing realism and preserving fine-grained clothing textures.
FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation
·4446 words·21 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Singapore University of Technology and Design
FairQueue improves fair text-to-image generation by addressing prompt learning’s quality issues through prompt queuing and attention amplification.
Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation
·4968 words·24 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Google
This paper presents a novel neural network architecture that simultaneously learns to generate and segment images in an unsupervised manner, achieving accurate results across multiple datasets without…
Face2QR: A Unified Framework for Aesthetic, Face-Preserving, and Scannable QR Code Generation
·2451 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Shanghai Jiao Tong University
Face2QR: A unified framework generates aesthetically pleasing, scannable QR codes that faithfully preserve facial features, solving the conflict between aesthetics, identity, and scannability.
Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image Editing
·2111 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 University of Michigan
LOCO Edit achieves precise, localized image editing in diffusion models via a single-step, training-free method leveraging low-dimensional semantic subspaces.
Exploring Fixed Point in Image Editing: Theoretical Support and Convergence Optimization
·2322 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 East China Normal University
This paper theoretically proves the existence and uniqueness of fixed points in DDIM inversion, optimizing the loss function for improved image editing and extending this approach to unsupervised imag…
Exploring DCN-like architecture for fast image generation with arbitrary resolution
·1909 words·9 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Nanjing University
FlowDCN: A purely convolutional generative model achieves state-of-the-art image generation speed and quality at arbitrary resolutions, surpassing transformer-based models.
Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation
·4177 words·20 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Monash University
This research introduces adversarial concept preservation, a novel method for safely erasing undesirable concepts from diffusion models, outperforming existing techniques by preserving related sensiti…
Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination
·2128 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Technion
This paper introduces a novel post-processing technique that significantly boosts the perceptual quality of images generated by consistency models using a joint classifier-discriminator adversarially …
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
·2466 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Tsinghua University
EfficientNAT: a novel approach to token-based image synthesis boosts performance and slashes computational costs by cleverly disentangling and optimizing spatial-temporal interactions between image to…
EM Distillation for One-step Diffusion Models
·3404 words·16 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Google DeepMind
EM Distillation (EMD) efficiently trains one-step diffusion models by using an Expectation-Maximization approach, achieving state-of-the-art image generation quality and outperforming existing methods…
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
·3842 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Midea Group
The Efficient Diffusion Transformer (EDT) framework significantly speeds up and improves image generation by leveraging a lightweight architecture, human-like sketching-inspired Attention Modulation M…