Image Generation
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users
·3873 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Nanyang Technological University
ART: A novel automatic red-teaming framework reveals safety vulnerabilities in popular text-to-image models by identifying unsafe outputs even from seemingly harmless prompts.
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
·2501 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Aalto University
Boosting image generation: Applying guidance selectively during diffusion model sampling drastically enhances image quality and inference speed, achieving state-of-the-art results.
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
·3145 words·15 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Shanghai Jiao Tong University
AnyFit: Controllable virtual try-on for any attire combination across any scenario, exceeding existing methods in accuracy and scalability.
An Image is Worth 32 Tokens for Reconstruction and Generation
·2076 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 ByteDance
Image generation gets a speed boost with TiTok, a novel 1D image tokenizer that uses just 32 tokens for high-quality image reconstruction and generation, achieving up to 410x faster processing than st…
An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations
·3657 words·18 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Peking University
EMDiffusion trains clean diffusion models from corrupted data using an expectation-maximization algorithm, achieving state-of-the-art results on diverse imaging tasks.
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization
·2692 words·13 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 ByteDance
DiMR boosts image generation fidelity by cleverly combining multi-resolution networks with time-dependent layer normalization in diffusion models, achieving state-of-the-art results on ImageNet.
Aligning Diffusion Models by Optimizing Human Utility
·3826 words·18 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 UC Los Angeles
Diffusion-KTO: Aligning text-to-image models with human preferences using simple likes/dislikes, maximizing expected human utility.
AirSketch: Generative Motion to Sketch
·2122 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 University of Central Florida
AirSketch generates aesthetically pleasing sketches directly from noisy hand-motion tracking data using a self-supervised controllable diffusion model, eliminating the need for expensive AR/VR equipme…
AID: Attention Interpolation of Text-to-Image Diffusion
·3646 words·18 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 National University of Singapore
AID, a novel training-free method, significantly improves image interpolation by fusing inner/outer interpolated attention layers and using beta-distribution for coefficient selection, enhancing consi…
Adversarial Schrödinger Bridge Matching
·3165 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Skoltech
Accelerate Schrödinger Bridge Matching with Discrete-time IMF using only a few steps, achieving comparable results to existing hundred-step methods via D-GAN implementation.
Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis
·3442 words·17 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 UC San Diego
FG-DMs revolutionize image synthesis by jointly modeling image and condition distributions, achieving higher object recall and enabling flexible editing.
Action Imitation in Common Action Space for Customized Action Image Synthesis
·1901 words·9 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Zhejiang University
TwinAct: Decoupling actions and actors for customizable text-guided action image generation.
ACFun: Abstract-Concrete Fusion Facial Stylization
·2192 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Xidian University
ACFun: A novel facial stylization method fusing abstract & concrete features for high-quality, artistically pleasing results from only one style & one face image.
A Modular Conditional Diffusion Framework for Image Reconstruction
·4235 words·20 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 MTS AI
A novel modular diffusion framework for image reconstruction dramatically cuts computational costs and achieves state-of-the-art perceptual quality across various tasks by cleverly combining pre-train…
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution
·2009 words·10 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Shanghai Jiao Tong University
2DQuant achieves highly efficient and accurate low-bit image super-resolution by using a dual-stage post-training quantization method that minimizes accuracy loss in transformer-based models, surpassi…