Skip to main content

Image Generation

Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering
·3094 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 Technion
Zero-to-Hero enhances zero-shot novel view synthesis by cleverly filtering attention maps during inference, achieving significantly higher fidelity and realism without retraining.
Zero-shot Image Editing with Reference Imitation
·2284 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 University of Hong Kong
MimicBrush: a novel image editing approach using reference imitation for intuitive zero-shot edits.
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
·2837 words·14 mins· loading · loading
Computer Vision Image Generation 🏢 University of Texas at Austin
Warped Diffusion cleverly adapts image diffusion models for video inverse problems, solving flickering and temporal inconsistency issues by viewing video frames as continuous warping transformations a…
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
·2634 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 King Abdullah University of Science and Technology
Vivid-ZOO: Generating high-quality multi-view videos from text using a novel diffusion model.
Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion
·4160 words·20 mins· loading · loading
Computer Vision Image Generation 🏢 Department of Biomedical Engineering, Southern University of Science and Technology
Researchers developed a novel zero-shot EEG-based framework for visual reconstruction using a tailored brain encoder and a two-stage image generation strategy, achieving state-of-the-art performance i…
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
·1946 words·10 mins· loading · loading
Image Generation 🏢 Peking University
Visual Autoregressive Modeling (VAR) revolutionizes image generation by using a coarse-to-fine ’next-scale prediction’, outperforming diffusion models and exhibiting scaling laws similar to LLMs.
Virtual Scanning: Unsupervised Non-line-of-sight Imaging from Irregularly Undersampled Transients
·2316 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Tianjin University
Unsupervised learning framework enables high-fidelity non-line-of-sight (NLOS) imaging from irregularly undersampled transients, surpassing state-of-the-art methods in speed and robustness.
UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond
·2752 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology
UPS: A novel algorithm for lightweight single-image super-resolution, decoupling feature extraction and similarity modeling for enhanced efficiency and robustness.
Untrained Neural Nets for Snapshot Compressive Imaging: Theory and Algorithms
·3555 words·17 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 ECE Department, Rutgers University
Untrained neural networks revolutionize snapshot compressive imaging (SCI) by enabling high-dimensional data recovery from a single 2D measurement, achieving state-of-the-art results without needing e…
Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization
·1995 words·10 mins· loading · loading
Computer Vision Image Generation 🏢 Samsung Electro-Mechanics
AltO: a novel unsupervised learning framework for accurately estimating homography from multimodal image pairs, achieving performance comparable to supervised methods.
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
·1792 words·9 mins· loading · loading
Computer Vision Image Generation 🏢 KAIST
Self-guidance boosts masked generative models’ image synthesis, achieving superior quality and diversity with fewer steps!
Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems
·3236 words·16 mins· loading · loading
Computer Vision Image Generation 🏢 Tsinghua University
ProjDiff: A novel algorithm unleashes diffusion models’ denoising power for superior inverse problem solutions.
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
·2732 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 Sun Yat-Sen University
UniFL: Unified Feedback Learning revolutionizes latent diffusion models by improving image quality, aesthetics, and inference speed through a unified feedback learning framework, surpassing existing m…
Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement
·3015 words·15 mins· loading · loading
Image Generation 🏢 Shanghai Jiao Tong University
Enhance deep neural network privacy and trustworthiness with unified gradient-based machine unlearning, leveraging remain geometry for efficient forgetting and performance preservation.
Understanding Hallucinations in Diffusion Models through Mode Interpolation
·2934 words·14 mins· loading · loading
Computer Vision Image Generation 🏢 Carnegie Mellon University
Diffusion models generate unrealistic images by smoothly interpolating between data modes; this paper identifies this ‘mode interpolation’ failure and proposes a metric to detect and reduce it.
Understanding and Improving Training-free Loss-based Diffusion Guidance
·2849 words·14 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Microsoft Research
Training-free guidance revolutionizes diffusion models by enabling zero-shot conditional generation, but suffers from misaligned gradients and slow convergence. This paper provides theoretical analysi…
UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks
·3265 words·16 mins· loading · loading
Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology
UltraPixel generates high-quality images at various resolutions (1K-6K) efficiently using cascade diffusion models, achieving state-of-the-art performance.
UDPM: Upsampling Diffusion Probabilistic Models
·3261 words·16 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Tel Aviv University
UDPM: Upsampling Diffusion Probabilistic Models achieves high-quality image generation with fewer computations by incorporating downsampling and upsampling within the diffusion process.
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
·2151 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Peking University
U-DiT: Revolutionizing diffusion transformers with a U-Net design and token downsampling for superior image generation and drastically reduced computation cost.
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
·2325 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Shanghai Artificial Intelligence Laboratory
AdaptiveDiffusion accelerates diffusion model inference by adaptively skipping noise prediction steps, achieving 2-5x speedup without quality loss.