Image Generation

Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering

26 September 2024·3094 words·15 mins· loading · loading

Computer Vision Image Generation 🏢 Technion

Zero-to-Hero enhances zero-shot novel view synthesis by cleverly filtering attention maps during inference, achieving significantly higher fidelity and realism without retraining.

Zero-shot Image Editing with Reference Imitation

26 September 2024·2284 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 University of Hong Kong

MimicBrush: a novel image editing approach using reference imitation for intuitive zero-shot edits.

Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models

26 September 2024·2837 words·14 mins· loading · loading

Computer Vision Image Generation 🏢 University of Texas at Austin

Warped Diffusion cleverly adapts image diffusion models for video inverse problems, solving flickering and temporal inconsistency issues by viewing video frames as continuous warping transformations a…

Vivid-ZOO: Multi-View Video Generation with Diffusion Model

26 September 2024·2634 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 King Abdullah University of Science and Technology

Vivid-ZOO: Generating high-quality multi-view videos from text using a novel diffusion model.

Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion

26 September 2024·4160 words·20 mins· loading · loading

Computer Vision Image Generation 🏢 Department of Biomedical Engineering, Southern University of Science and Technology

Researchers developed a novel zero-shot EEG-based framework for visual reconstruction using a tailored brain encoder and a two-stage image generation strategy, achieving state-of-the-art performance i…

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

26 September 2024·1946 words·10 mins· loading · loading

Image Generation 🏢 Peking University

Visual Autoregressive Modeling (VAR) revolutionizes image generation by using a coarse-to-fine ’next-scale prediction’, outperforming diffusion models and exhibiting scaling laws similar to LLMs.

Virtual Scanning: Unsupervised Non-line-of-sight Imaging from Irregularly Undersampled Transients

26 September 2024·2316 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 Tianjin University

Unsupervised learning framework enables high-fidelity non-line-of-sight (NLOS) imaging from irregularly undersampled transients, surpassing state-of-the-art methods in speed and robustness.

UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond

26 September 2024·2752 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology

UPS: A novel algorithm for lightweight single-image super-resolution, decoupling feature extraction and similarity modeling for enhanced efficiency and robustness.

Untrained Neural Nets for Snapshot Compressive Imaging: Theory and Algorithms

26 September 2024·3555 words·17 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 ECE Department, Rutgers University

Untrained neural networks revolutionize snapshot compressive imaging (SCI) by enabling high-dimensional data recovery from a single 2D measurement, achieving state-of-the-art results without needing e…

Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization

26 September 2024·1995 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 Samsung Electro-Mechanics

AltO: a novel unsupervised learning framework for accurately estimating homography from multimodal image pairs, achieving performance comparable to supervised methods.

Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance

26 September 2024·1792 words·9 mins· loading · loading

Computer Vision Image Generation 🏢 KAIST

Self-guidance boosts masked generative models’ image synthesis, achieving superior quality and diversity with fewer steps!

Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems

26 September 2024·3236 words·16 mins· loading · loading

Computer Vision Image Generation 🏢 Tsinghua University

ProjDiff: A novel algorithm unleashes diffusion models’ denoising power for superior inverse problem solutions.

UniFL: Improve Latent Diffusion Model via Unified Feedback Learning

26 September 2024·2732 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 Sun Yat-Sen University

UniFL: Unified Feedback Learning revolutionizes latent diffusion models by improving image quality, aesthetics, and inference speed through a unified feedback learning framework, surpassing existing m…

Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement

26 September 2024·3015 words·15 mins· loading · loading

Image Generation 🏢 Shanghai Jiao Tong University

Enhance deep neural network privacy and trustworthiness with unified gradient-based machine unlearning, leveraging remain geometry for efficient forgetting and performance preservation.

Understanding Hallucinations in Diffusion Models through Mode Interpolation

26 September 2024·2934 words·14 mins· loading · loading

Computer Vision Image Generation 🏢 Carnegie Mellon University

Diffusion models generate unrealistic images by smoothly interpolating between data modes; this paper identifies this ‘mode interpolation’ failure and proposes a metric to detect and reduce it.

Understanding and Improving Training-free Loss-based Diffusion Guidance

26 September 2024·2849 words·14 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Microsoft Research

Training-free guidance revolutionizes diffusion models by enabling zero-shot conditional generation, but suffers from misaligned gradients and slow convergence. This paper provides theoretical analysi…

UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks

26 September 2024·3265 words·16 mins· loading · loading

Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology

UltraPixel generates high-quality images at various resolutions (1K-6K) efficiently using cascade diffusion models, achieving state-of-the-art performance.

UDPM: Upsampling Diffusion Probabilistic Models

26 September 2024·3261 words·16 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Tel Aviv University

UDPM: Upsampling Diffusion Probabilistic Models achieves high-quality image generation with fewer computations by incorporating downsampling and upsampling within the diffusion process.

U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers

26 September 2024·2151 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 Peking University

U-DiT: Revolutionizing diffusion transformers with a U-Net design and token downsampling for superior image generation and drastically reduced computation cost.

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

26 September 2024·2325 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 Shanghai Artificial Intelligence Laboratory

AdaptiveDiffusion accelerates diffusion model inference by adaptively skipping noise prediction steps, achieving 2-5x speedup without quality loss.