Skip to main content

Image Generation

Cooperative Hardware-Prompt Learning for Snapshot Compressive Imaging
·1775 words·9 mins· loading · loading
Computer Vision Image Generation 🏢 Rochester Institute of Technology
Federated Hardware-Prompt Learning (FedHP) enables robust cross-hardware SCI training by aligning inconsistent data distributions using a hardware-conditioned prompter, outperforming existing FL metho…
Constrained Diffusion with Trust Sampling
·2019 words·10 mins· loading · loading
Computer Vision Image Generation 🏢 Stanford University
Trust Sampling enhances guided diffusion by iteratively optimizing constrained generation at each step, improving efficiency and accuracy in image and 3D motion generation.
Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness
·1893 words·9 mins· loading · loading
Computer Vision Image Generation 🏢 University of Wisconsin-Madison
Consistency Purification boosts certified robustness by efficiently purifying noisy images using a one-step generative model, achieving state-of-the-art results while maintaining semantic alignment.
Consistency Diffusion Bridge Models
·431 words·3 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Tsinghua University
Consistency Diffusion Bridge Models (CDBMs) dramatically speed up diffusion bridge model sampling by learning a consistency function, achieving up to a 50x speedup with improved sample quality.
ColJailBreak: Collaborative Generation and Editing for Jailbreaking Text-to-Image Deep Generation
·2067 words·10 mins· loading · loading
Computer Vision Image Generation 🏢 Xi'an Jiaotong University
ColJailBreak cleverly circumvents AI safety filters by first generating safe images and then subtly injecting unsafe content using image editing.
Classification Diffusion Models: Revitalizing Density Ratio Estimation
·2385 words·12 mins· loading · loading
Computer Vision Image Generation 🏢 Technion - Israel Institute of Technology
Classification Diffusion Models (CDMs) revolutionize density ratio estimation by integrating the strengths of diffusion models and classifiers, achieving state-of-the-art image generation and likeliho…
Can Simple Averaging Defeat Modern Watermarks?
·3146 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 National University of Singapore
Simple averaging of watermarked images reveals hidden patterns, enabling watermark removal and forgery, thus highlighting the vulnerability of content-agnostic watermarking methods.
Breaking Semantic Artifacts for Generalized AI-generated Image Detection
·3098 words·15 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 School of Cyber Science and Engineering, Xi'an Jiaotong University
Researchers developed a new AI-generated image detection method that overcomes the limitation of existing detectors, achieving superior cross-scene generalization by shuffling image patches and traini…
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?
·2562 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 MIT
BrainBits reveals that surprisingly little brain information is needed for high-fidelity image & text reconstruction, highlighting the dominance of generative model priors over neural signal extractio…
Blind Image Restoration via Fast Diffusion Inversion
·1974 words·10 mins· loading · loading
Computer Vision Image Generation 🏢 Computer Vision Group, Institute of Informatics, University of Bern, Switzerland
BIRD: a novel blind image restoration method jointly optimizes degradation model parameters and the restored image, ensuring realistic outputs via fast diffusion inversion and achieving state-of-the-a…
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference
·3033 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 University of Michigan
BLAST matrix learns efficient weight structures for faster deep learning inference, achieving significant compression and performance gains on various models.
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
·5994 words·29 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Snap Inc.
BitsFusion achieves 7.9x smaller Stable Diffusion models by quantizing UNet weights to 1.99 bits, surprisingly improving image generation quality!
Binarized Diffusion Model for Image Super-Resolution
·1567 words·8 mins· loading · loading
Computer Vision Image Generation 🏢 ETH Zurich
BI-DiffSR, a novel binarized diffusion model, achieves high-quality image super-resolution with significantly reduced memory and computational costs, outperforming existing methods.
BiDM: Pushing the Limit of Quantization for Diffusion Models
·2390 words·12 mins· loading · loading
Computer Vision Image Generation 🏢 Beihang University
BiDM achieves full 1-bit quantization in diffusion models, significantly improving storage and speed without sacrificing image quality, setting a new state-of-the-art.
BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models
·3084 words·15 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Zhejiang University
O-BELM, a novel diffusion model sampler, achieves mathematically exact inversion with superior sampling quality, offering a new gold standard for diffusion model applications.
Autoregressive Image Generation without Vector Quantization
·1807 words·9 mins· loading · loading
Image Generation 🏢 Massachusetts Institute of Technology
Autoregressive image generation is revolutionized by eliminating vector quantization, achieving strong results with increased speed using a novel diffusion procedure.
Attack-Resilient Image Watermarking Using Stable Diffusion
·3069 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 University of Massachusetts Amherst
ZoDiac: a novel image watermarking framework leveraging pre-trained stable diffusion models for robust, invisible watermarks resistant to state-of-the-art attacks.
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
·2720 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 National University of Singapore
AsyncDiff accelerates diffusion model inference by 2.8x using asynchronous denoising and model parallelism, maintaining near-perfect image quality.
Association of Objects May Engender Stereotypes: Mitigating Association-Engendered Stereotypes in Text-to-Image Generation
·3516 words·17 mins· loading · loading
Image Generation 🏢 Southern University of Science and Technology
New framework, MAS, effectively mitigates stereotypes in text-to-image generation by aligning the probability distribution of generated images to stereotype-free distributions.
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
·2478 words·12 mins· loading · loading
Computer Vision Image Generation 🏢 Snap Inc.
AsCAN, a novel hybrid architecture, achieves superior efficiency and performance in image recognition and generation by asymmetrically combining convolutional and transformer blocks.