Computer Vision

Unsupervised Object Detection with Theoretical Guarantees

26 September 2024·2140 words·11 mins· loading · loading

Computer Vision Object Detection 🏢 University of Oxford

First unsupervised object detection method with theoretical guarantees to recover true object positions, up to quantifiable small shifts!

Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation

26 September 2024·2466 words·12 mins· loading · loading

Computer Vision Image Segmentation 🏢 Vivo Mobile Communication Co., Ltd

Modality Adaptation with Diffusion Models (MADM) achieves state-of-the-art semantic segmentation by using pre-trained text-to-image diffusion models to enhance cross-modality capabilities and generate…

Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization

26 September 2024·1995 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 Samsung Electro-Mechanics

AltO: a novel unsupervised learning framework for accurately estimating homography from multimodal image pairs, achieving performance comparable to supervised methods.

Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure

26 September 2024·4466 words·21 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 DIAG, Sapienza University of Rome

This study introduces a novel unsupervised hierarchy-agnostic image segmentation method achieving detailed and unbiased parsing of semantic image structures across various datasets.

UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation

26 September 2024·4157 words·20 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University

UnSeg: One universal unlearnable example generator protects images from image segmentation model training.

Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance

26 September 2024·1792 words·9 mins· loading · loading

Computer Vision Image Generation 🏢 KAIST

Self-guidance boosts masked generative models’ image synthesis, achieving superior quality and diversity with fewer steps!

Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation

26 September 2024·2279 words·11 mins· loading · loading

Computer Vision Image Segmentation 🏢 Zhejiang University

DiffewS: a novel framework leverages diffusion models for few-shot semantic segmentation, significantly outperforming existing methods in multiple settings.

Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems

26 September 2024·3236 words·16 mins· loading · loading

Computer Vision Image Generation 🏢 Tsinghua University

ProjDiff: A novel algorithm unleashes diffusion models’ denoising power for superior inverse problem solutions.

Unleashing Multispectral Video's Potential in Semantic Segmentation: A Semi-supervised Viewpoint and New UAV-View Benchmark

26 September 2024·2138 words·11 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 University of Alberta

New MVUAV dataset and SemiMV semi-supervised learning model significantly improve multispectral video semantic segmentation!

Unlearnable 3D Point Clouds: Class-wise Transformation Is All You Need

26 September 2024·3088 words·15 mins· loading · loading

Computer Vision 3D Vision 🏢 Huazhong University of Science and Technology

New unlearnable framework secures 3D point cloud data by using class-wise transformations, enabling authorized training while preventing unauthorized access.

UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections

26 September 2024·3538 words·17 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 ETH Zurich

UniSDF: Unifying neural representations reconstructs complex scenes with reflections, achieving state-of-the-art performance by blending camera and reflected view radiance fields.

Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

26 September 2024·2641 words·13 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Tsinghua University

Unique3D: Single image to high-fidelity 3D mesh in 30 seconds!

UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes

26 September 2024·2470 words·12 mins· loading · loading

AI Generated Computer Vision Object Detection 🏢 Delft University of Technology

UNION: Unsupervised 3D object detection method doubles average precision, leveraging LiDAR, camera, and temporal data for efficient training without manual labels.

UniFL: Improve Latent Diffusion Model via Unified Feedback Learning

26 September 2024·2732 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 Sun Yat-Sen University

UniFL: Unified Feedback Learning revolutionizes latent diffusion models by improving image quality, aesthetics, and inference speed through a unified feedback learning framework, surpassing existing m…

Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

26 September 2024·3024 words·15 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Korea University

Unified Domain Generalization and Adaptation (UDGA) tackles 3D object detection’s domain adaptation challenges by leveraging multi-view overlap and label-efficient learning, achieving state-of-the-art…

UniDSeg: Unified Cross-Domain 3D Semantic Segmentation via Visual Foundation Models Prior

26 September 2024·3219 words·16 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Xiamen University

UniDSeg uses Visual Foundation Models to create a unified framework for adaptable and generalizable cross-domain 3D semantic segmentation, achieving state-of-the-art results.

Understanding Visual Feature Reliance through the Lens of Complexity

26 September 2024·3993 words·19 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Google DeepMind

Deep learning models favor simple features, hindering generalization; this paper introduces a new feature complexity metric revealing a spectrum of simple-to-complex features, their learning dynamics,…

Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

26 September 2024·2683 words·13 mins· loading · loading

Computer Vision Image Segmentation 🏢 Graduate School of Artificial Intelligence, KAIST

PartCLIPSeg, a novel framework, leverages generalized parts and object-level contexts to achieve significant improvements in open-vocabulary part segmentation, outperforming state-of-the-art methods.

Understanding Hallucinations in Diffusion Models through Mode Interpolation

26 September 2024·2934 words·14 mins· loading · loading

Computer Vision Image Generation 🏢 Carnegie Mellon University

Diffusion models generate unrealistic images by smoothly interpolating between data modes; this paper identifies this ‘mode interpolation’ failure and proposes a metric to detect and reduce it.

Understanding Bias in Large-Scale Visual Datasets

26 September 2024·5190 words·25 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 University of Pennsylvania

Researchers unveil a novel framework to dissect bias in large-scale visual datasets, identifying unique visual attributes and leveraging language models for detailed analysis, paving the way for creat…