Skip to main content

Computer Vision

Unsupervised Object Detection with Theoretical Guarantees
·2140 words·11 mins· loading · loading
Computer Vision Object Detection 🏢 University of Oxford
First unsupervised object detection method with theoretical guarantees to recover true object positions, up to quantifiable small shifts!
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
·2466 words·12 mins· loading · loading
Computer Vision Image Segmentation 🏢 Vivo Mobile Communication Co., Ltd
Modality Adaptation with Diffusion Models (MADM) achieves state-of-the-art semantic segmentation by using pre-trained text-to-image diffusion models to enhance cross-modality capabilities and generate…
Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization
·1995 words·10 mins· loading · loading
Computer Vision Image Generation 🏢 Samsung Electro-Mechanics
AltO: a novel unsupervised learning framework for accurately estimating homography from multimodal image pairs, achieving performance comparable to supervised methods.
Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure
·4466 words·21 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏢 DIAG, Sapienza University of Rome
This study introduces a novel unsupervised hierarchy-agnostic image segmentation method achieving detailed and unbiased parsing of semantic image structures across various datasets.
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
·4157 words·20 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏢 Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University
UnSeg: One universal unlearnable example generator protects images from image segmentation model training.
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
·1792 words·9 mins· loading · loading
Computer Vision Image Generation 🏢 KAIST
Self-guidance boosts masked generative models’ image synthesis, achieving superior quality and diversity with fewer steps!
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
·2279 words·11 mins· loading · loading
Computer Vision Image Segmentation 🏢 Zhejiang University
DiffewS: a novel framework leverages diffusion models for few-shot semantic segmentation, significantly outperforming existing methods in multiple settings.
Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems
·3236 words·16 mins· loading · loading
Computer Vision Image Generation 🏢 Tsinghua University
ProjDiff: A novel algorithm unleashes diffusion models’ denoising power for superior inverse problem solutions.
Unleashing Multispectral Video's Potential in Semantic Segmentation: A Semi-supervised Viewpoint and New UAV-View Benchmark
·2138 words·11 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏢 University of Alberta
New MVUAV dataset and SemiMV semi-supervised learning model significantly improve multispectral video semantic segmentation!
Unlearnable 3D Point Clouds: Class-wise Transformation Is All You Need
·3088 words·15 mins· loading · loading
Computer Vision 3D Vision 🏢 Huazhong University of Science and Technology
New unlearnable framework secures 3D point cloud data by using class-wise transformations, enabling authorized training while preventing unauthorized access.
UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections
·3538 words·17 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 ETH Zurich
UniSDF: Unifying neural representations reconstructs complex scenes with reflections, achieving state-of-the-art performance by blending camera and reflected view radiance fields.
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
·2641 words·13 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Tsinghua University
Unique3D: Single image to high-fidelity 3D mesh in 30 seconds!
UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes
·2470 words·12 mins· loading · loading
AI Generated Computer Vision Object Detection 🏢 Delft University of Technology
UNION: Unsupervised 3D object detection method doubles average precision, leveraging LiDAR, camera, and temporal data for efficient training without manual labels.
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
·2732 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 Sun Yat-Sen University
UniFL: Unified Feedback Learning revolutionizes latent diffusion models by improving image quality, aesthetics, and inference speed through a unified feedback learning framework, surpassing existing m…
Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
·3024 words·15 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Korea University
Unified Domain Generalization and Adaptation (UDGA) tackles 3D object detection’s domain adaptation challenges by leveraging multi-view overlap and label-efficient learning, achieving state-of-the-art…
UniDSeg: Unified Cross-Domain 3D Semantic Segmentation via Visual Foundation Models Prior
·3219 words·16 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Xiamen University
UniDSeg uses Visual Foundation Models to create a unified framework for adaptable and generalizable cross-domain 3D semantic segmentation, achieving state-of-the-art results.
Understanding Visual Feature Reliance through the Lens of Complexity
·3993 words·19 mins· loading · loading
AI Generated Computer Vision Image Classification 🏢 Google DeepMind
Deep learning models favor simple features, hindering generalization; this paper introduces a new feature complexity metric revealing a spectrum of simple-to-complex features, their learning dynamics,…
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
·2683 words·13 mins· loading · loading
Computer Vision Image Segmentation 🏢 Graduate School of Artificial Intelligence, KAIST
PartCLIPSeg, a novel framework, leverages generalized parts and object-level contexts to achieve significant improvements in open-vocabulary part segmentation, outperforming state-of-the-art methods.
Understanding Hallucinations in Diffusion Models through Mode Interpolation
·2934 words·14 mins· loading · loading
Computer Vision Image Generation 🏢 Carnegie Mellon University
Diffusion models generate unrealistic images by smoothly interpolating between data modes; this paper identifies this ‘mode interpolation’ failure and proposes a metric to detect and reduce it.
Understanding Bias in Large-Scale Visual Datasets
·5190 words·25 mins· loading · loading
AI Generated Computer Vision Image Classification 🏢 University of Pennsylvania
Researchers unveil a novel framework to dissect bias in large-scale visual datasets, identifying unique visual attributes and leveraging language models for detailed analysis, paving the way for creat…