Image Segmentation

Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need

26 September 2024·2306 words·11 mins· loading · loading

Computer Vision Image Segmentation 🏢 Beijing University of Posts and Telecommunications

DEPICT: A new white-box decoder for Transformer-based semantic segmentation, achieving better performance with fewer parameters by leveraging the principle of compression and connecting Transformer de…

Relationship Prompt Learning is Enough for Open-Vocabulary Semantic Segmentation

26 September 2024·3268 words·16 mins· loading · loading

Computer Vision Image Segmentation 🏢 School of Informatics, Xiamen University

Relationship Prompt Network (RPN) achieves state-of-the-art open-vocabulary semantic segmentation using only prompt learning and a Vision-Language Model (VLM), eliminating the need for expensive segme…

Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery

26 September 2024·2697 words·13 mins· loading · loading

Computer Vision Image Segmentation 🏢 Google DeepMind

SynCx, a novel recurrent autoencoder with complex weights, surpasses state-of-the-art models in unsupervised object discovery by iteratively refining phase relationships to achieve robust object bindi…

One-to-Multiple: A Progressive Style Transfer Unsupervised Domain-Adaptive Framework for Kidney Tumor Segmentation

26 September 2024·2746 words·13 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 Xiangtan University

PSTUDA, a novel progressive style transfer framework, efficiently segments kidney tumors across multiple MRI sequences using unsupervised domain adaptation, achieving higher accuracy and efficiency th…

Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli

26 September 2024·2181 words·11 mins· loading · loading

Computer Vision Image Segmentation 🏢 University of Tübingen

Neuroscience-inspired motion energy processing enables human-like zero-shot generalization in figure-ground segmentation, outperforming deep learning models on random dot stimuli.

MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning

26 September 2024·1904 words·9 mins· loading · loading

Computer Vision Image Segmentation 🏢 Tencent Youtu Lab

MetaUAS achieves universal visual anomaly segmentation using only one normal image prompt via a pure vision model, surpassing previous zero-shot, few-shot, and full-shot methods.

MaskFactory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation

26 September 2024·1960 words·10 mins· loading · loading

Computer Vision Image Segmentation 🏢 Zhejiang University

MaskFactory generates high-quality synthetic data for dichotomous image segmentation, improving model training efficiency and accuracy.

Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation

26 September 2024·3232 words·16 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 Huazhong University of Science and Technology

Lightweight Frequency Masker significantly improves cross-domain few-shot semantic segmentation by cleverly filtering frequency components of images, thereby reducing inter-channel correlation and enh…

Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation

26 September 2024·3134 words·15 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 School of Electronic Engineering and Computer Science, Queen Mary University of London

ProMaC leverages MLLM hallucinations in an iterative framework to generate precise prompts for accurate object segmentation, minimizing manual prompt dependency.

Learning Segmentation from Point Trajectories

26 September 2024·2329 words·11 mins· loading · loading

Image Segmentation 🏢 University of Oxford

This paper introduces a novel unsupervised video object segmentation method using long-term point trajectories and optical flow, outperforming prior art by effectively combining sparse, long-term moti…

Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Segmentation

26 September 2024·2199 words·11 mins· loading · loading

Computer Vision Image Segmentation 🏢 Westlake University

FADA: a novel frequency-adapted learning scheme boosts domain-generalized semantic segmentation by decoupling style and content using Haar wavelets, achieving state-of-the-art results.

Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation

26 September 2024·2210 words·11 mins· loading · loading

Computer Vision Image Segmentation 🏢 UC Los Angeles

Deep metric learning and Coreset integration enables efficient slice-based active learning for 3D medical segmentation, surpassing existing methods in performance with low annotation budgets.

Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection

26 September 2024·2484 words·12 mins· loading · loading

Computer Vision Image Segmentation 🏢 School of Computer Science, Fudan University

Active Viewpoint Selection (AVS) significantly improves viewpoint-independent object-centric representations by actively selecting the most informative viewpoints for each scene, leading to better seg…

Hybrid Mamba for Few-Shot Segmentation

26 September 2024·2385 words·12 mins· loading · loading

Computer Vision Image Segmentation 🏢 Nanyang Technological University

Hybrid Mamba Network (HMNet) boosts few-shot segmentation accuracy by efficiently fusing support and query features using a novel hybrid Mamba architecture, significantly outperforming current state-o…

Historical Test-time Prompt Tuning for Vision Foundation Models

26 September 2024·2286 words·11 mins· loading · loading

Computer Vision Image Segmentation 🏢 Nanyang Technological University

HisTPT: Historical Test-Time Prompt Tuning memorizes past learning, enabling robust online prompt adaptation for vision models, overcoming performance degradation in continuously changing data streams…

GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs

26 September 2024·2370 words·12 mins· loading · loading

Computer Vision Image Segmentation 🏢 Peking University

GraphMorph: revolutionizing tubular structure extraction by morphing predicted graphs for superior topological accuracy.

Geometric Exploitation for Indoor Panoramic Semantic Segmentation

26 September 2024·3017 words·15 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 MAXST

Boosting indoor panoramic semantic segmentation, a new approach leverages geometric properties to optimize over- and under-sampled image segments for improved accuracy and robustness.

Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts

26 September 2024·3159 words·15 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 ShanghaiTech University

This research presents a novel method for robust semantic segmentation, achieving state-of-the-art results by generating coherent images with both semantic and covariate shifts and recalibrating uncer…

GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting

26 September 2024·3939 words·19 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 University of Toronto

GaussianCut enables intuitive 3D object selection via graph cuts on 3D Gaussian splatting, achieving competitive segmentation without extra training.

Flaws can be Applause: Unleashing Potential of Segmenting Ambiguous Objects in SAM

26 September 2024·2042 words·10 mins· loading · loading

Computer Vision Image Segmentation 🏢 Chinese University of Hong Kong

A-SAM: Turning SAM’s inherent ambiguity into an advantage for controllable, diverse, and convincing ambiguous object segmentation.