Skip to main content

Image Segmentation

Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need
·2306 words·11 mins· loading · loading
Computer Vision Image Segmentation 🏒 Beijing University of Posts and Telecommunications
DEPICT: A new white-box decoder for Transformer-based semantic segmentation, achieving better performance with fewer parameters by leveraging the principle of compression and connecting Transformer de…
Relationship Prompt Learning is Enough for Open-Vocabulary Semantic Segmentation
·3268 words·16 mins· loading · loading
Computer Vision Image Segmentation 🏒 School of Informatics, Xiamen University
Relationship Prompt Network (RPN) achieves state-of-the-art open-vocabulary semantic segmentation using only prompt learning and a Vision-Language Model (VLM), eliminating the need for expensive segme…
Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery
·2697 words·13 mins· loading · loading
Computer Vision Image Segmentation 🏒 Google DeepMind
SynCx, a novel recurrent autoencoder with complex weights, surpasses state-of-the-art models in unsupervised object discovery by iteratively refining phase relationships to achieve robust object bindi…
One-to-Multiple: A Progressive Style Transfer Unsupervised Domain-Adaptive Framework for Kidney Tumor Segmentation
·2746 words·13 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏒 Xiangtan University
PSTUDA, a novel progressive style transfer framework, efficiently segments kidney tumors across multiple MRI sequences using unsupervised domain adaptation, achieving higher accuracy and efficiency th…
Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli
·2181 words·11 mins· loading · loading
Computer Vision Image Segmentation 🏒 University of Tübingen
Neuroscience-inspired motion energy processing enables human-like zero-shot generalization in figure-ground segmentation, outperforming deep learning models on random dot stimuli.
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning
·1904 words·9 mins· loading · loading
Computer Vision Image Segmentation 🏒 Tencent Youtu Lab
MetaUAS achieves universal visual anomaly segmentation using only one normal image prompt via a pure vision model, surpassing previous zero-shot, few-shot, and full-shot methods.
MaskFactory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation
·1960 words·10 mins· loading · loading
Computer Vision Image Segmentation 🏒 Zhejiang University
MaskFactory generates high-quality synthetic data for dichotomous image segmentation, improving model training efficiency and accuracy.
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation
·3232 words·16 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏒 Huazhong University of Science and Technology
Lightweight Frequency Masker significantly improves cross-domain few-shot semantic segmentation by cleverly filtering frequency components of images, thereby reducing inter-channel correlation and enh…
Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation
·3134 words·15 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏒 School of Electronic Engineering and Computer Science, Queen Mary University of London
ProMaC leverages MLLM hallucinations in an iterative framework to generate precise prompts for accurate object segmentation, minimizing manual prompt dependency.
Learning Segmentation from Point Trajectories
·2329 words·11 mins· loading · loading
Image Segmentation 🏒 University of Oxford
This paper introduces a novel unsupervised video object segmentation method using long-term point trajectories and optical flow, outperforming prior art by effectively combining sparse, long-term moti…
Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Segmentation
·2199 words·11 mins· loading · loading
Computer Vision Image Segmentation 🏒 Westlake University
FADA: a novel frequency-adapted learning scheme boosts domain-generalized semantic segmentation by decoupling style and content using Haar wavelets, achieving state-of-the-art results.
Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation
·2210 words·11 mins· loading · loading
Computer Vision Image Segmentation 🏒 UC Los Angeles
Deep metric learning and Coreset integration enables efficient slice-based active learning for 3D medical segmentation, surpassing existing methods in performance with low annotation budgets.
Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection
·2484 words·12 mins· loading · loading
Computer Vision Image Segmentation 🏒 School of Computer Science, Fudan University
Active Viewpoint Selection (AVS) significantly improves viewpoint-independent object-centric representations by actively selecting the most informative viewpoints for each scene, leading to better seg…
Hybrid Mamba for Few-Shot Segmentation
·2385 words·12 mins· loading · loading
Computer Vision Image Segmentation 🏒 Nanyang Technological University
Hybrid Mamba Network (HMNet) boosts few-shot segmentation accuracy by efficiently fusing support and query features using a novel hybrid Mamba architecture, significantly outperforming current state-o…
Historical Test-time Prompt Tuning for Vision Foundation Models
·2286 words·11 mins· loading · loading
Computer Vision Image Segmentation 🏒 Nanyang Technological University
HisTPT: Historical Test-Time Prompt Tuning memorizes past learning, enabling robust online prompt adaptation for vision models, overcoming performance degradation in continuously changing data streams…
GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs
·2370 words·12 mins· loading · loading
Computer Vision Image Segmentation 🏒 Peking University
GraphMorph: revolutionizing tubular structure extraction by morphing predicted graphs for superior topological accuracy.
Geometric Exploitation for Indoor Panoramic Semantic Segmentation
·3017 words·15 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏒 MAXST
Boosting indoor panoramic semantic segmentation, a new approach leverages geometric properties to optimize over- and under-sampled image segments for improved accuracy and robustness.
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
·3159 words·15 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏒 ShanghaiTech University
This research presents a novel method for robust semantic segmentation, achieving state-of-the-art results by generating coherent images with both semantic and covariate shifts and recalibrating uncer…
GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting
·3939 words·19 mins· loading · loading
AI Generated Computer Vision Image Segmentation 🏒 University of Toronto
GaussianCut enables intuitive 3D object selection via graph cuts on 3D Gaussian splatting, achieving competitive segmentation without extra training.
Flaws can be Applause: Unleashing Potential of Segmenting Ambiguous Objects in SAM
·2042 words·10 mins· loading · loading
Computer Vision Image Segmentation 🏒 Chinese University of Hong Kong
A-SAM: Turning SAM’s inherent ambiguity into an advantage for controllable, diverse, and convincing ambiguous object segmentation.