Image Classification

MambaTree: Tree Topology is All You Need in State Space Model

26 September 2024·1962 words·10 mins· loading · loading

Image Classification 🏢 Tsinghua Shenzhen International Graduate School

MambaTree: A novel tree-topology-based state space model surpasses existing methods by dynamically generating input-aware topologies for enhanced long-range dependencies in vision and language.

LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate

26 September 2024·4927 words·24 mins· loading · loading

Computer Vision Image Classification 🏢 Carleton University

LookHere: Vision Transformers excel at high-resolution image classification by using 2D attention masks to direct attention heads, improving generalization and extrapolation.

Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale Models

26 September 2024·2125 words·10 mins· loading · loading

Computer Vision Image Classification 🏢 School of Computer Science and Engineering, Southeast University

Linearly decompose & recompose Vision Transformers to create diverse-scale models efficiently, reducing computational costs & improving flexibility for various applications.

Learning Where to Edit Vision Transformers

26 September 2024·3346 words·16 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 City University of Hong Kong

Meta-learning a hypernetwork on CutMix-augmented data enables data-efficient and precise correction of vision transformer errors by identifying optimal parameters for fine-tuning.

Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers

26 September 2024·3286 words·16 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 KAIST

Decoupled Token Embedding for Merging (DTEM) significantly improves Vision Transformer efficiency by using a decoupled embedding module for relaxed token merging, achieving consistent performance gain…

Learning Low-Rank Feature for Thorax Disease Classification

26 September 2024·3584 words·17 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 School of Computing and Augmented Intelligence, Arizona State University

Low-Rank Feature Learning (LRFL) significantly boosts thorax disease classification accuracy by reducing noise and background interference in medical images.

Learning from Offline Foundation Features with Tensor Augmentations

26 September 2024·1797 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 KTH Royal Institute of Technology

LOFF-TA leverages offline foundation model features and tensor augmentations for efficient, resource-light training, achieving up to 37x faster training and 26x less GPU memory usage.

Learning Bregman Divergences with Application to Robustness

26 September 2024·2210 words·11 mins· loading · loading

Computer Vision Image Classification 🏢 ETH Zurich

Learned Bregman divergences significantly improve image corruption robustness in adversarial training.

L-TTA: Lightweight Test-Time Adaptation Using a Versatile Stem Layer

26 September 2024·2871 words·14 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Seoul National University of Science and Technology

L-TTA: A lightweight test-time adaptation method using a versatile stem layer minimizes channel-wise uncertainty for rapid and memory-efficient adaptation to new domains.

Interpretable Image Classification with Adaptive Prototype-based Vision Transformers

26 September 2024·5008 words·24 mins· loading · loading

Computer Vision Image Classification 🏢 Dartmouth College

ProtoViT: a novel interpretable image classification method using Vision Transformers and adaptive prototypes, achieving higher accuracy and providing clear explanations.

Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification

26 September 2024·3714 words·18 mins· loading · loading

Image Classification 🏢 Xi'an-Jiaotong Liverpool University

This paper introduces L-Reg, a novel logical regularization technique, to improve generalization in visual classification. L-Reg effectively reduces model complexity and improves interpretability by f…

Initializing Variable-sized Vision Transformers from Learngene with Learnable Transformation

26 September 2024·2536 words·12 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 School of Computer Science and Engineering, Southeast University

LeTs: Learnable Transformation efficiently initializes variable-sized Vision Transformers by learning adaptable transformations from a compact learngene module, outperforming from-scratch training.

Infinite-Dimensional Feature Interaction

26 September 2024·1877 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 Peking University

InfiNet achieves state-of-the-art results by enabling feature interaction in an infinite-dimensional space using RBF kernels, surpassing models limited to finite-dimensional interactions.

In Pursuit of Causal Label Correlations for Multi-label Image Recognition

26 September 2024·2377 words·12 mins· loading · loading

Computer Vision Image Classification 🏢 Wenzhou University

This research leverages causal intervention to identify and utilize genuine label correlations in multi-label image recognition, mitigating contextual bias for improved accuracy.

Improving robustness to corruptions with multiplicative weight perturbations

26 September 2024·1713 words·9 mins· loading · loading

Image Classification 🏢 Aalto University

Boost DNN robustness to corruptions without sacrificing clean image accuracy using Data Augmentation via Multiplicative Perturbations (DAMP)!

HydraViT: Stacking Heads for a Scalable ViT

26 September 2024·2612 words·13 mins· loading · loading

Computer Vision Image Classification 🏢 Kiel University

HydraViT: Stacking attention heads creates a scalable Vision Transformer, adapting to diverse hardware by dynamically selecting subnetworks during inference, improving accuracy and efficiency.

Hierarchical Selective Classification

26 September 2024·2174 words·11 mins· loading · loading

Computer Vision Image Classification 🏢 Technion

Hierarchical Selective Classification (HSC) improves deep learning model reliability for risk-sensitive tasks by leveraging hierarchical class relationships to provide more informative predictions eve…

Happy: A Debiased Learning Framework for Continual Generalized Category Discovery

26 September 2024·2362 words·12 mins· loading · loading

Computer Vision Image Classification 🏢 Institute of Automation, Chinese Academy of Sciences

Happy: a novel debiased learning framework, excels at continually discovering new categories from unlabeled data while retaining knowledge of previously learned ones, overcoming existing bias issues a…

Geometric Analysis of Nonlinear Manifold Clustering

26 September 2024·1790 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 Lehigh University

Guaranteed Manifold Clustering: Novel method provides geometric conditions ensuring accurate data grouping from nonlinear manifolds, showing competitive performance on CIFAR datasets.

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery

26 September 2024·3153 words·15 mins· loading · loading

Image Classification 🏢 Xi'an Jiaotong University

FlipClass dynamically updates the teacher model in a teacher-student framework to align with the student’s attention, resolving learning inconsistencies and significantly improving generalized categor…