Image Classification
MambaTree: Tree Topology is All You Need in State Space Model
·1962 words·10 mins·
loading
·
loading
Image Classification
🏢 Tsinghua Shenzhen International Graduate School
MambaTree: A novel tree-topology-based state space model surpasses existing methods by dynamically generating input-aware topologies for enhanced long-range dependencies in vision and language.
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
·4927 words·24 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Carleton University
LookHere: Vision Transformers excel at high-resolution image classification by using 2D attention masks to direct attention heads, improving generalization and extrapolation.
Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale Models
·2125 words·10 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 School of Computer Science and Engineering, Southeast University
Linearly decompose & recompose Vision Transformers to create diverse-scale models efficiently, reducing computational costs & improving flexibility for various applications.
Learning Where to Edit Vision Transformers
·3346 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 City University of Hong Kong
Meta-learning a hypernetwork on CutMix-augmented data enables data-efficient and precise correction of vision transformer errors by identifying optimal parameters for fine-tuning.
Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers
·3286 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 KAIST
Decoupled Token Embedding for Merging (DTEM) significantly improves Vision Transformer efficiency by using a decoupled embedding module for relaxed token merging, achieving consistent performance gain…
Learning Low-Rank Feature for Thorax Disease Classification
·3584 words·17 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 School of Computing and Augmented Intelligence, Arizona State University
Low-Rank Feature Learning (LRFL) significantly boosts thorax disease classification accuracy by reducing noise and background interference in medical images.
Learning from Offline Foundation Features with Tensor Augmentations
·1797 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 KTH Royal Institute of Technology
LOFF-TA leverages offline foundation model features and tensor augmentations for efficient, resource-light training, achieving up to 37x faster training and 26x less GPU memory usage.
Learning Bregman Divergences with Application to Robustness
·2210 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 ETH Zurich
Learned Bregman divergences significantly improve image corruption robustness in adversarial training.
L-TTA: Lightweight Test-Time Adaptation Using a Versatile Stem Layer
·2871 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 Seoul National University of Science and Technology
L-TTA: A lightweight test-time adaptation method using a versatile stem layer minimizes channel-wise uncertainty for rapid and memory-efficient adaptation to new domains.
Interpretable Image Classification with Adaptive Prototype-based Vision Transformers
·5008 words·24 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Dartmouth College
ProtoViT: a novel interpretable image classification method using Vision Transformers and adaptive prototypes, achieving higher accuracy and providing clear explanations.
Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification
·3714 words·18 mins·
loading
·
loading
Image Classification
🏢 Xi'an-Jiaotong Liverpool University
This paper introduces L-Reg, a novel logical regularization technique, to improve generalization in visual classification. L-Reg effectively reduces model complexity and improves interpretability by f…
Initializing Variable-sized Vision Transformers from Learngene with Learnable Transformation
·2536 words·12 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 School of Computer Science and Engineering, Southeast University
LeTs: Learnable Transformation efficiently initializes variable-sized Vision Transformers by learning adaptable transformations from a compact learngene module, outperforming from-scratch training.
Infinite-Dimensional Feature Interaction
·1877 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Peking University
InfiNet achieves state-of-the-art results by enabling feature interaction in an infinite-dimensional space using RBF kernels, surpassing models limited to finite-dimensional interactions.
In Pursuit of Causal Label Correlations for Multi-label Image Recognition
·2377 words·12 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Wenzhou University
This research leverages causal intervention to identify and utilize genuine label correlations in multi-label image recognition, mitigating contextual bias for improved accuracy.
Improving robustness to corruptions with multiplicative weight perturbations
·1713 words·9 mins·
loading
·
loading
Image Classification
🏢 Aalto University
Boost DNN robustness to corruptions without sacrificing clean image accuracy using Data Augmentation via Multiplicative Perturbations (DAMP)!
HydraViT: Stacking Heads for a Scalable ViT
·2612 words·13 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Kiel University
HydraViT: Stacking attention heads creates a scalable Vision Transformer, adapting to diverse hardware by dynamically selecting subnetworks during inference, improving accuracy and efficiency.
Hierarchical Selective Classification
·2174 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Technion
Hierarchical Selective Classification (HSC) improves deep learning model reliability for risk-sensitive tasks by leveraging hierarchical class relationships to provide more informative predictions eve…
Happy: A Debiased Learning Framework for Continual Generalized Category Discovery
·2362 words·12 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Institute of Automation, Chinese Academy of Sciences
Happy: a novel debiased learning framework, excels at continually discovering new categories from unlabeled data while retaining knowledge of previously learned ones, overcoming existing bias issues a…
Geometric Analysis of Nonlinear Manifold Clustering
·1790 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Lehigh University
Guaranteed Manifold Clustering: Novel method provides geometric conditions ensuring accurate data grouping from nonlinear manifolds, showing competitive performance on CIFAR datasets.
Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
·3153 words·15 mins·
loading
·
loading
Image Classification
🏢 Xi'an Jiaotong University
FlipClass dynamically updates the teacher model in a teacher-student framework to align with the student’s attention, resolving learning inconsistencies and significantly improving generalized categor…