Image Classification
Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level
·2848 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 SHI Labs @ Georgia Tech
This research dramatically accelerates neighborhood attention, a cost-effective self-attention mechanism, through novel GEMM-based and fused kernel implementations, boosting performance by up to 1759%…
F-OAL: Forward-only Online Analytic Learning with Fast Training and Low Memory Footprint in Class Incremental Learning
·1519 words·8 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 South China University of Technology
F-OAL: Forward-only Online Analytic Learning achieves high accuracy and low memory usage in online class incremental learning by using a frozen encoder and recursive least squares to update a linear …
Exploring Token Pruning in Vision State Space Models
·1749 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Northeastern University
This paper introduces a novel token pruning method for vision state space models, achieving significant computational reduction with minimal performance impact, addressing the limitations of directly …
Enhancing Feature Diversity Boosts Channel-Adaptive Vision Transformers
·3757 words·18 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 Boston University
DiChaViT boosts channel-adaptive vision transformers by enhancing feature diversity, yielding a 1.5-5% accuracy gain over state-of-the-art MCI models on diverse datasets.
EMR-Merging: Tuning-Free High-Performance Model Merging
·3173 words·15 mins·
loading
·
loading
Image Classification
🏢 Fudan University
EMR-MERGING: A tuning-free model merging technique achieves high performance by electing a unified model and generating lightweight task-specific modulators, eliminating the need for additional data …
Elucidating the Design Space of Dataset Condensation
·4063 words·20 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Tsinghua University
Elucidating Dataset Condensation (EDC) achieves state-of-the-art accuracy in dataset condensation by implementing soft category-aware matching and a smoothing learning rate schedule, improving model t…
Efficient Lifelong Model Evaluation in an Era of Rapid Progress
·2830 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 University of Cambridge
Sort & Search: 1000x faster lifelong model evaluation!
Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation
·1907 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 College of Information and Control Engineering, Xi'an University of Architecture and Technology
Boosting Vision Transformer adaptation! Householder Transformation-based Adaptor (HTA) outperforms existing methods by dynamically adjusting adaptation matrix ranks across layers, improving efficiency…
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation
·3048 words·15 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 National University of Singapore
Dynamic Tuning (DyT) significantly boosts Vision Transformer (ViT) adaptation by dynamically skipping less important tokens during inference, achieving superior performance with 71% fewer FLOPs than e…
Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight Adjustment
·2093 words·10 mins·
loading
·
loading
Image Classification
🏢 Agency for Science, Technology and Research (A*STAR)
Boosting dataset distillation, a new method, Diversity-Driven Synthesis, uses directed weight adjustment to create diverse, representative synthetic datasets, improving model performance while reducin…
Distribution-Aware Data Expansion with Diffusion Models
·3351 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 Tsinghua University
DistDiff, a training-free data expansion framework, leverages distribution-aware diffusion models to generate high-fidelity, diverse samples that enhance downstream model performance.
DiffuLT: Diffusion for Long-tail Recognition Without External Knowledge
·2601 words·13 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 National Key Laboratory for Novel Software Technology, Nanjing University
DiffuLT uses a novel diffusion model to generate balanced training data from imbalanced datasets, achieving state-of-the-art results in long-tailed image recognition without external knowledge.
DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators
·2183 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Nokia Bell Labs
DEX boosts CNN accuracy on tiny AI accelerators by 3.5%p, utilizing unused memory and processors to extend input channels without increasing latency.
DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms
·2446 words·12 mins·
loading
·
loading
Image Classification
🏢 Ben-Gurion University of the Negev
DeSparsify: A stealthy adversarial attack exhausts vision transformer resources by exploiting token sparsification mechanisms’ dynamic nature, highlighting the need for improved resource management i…
DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
·2917 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 Samsung Electronics
DEPrune: A novel GPU-optimized pruning method for depthwise separable convolutions, achieving up to 3.74x speedup on EfficientNet-B0 with no accuracy loss!
DenoiseRep: Denoising Model for Representation Learning
·1739 words·9 mins·
loading
·
loading
Image Classification
🏢 Beijing Jiaotong University
DenoiseRep: A novel denoising model enhances feature discrimination in computer vision tasks by integrating feature extraction and denoising within a single backbone, achieving impressive improvements…
Demystify Mamba in Vision: A Linear Attention Perspective
·2184 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Tsinghua University
Vision’s Mamba model demystified: Researchers unveil its surprising link to linear attention, improving efficiency and accuracy through design enhancements.
Decoupled Kullback-Leibler Divergence Loss
·2254 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 the Chinese University of Hong Kong
Improved Kullback-Leibler (IKL) divergence loss achieves state-of-the-art adversarial robustness and competitive knowledge distillation performance by addressing KL loss’s limitations.
Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
·1703 words·8 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Gwangju Institute of Science and Technology
CUFIT: a novel curriculum fine-tuning paradigm significantly improves medical image classification accuracy despite noisy labels by leveraging pre-trained Vision Foundation Models.
Constructing Semantics-Aware Adversarial Examples with Probabilistic Perspective
·1825 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 University of Cambridge
Researchers developed semantics-aware adversarial examples using a probabilistic approach, achieving higher success rates in bypassing defenses while remaining undetectable to humans.