Image Classification

Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level

26 September 2024·2848 words·14 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 SHI Labs @ Georgia Tech

This research dramatically accelerates neighborhood attention, a cost-effective self-attention mechanism, through novel GEMM-based and fused kernel implementations, boosting performance by up to 1759%…

F-OAL: Forward-only Online Analytic Learning with Fast Training and Low Memory Footprint in Class Incremental Learning

26 September 2024·1519 words·8 mins· loading · loading

Computer Vision Image Classification 🏢 South China University of Technology

F-OAL: Forward-only Online Analytic Learning achieves high accuracy and low memory usage in online class incremental learning by using a frozen encoder and recursive least squares to update a linear …

Exploring Token Pruning in Vision State Space Models

26 September 2024·1749 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 Northeastern University

This paper introduces a novel token pruning method for vision state space models, achieving significant computational reduction with minimal performance impact, addressing the limitations of directly …

Enhancing Feature Diversity Boosts Channel-Adaptive Vision Transformers

26 September 2024·3757 words·18 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Boston University

DiChaViT boosts channel-adaptive vision transformers by enhancing feature diversity, yielding a 1.5-5% accuracy gain over state-of-the-art MCI models on diverse datasets.

EMR-Merging: Tuning-Free High-Performance Model Merging

26 September 2024·3173 words·15 mins· loading · loading

Image Classification 🏢 Fudan University

EMR-MERGING: A tuning-free model merging technique achieves high performance by electing a unified model and generating lightweight task-specific modulators, eliminating the need for additional data …

Elucidating the Design Space of Dataset Condensation

26 September 2024·4063 words·20 mins· loading · loading

Computer Vision Image Classification 🏢 Tsinghua University

Elucidating Dataset Condensation (EDC) achieves state-of-the-art accuracy in dataset condensation by implementing soft category-aware matching and a smoothing learning rate schedule, improving model t…

Efficient Lifelong Model Evaluation in an Era of Rapid Progress

26 September 2024·2830 words·14 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 University of Cambridge

Sort & Search: 1000x faster lifelong model evaluation!

Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation

26 September 2024·1907 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 College of Information and Control Engineering, Xi'an University of Architecture and Technology

Boosting Vision Transformer adaptation! Householder Transformation-based Adaptor (HTA) outperforms existing methods by dynamically adjusting adaptation matrix ranks across layers, improving efficiency…

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

26 September 2024·3048 words·15 mins· loading · loading

Computer Vision Image Classification 🏢 National University of Singapore

Dynamic Tuning (DyT) significantly boosts Vision Transformer (ViT) adaptation by dynamically skipping less important tokens during inference, achieving superior performance with 71% fewer FLOPs than e…

Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight Adjustment

26 September 2024·2093 words·10 mins· loading · loading

Image Classification 🏢 Agency for Science, Technology and Research (A*STAR)

Boosting dataset distillation, a new method, Diversity-Driven Synthesis, uses directed weight adjustment to create diverse, representative synthetic datasets, improving model performance while reducin…

Distribution-Aware Data Expansion with Diffusion Models

26 September 2024·3351 words·16 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Tsinghua University

DistDiff, a training-free data expansion framework, leverages distribution-aware diffusion models to generate high-fidelity, diverse samples that enhance downstream model performance.

DiffuLT: Diffusion for Long-tail Recognition Without External Knowledge

26 September 2024·2601 words·13 mins· loading · loading

Computer Vision Image Classification 🏢 National Key Laboratory for Novel Software Technology, Nanjing University

DiffuLT uses a novel diffusion model to generate balanced training data from imbalanced datasets, achieving state-of-the-art results in long-tailed image recognition without external knowledge.

DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators

26 September 2024·2183 words·11 mins· loading · loading

Computer Vision Image Classification 🏢 Nokia Bell Labs

DEX boosts CNN accuracy on tiny AI accelerators by 3.5%p, utilizing unused memory and processors to extend input channels without increasing latency.

DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms

26 September 2024·2446 words·12 mins· loading · loading

Image Classification 🏢 Ben-Gurion University of the Negev

DeSparsify: A stealthy adversarial attack exhausts vision transformer resources by exploiting token sparsification mechanisms’ dynamic nature, highlighting the need for improved resource management i…

DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism

26 September 2024·2917 words·14 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Samsung Electronics

DEPrune: A novel GPU-optimized pruning method for depthwise separable convolutions, achieving up to 3.74x speedup on EfficientNet-B0 with no accuracy loss!

DenoiseRep: Denoising Model for Representation Learning

26 September 2024·1739 words·9 mins· loading · loading

Image Classification 🏢 Beijing Jiaotong University

DenoiseRep: A novel denoising model enhances feature discrimination in computer vision tasks by integrating feature extraction and denoising within a single backbone, achieving impressive improvements…

Demystify Mamba in Vision: A Linear Attention Perspective

26 September 2024·2184 words·11 mins· loading · loading

Computer Vision Image Classification 🏢 Tsinghua University

Vision’s Mamba model demystified: Researchers unveil its surprising link to linear attention, improving efficiency and accuracy through design enhancements.

Decoupled Kullback-Leibler Divergence Loss

26 September 2024·2254 words·11 mins· loading · loading

Computer Vision Image Classification 🏢 the Chinese University of Hong Kong

Improved Kullback-Leibler (IKL) divergence loss achieves state-of-the-art adversarial robustness and competitive knowledge distillation performance by addressing KL loss’s limitations.

Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise

26 September 2024·1703 words·8 mins· loading · loading

Computer Vision Image Classification 🏢 Gwangju Institute of Science and Technology

CUFIT: a novel curriculum fine-tuning paradigm significantly improves medical image classification accuracy despite noisy labels by leveraging pre-trained Vision Foundation Models.

Constructing Semantics-Aware Adversarial Examples with Probabilistic Perspective

26 September 2024·1825 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 University of Cambridge

Researchers developed semantics-aware adversarial examples using a probabilistic approach, achieving higher success rates in bypassing defenses while remaining undetectable to humans.