Skip to main content

🏢 Kiel University

HydraViT: Stacking Heads for a Scalable ViT
·2612 words·13 mins· loading · loading
Computer Vision Image Classification 🏢 Kiel University
HydraViT: Stacking attention heads creates a scalable Vision Transformer, adapting to diverse hardware by dynamically selecting subnetworks during inference, improving accuracy and efficiency.