🏢 Kiel University
HydraViT: Stacking Heads for a Scalable ViT
·2612 words·13 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Kiel University
HydraViT: Stacking attention heads creates a scalable Vision Transformer, adapting to diverse hardware by dynamically selecting subnetworks during inference, improving accuracy and efficiency.