🏢 SHI Labs @ Georgia Tech
Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level
·2848 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
🏢 SHI Labs @ Georgia Tech
This research dramatically accelerates neighborhood attention, a cost-effective self-attention mechanism, through novel GEMM-based and fused kernel implementations, boosting performance by up to 1759%…