Skip to main content

🏢 SHI Labs @ Georgia Tech

Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level
·2848 words·14 mins· loading · loading
AI Generated Computer Vision Image Classification 🏢 SHI Labs @ Georgia Tech
This research dramatically accelerates neighborhood attention, a cost-effective self-attention mechanism, through novel GEMM-based and fused kernel implementations, boosting performance by up to 1759%…