Skip to main content

🏢 Samsung AI Cambridge

QBB: Quantization with Binary Bases for LLMs
·1816 words·9 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Samsung AI Cambridge
QBB: A novel post-training quantization method for LLMs dramatically improves efficiency by replacing multiplications with summations, achieving state-of-the-art results with minimal accuracy loss.