🏢 Samsung AI Cambridge
QBB: Quantization with Binary Bases for LLMs
·1816 words·9 mins·
Natural Language Processing
Large Language Models
QBB is a novel post-training quantization method for LLMs that improves efficiency by replacing multiplications with summations, achieving state-of-the-art results with minimal accuracy loss.
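To make the idea of trading multiplications for summations concrete, here is a minimal sketch, not the QBB algorithm itself. It assumes a simple greedy residual binarization: a weight vector is approximated as a sum of scaled {-1, +1} vectors, so a dot product needs only signed additions plus one multiplication per base. The function names (`binary_bases`, `dot_with_bases`, `reconstruct`) and the choice of scale (mean absolute residual) are illustrative assumptions, not taken from the paper.

```python
import random


def binary_bases(w, num_bases):
    """Greedily approximate w as sum_k alpha_k * b_k with b_k in {-1, +1}^n.

    Illustrative assumption: each scale alpha_k is the mean absolute value
    of the current residual, and each base b_k is the residual's sign.
    """
    residual = list(w)
    alphas, bases = [], []
    for _ in range(num_bases):
        b = [1 if r >= 0 else -1 for r in residual]
        alpha = sum(abs(r) for r in residual) / len(residual)
        alphas.append(alpha)
        bases.append(b)
        # Subtract what this base explains; the next base fits the remainder.
        residual = [r - alpha * s for r, s in zip(residual, b)]
    return alphas, bases


def dot_with_bases(alphas, bases, x):
    """Compute the approximate dot product w . x using the binary bases.

    The inner loop only adds or subtracts entries of x; the single
    multiplication per base applies the scale alpha_k.
    """
    total = 0.0
    for alpha, b in zip(alphas, bases):
        acc = 0.0
        for s, xi in zip(b, x):
            acc = acc + xi if s > 0 else acc - xi
        total += alpha * acc
    return total


def reconstruct(alphas, bases):
    """Rebuild the dense approximation of w from its scaled binary bases."""
    n = len(bases[0])
    return [sum(a * b[i] for a, b in zip(alphas, bases)) for i in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    w = [rng.gauss(0, 1) for _ in range(64)]
    x = [rng.gauss(0, 1) for _ in range(64)]
    exact = sum(wi * xi for wi, xi in zip(w, x))
    for k in (1, 2, 4):
        alphas, bases = binary_bases(w, k)
        print(k, "bases: exact", exact, "approx", dot_with_bases(alphas, bases, x))
```

With this greedy scheme, each additional base can only reduce the squared reconstruction error, at the cost of one more pass of additions. How QBB actually chooses and refines its bases and scales is described in the paper and is not reproduced here.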