🏢 University at Albany, SUNY
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
1848 words · 9 mins
AI Generated
Natural Language Processing
Large Language Models
🏢 University at Albany, SUNY
MagR is a preprocessing technique that improves post-training quantization of LLMs by reducing weight magnitudes, adds no inference overhead, and achieves state-of-the-art performance.
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
2398 words · 12 mins
Natural Language Processing
Large Language Models
🏢 University at Albany, SUNY
CoMERA achieves 2-3x faster AI model training via rank-adaptive tensor optimization, significantly improving both computing and memory efficiency.