🏢 Microsoft Research
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
·2445 words·12 mins
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Microsoft Research
LLM2CLIP boosts CLIP’s performance by integrating LLMs, enabling it to understand longer, more complex image captions and achieve state-of-the-art results across various benchmarks.
BitNet a4.8: 4-bit Activations for 1-bit LLMs
·2844 words·14 mins
AI Generated
Natural Language Processing
Large Language Models
🏢 Microsoft Research
BitNet a4.8 matches the performance of existing 1-bit LLMs while offering significantly faster inference, using a hybrid quantization and sparsification strategy to enable 4-bit activations.