Skip to main content

🏢 Microsoft Research

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
·2445 words·12 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Microsoft Research
LLM2CLIP boosts CLIP’s performance by cleverly integrating LLMs, enabling it to understand longer, more complex image captions and achieving state-of-the-art results across various benchmarks.
BitNet a4.8: 4-bit Activations for 1-bit LLMs
·2844 words·14 mins
AI Generated Natural Language Processing Large Language Models 🏢 Microsoft Research
BitNet a4.8 achieves comparable performance to existing 1-bit LLMs, but with significantly faster inference, by using a hybrid quantization and sparsification strategy for 4-bit activations.