🏢 Google DeepMind

Deliberation in Latent Space via Differentiable Cache Augmentation
·3569 words·17 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Google DeepMind
Frozen LLMs get a performance boost by augmenting their key-value cache with latent embeddings generated by a differentiable offline coprocessor.
Revisiting In-Context Learning with Long Context Language Models
·4377 words·21 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Google DeepMind
With long-context models, simple random sampling of examples proves as effective as sophisticated selection methods for in-context learning, shifting the focus to efficient context utilization.
LearnLM: Improving Gemini for Learning
·4335 words·21 mins
AI Generated 🤗 Daily Papers AI Applications Education 🏢 Google DeepMind
LearnLM enhances Gemini for education by training it to follow pedagogical instructions, leading to significant preference improvements over GPT-4o, Claude 3.5, and Gemini 1.5 Pro in diverse learning …
PaliGemma 2: A Family of Versatile VLMs for Transfer
·6035 words·29 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Google DeepMind
PaliGemma 2: A family of versatile, open-weight VLMs achieving state-of-the-art results on various transfer tasks by scaling model size and resolution.
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
·3896 words·19 mins
AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 Google DeepMind
CAT4D: Create realistic 4D scenes from single-view videos using a novel multi-view video diffusion model.