🏢 Google DeepMind
Deliberation in Latent Space via Differentiable Cache Augmentation
·3569 words·17 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Google DeepMind
Frozen LLMs get a performance boost by augmenting their key-value cache with latent embeddings generated by a differentiable offline coprocessor.
Revisiting In-Context Learning with Long Context Language Models
·4377 words·21 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Google DeepMind
Long-context models surprisingly show that simple random sampling of examples is as effective as sophisticated methods for in-context learning, shifting the focus to efficient context utilization.
LearnLM: Improving Gemini for Learning
·4335 words·21 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Education
🏢 Google DeepMind
LearnLM enhances Gemini for education by training it to follow pedagogical instructions, leading to significant preference improvements over GPT-40, Claude 3.5, and Gemini 1.5 Pro in diverse learning …
PaliGemma 2: A Family of Versatile VLMs for Transfer
·6035 words·29 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Google DeepMind
PaliGemma 2: A family of versatile, open-weight VLMs achieving state-of-the-art results on various transfer tasks by scaling model size and resolution.
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
·3896 words·19 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
3D Vision
🏢 Google DeepMind
CAT4D: Create realistic 4D scenes from single-view videos using a novel multi-view video diffusion model.