AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO
·402 words·2 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Menlo Research
AlphaMaze improves LLMs’ spatial intelligence by training with Group Relative Policy Optimization (GRPO), reaching 93% accuracy on maze navigation and showing signs of emergent reasoning.