2025-04-02s

2025

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents
·3543 words·17 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Simular Research
Agent S2: Compositional generalist-specialist framework for computer use agents.
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base
·2941 words·14 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of Southern California
SEA: Stochastic Error Ascent efficiently discovers LLM knowledge gaps, outperforming existing methods in error detection with reduced cost.