2025-04-02
2025
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents
·3543 words·17 mins·
AI Generated
🤗 Daily Papers
AI Applications
Robotics
🏢 Simular Research
Agent S2: Compositional generalist-specialist framework for computer use agents.
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base
·2941 words·14 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 University of Southern California
SEA: Stochastic Error Ascent efficiently discovers LLM knowledge gaps, outperforming existing methods in error detection with reduced cost.