↓Skip to main content

Robustness

Why Do Multi-Agent LLM Systems Fail?

17 March 2025·2168 words·11 mins· loading · loading

AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 UC Berkeley

Multi-Agent Systems (MAS) often underperform despite enthusiasm. This paper analyzes 5 popular frameworks across 150+ tasks, identifying 14 failure modes categorized into specification/design, inter-a…

Group-robust Machine Unlearning

12 March 2025·7203 words·34 mins· loading · loading

AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 University of Trento

Group-robust machine unlearning via MIU reduces perf. degradation in dominant groups after unlearning, preserving model robustness without compromising accuracy.

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

23 February 2025·2433 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 M-a-P

CodeCriticBench: A new benchmark for holistic code critique by Large Language Models.

Evolution and The Knightian Blindspot of Machine Learning

22 January 2025·2850 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 Second Nature AI

Machine learning overlooks robustness to an unknowable future; this paper contrasts reinforcement learning with biological evolution, revealing that ML’s formalisms limit engagement with unknown unkno…