Skip to main content

Robustness

Why Do Multi-Agent LLM Systems Fail?
·2168 words·11 mins· loading · loading
AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 UC Berkeley
Multi-Agent Systems (MAS) often underperform despite enthusiasm. This paper analyzes 5 popular frameworks across 150+ tasks, identifying 14 failure modes categorized into specification/design, inter-a…
Group-robust Machine Unlearning
·7203 words·34 mins· loading · loading
AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 University of Trento
Group-robust machine unlearning via MIU reduces perf. degradation in dominant groups after unlearning, preserving model robustness without compromising accuracy.
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
·2433 words·12 mins· loading · loading
AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 M-a-P
CodeCriticBench: A new benchmark for holistic code critique by Large Language Models.
Evolution and The Knightian Blindspot of Machine Learning
·2850 words·14 mins· loading · loading
AI Generated 🤗 Daily Papers AI Theory Robustness 🏢 Second Nature AI
Machine learning overlooks robustness to an unknowable future; this paper contrasts reinforcement learning with biological evolution, revealing that ML’s formalisms limit engagement with unknown unkno…