Robustness
Why Do Multi-Agent LLM Systems Fail?
·2168 words·11 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Theory
Robustness
🏢 UC Berkeley
Multi-Agent Systems (MAS) often underperform despite enthusiasm. This paper analyzes 5 popular frameworks across 150+ tasks, identifying 14 failure modes categorized into specification/design, inter-a…
Group-robust Machine Unlearning
·7203 words·34 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Theory
Robustness
🏢 University of Trento
Group-robust machine unlearning via MIU reduces perf. degradation in dominant groups after unlearning, preserving model robustness without compromising accuracy.
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
·2433 words·12 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Theory
Robustness
🏢 M-a-P
CodeCriticBench: A new benchmark for holistic code critique by Large Language Models.
Evolution and The Knightian Blindspot of Machine Learning
·2850 words·14 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Theory
Robustness
🏢 Second Nature AI
Machine learning overlooks robustness to an unknowable future; this paper contrasts reinforcement learning with biological evolution, revealing that ML’s formalisms limit engagement with unknown unkno…