🏢 Max Planck Institute for Software Systems
Prediction-Powered Ranking of Large Language Models
·4368 words·21 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 Max Planck Institute for Software Systems
This paper presents a novel statistical framework for ranking LLMs using pairwise comparisons, accounting for the uncertainty introduced when using an LLM instead of human preferences. The framework …
Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets
·2473 words·12 mins·
loading
·
loading
AI Theory
Causality
🏢 Max Planck Institute for Software Systems
AI decision support systems can unintentionally harm users; this paper introduces a novel framework to design systems that minimize this counterfactual harm, balancing accuracy and user well-being.