
🏢 Max Planck Institute for Software Systems

Prediction-Powered Ranking of Large Language Models
·4368 words·21 mins
AI Generated · Natural Language Processing · Large Language Models · 🏢 Max Planck Institute for Software Systems
This paper presents a novel statistical framework for ranking LLMs from pairwise comparisons, accounting for the uncertainty introduced when an LLM's judgments are used in place of human preferences. The framework …
Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets
·2473 words·12 mins
AI Theory · Causality · 🏢 Max Planck Institute for Software Systems
AI decision support systems can unintentionally harm users; this paper introduces a novel framework to design systems that minimize this counterfactual harm, balancing accuracy and user well-being.