↓Skip to main content

🏢 Max Planck Institute for Software Systems

Prediction-Powered Ranking of Large Language Models

26 September 2024·4368 words·21 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Max Planck Institute for Software Systems

This paper presents a novel statistical framework for ranking LLMs using pairwise comparisons, accounting for the uncertainty introduced when using an LLM instead of human preferences. The framework …

Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets

26 September 2024·2473 words·12 mins· loading · loading

AI Theory Causality 🏢 Max Planck Institute for Software Systems

AI decision support systems can unintentionally harm users; this paper introduces a novel framework to design systems that minimize this counterfactual harm, balancing accuracy and user well-being.