Skip to main content

🏢 Nanjing University of Aeronautics and Astronautics

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong
·1926 words·10 mins· loading · loading
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Nanjing University of Aeronautics and Astronautics
LLM reasoning boosts self-confidence, even when answers are wrong, highlighting limitations in current evaluation metrics.