
🏢 Nanjing University of Aeronautics and Astronautics

SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers
2117 words · 10 mins
AI Generated · 🤗 Daily Papers · Machine Learning · Deep Learning · 🏢 Nanjing University of Aeronautics and Astronautics
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong
1926 words · 10 mins
AI Generated · 🤗 Daily Papers · Natural Language Processing · Large Language Models · 🏢 Nanjing University of Aeronautics and Astronautics
LLM reasoning boosts self-confidence, even when answers are wrong, highlighting limitations in current evaluation metrics.