↓Skip to main content

🏢 Yale University

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving

26 March 2025·2247 words·11 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Education 🏢 Yale University

PHYSICS: A new benchmark reveals foundation models struggle with university-level physics, highlighting needs for improved reasoning and knowledge integration.

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search

26 March 2025·2082 words·10 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 Yale University

MCTS-RAG: Combines Monte Carlo Tree Search with Retrieval-Augmented Generation to enhance small LMs’ reasoning on complex tasks.

MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

10 March 2025·2900 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 Yale University

MEDAGENTSBENCH: a new benchmark for assessing complex medical reasoning in LLMs, revealing performance gaps and cost-effective strategies.

SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces

16 January 2025·2347 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Yale University

SynthLight: A novel diffusion model relights portraits realistically by learning to re-render synthetic faces, generalizing remarkably well to real photographs.