Skip to main content

🏢 Yale University

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving
·2247 words·11 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Education 🏢 Yale University
PHYSICS: A new benchmark reveals foundation models struggle with university-level physics, highlighting needs for improved reasoning and knowledge integration.
MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
·2082 words·10 mins· loading · loading
AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 Yale University
MCTS-RAG: Combines Monte Carlo Tree Search with Retrieval-Augmented Generation to enhance small LMs’ reasoning on complex tasks.
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
·2900 words·14 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 Yale University
MEDAGENTSBENCH: a new benchmark for assessing complex medical reasoning in LLMs, revealing performance gaps and cost-effective strategies.
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
·2347 words·12 mins· loading · loading
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Yale University
SynthLight: A novel diffusion model relights portraits realistically by learning to re-render synthetic faces, generalizing remarkably well to real photographs.