Skip to main content

🏢 Tongyi Lab

AlphaMath Almost Zero: Process Supervision without Process
·2731 words·13 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Tongyi Lab
AlphaMath: LLMs excel at math reasoning without human-annotated process supervision, using Monte Carlo Tree Search.