🏢 Tongyi Lab
AlphaMath Almost Zero: Process Supervision without Process
·2731 words·13 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Tongyi Lab
AlphaMath: LLMs excel at math reasoning without human-annotated process supervision, using Monte Carlo Tree Search.