Skip to main content

🏢 Microsoft Research Asia

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
·3688 words·18 mins· loading · loading
AI Generated 🤗 Daily Papers Machine Learning Reinforcement Learning 🏢 Microsoft Research Asia
Logic-RL unlocks LLM reasoning via rule-based reinforcement learning, generalizing to math problems after training on logic puzzles.