🏢 SCB 10X R&D
Typhoon T1: An Open Thai Reasoning Model
·3148 words·15 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 SCB 10X R&D
Typhoon T1, an open Thai reasoning model, improves performance on complex tasks by generating long chains of thought. The detailed methodology and open-source resources are provided.
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging
·3494 words·17 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 SCB 10X R&D
Low-resource language LLMs gain strong reasoning abilities by merging with a high-resource reasoning model, achieving performance comparable to state-of-the-art models while maintaining target languag…