🏢 Center for Artificial Intelligence and Data Science
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch
·3133 words·15 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Center for Artificial Intelligence and Data Science
New German-only LLMs, LLäMmlein 120M & 1B, trained from scratch & openly released, show competitive performance and offer insights into efficient model training.