Skip to main content

🏢 Meituan Group

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity
·2929 words·14 mins· loading · loading
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Meituan Group
SampleMix: Sample-wise Pre-training Data Mixing by Coordinating Data Quality and Diversity