Skip to main content

🏢 Sea AI Lab

Sample-Efficient Alignment for LLMs
·2536 words·12 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Sea AI Lab
Sample-efficient LLM alignment achieved via a novel Thompson sampling algorithm (SEA), outperforming existing methods.