

Discovering Preference Optimization Algorithms with and for Large Language Models
·4948 words·24 mins
AI Generated · Natural Language Processing · Large Language Models · 🏢 Sakana AI
LLMs discover novel offline preference optimization algorithms, achieving state-of-the-art performance on various tasks.