Skip to main content

🏢 Scale AI

Learning Goal-Conditioned Representations for Language Reward Models
·3372 words·16 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Scale AI
Goal-conditioned contrastive learning boosts language reward model performance and enables better control of language model generation.