🏢 Scale AI
Learning Goal-Conditioned Representations for Language Reward Models
·3372 words·16 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Scale AI
Goal-conditioned contrastive learning boosts language reward model performance and enables better control of language model generation.