Skip to main content

🏢 University of Southampton

Variational Delayed Policy Optimization
·1922 words·10 mins· loading · loading
Reinforcement Learning 🏢 University of Southampton
VDPO: A novel framework for delayed reinforcement learning achieving 50% sample efficiency improvement without compromising performance.
Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication
·1851 words·9 mins· loading · loading
Natural Language Processing Dialogue Systems 🏢 University of Southampton
AI agents developed a communication system using spatial relationships, achieving over 90% accuracy in conveying relative positions of objects within a scene.
Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz Constraints
·2740 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Southampton
Stable algorithm learning achieved by Deep Thinking networks with Lipschitz Constraints, ensuring convergence and better extrapolation to complex problems.