↓Skip to main content

🏢 University of Southampton

Variational Delayed Policy Optimization

26 September 2024·1922 words·10 mins· loading · loading

Reinforcement Learning 🏢 University of Southampton

VDPO: A novel framework for delayed reinforcement learning achieving 50% sample efficiency improvement without compromising performance.

Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication

26 September 2024·1851 words·9 mins· loading · loading

Natural Language Processing Dialogue Systems 🏢 University of Southampton

AI agents developed a communication system using spatial relationships, achieving over 90% accuracy in conveying relative positions of objects within a scene.

Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz Constraints

26 September 2024·2740 words·13 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Southampton

Stable algorithm learning achieved by Deep Thinking networks with Lipschitz Constraints, ensuring convergence and better extrapolation to complex problems.