↓Skip to main content

🏢 EPFL, Switzerland

Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces

26 September 2024·1647 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 EPFL, Switzerland

This paper presents randomized algorithms with PAC bounds for solving inverse reinforcement learning problems in continuous state and action spaces, offering robust theoretical guarantees and practica…

Fast Proxy Experiment Design for Causal Effect Identification

26 September 2024·2057 words·10 mins· loading · loading

AI Theory Causality 🏢 EPFL, Switzerland

This paper presents efficient algorithms for designing cost-optimal proxy experiments to identify causal effects, significantly improving upon prior methods.

Analyzing & Reducing the Need for Learning Rate Warmup in GPT Training

26 September 2024·2347 words·12 mins· loading · loading

Natural Language Processing Large Language Models 🏢 EPFL, Switzerland

This study reveals that modifying optimizers to normalize updates based on angular changes and gradient signal-to-noise ratio significantly reduces the need for learning rate warmup in GPT training.