🏢 Uppsala University

Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data
3848 words · 19 mins
AI Theory Causality 🏢 Uppsala University
This paper introduces a novel nonparametric method to make policy evaluations from randomized trials externally valid, even when trial and target populations differ. It leverages additional covariate…
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
3343 words · 16 mins
AI Generated Machine Learning Reinforcement Learning 🏢 Uppsala University
An entropy-regularized diffusion policy with Q-ensembles achieves state-of-the-art performance in offline reinforcement learning by mitigating Q-value overestimation and encouraging exploration.