🏢 Uppsala University
Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data
·3848 words·19 mins·
AI Theory
Causality
🏢 Uppsala University
This paper introduces a novel nonparametric method to make policy evaluations from randomized trials externally valid, even when trial and target populations differ. It leverages additional covariate…
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
·3343 words·16 mins·
AI Generated
Machine Learning
Reinforcement Learning
🏢 Uppsala University
An entropy-regularized diffusion policy with Q-ensembles achieves state-of-the-art performance in offline reinforcement learning by tackling overestimation of Q-values and boosting exploration.