Reinforcement Learning
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
·2568 words·13 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Boston University
PAGAR: a novel semi-supervised IRL framework prioritizing task alignment over data alignment, leveraging expert demonstrations as weak supervision to derive task-aligned reward functions for improved …
Rethinking Exploration in Reinforcement Learning with Effective Metric-Based Exploration Bonus
·2904 words·14 mins·
loading
·
loading
Reinforcement Learning
🏢 University of Macau
Effective Metric-based Exploration Bonus (EME) enhances reinforcement learning exploration by using a robust metric for state discrepancy and a dynamically adjusted scaling factor based on reward mode…
Relating Hopfield Networks to Episodic Control
·3465 words·17 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Inria Centre of the University of Bordeaux
Neural Episodic Control’s differentiable dictionary is shown to be a Universal Hopfield Network, enabling improved performance and a novel evaluation criterion.
Reinforcement Learning with LTL and ⍵-Regular Objectives via Optimality-Preserving Translation to Average Rewards
·1651 words·8 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 NTU Singapore
Reinforcement learning with complex objectives made easy: This paper introduces an optimality-preserving translation to reduce problems with Linear Temporal Logic (LTL) objectives to standard average …
Reinforcement Learning with Lookahead Information
·333 words·2 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 FairPlay Joint Team, CREST, ENSAE Paris
Provably efficient RL algorithms are designed to utilize immediate reward or transition information, significantly improving reward collection in unknown environments.
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
·2663 words·13 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 University of South Carolina
Boosting RL data efficiency for continuous control, this paper advocates Euclidean data augmentation using limb-based state features, significantly improving performance across various tasks.
Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems
·2792 words·14 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Dyson School of Design Engineering
Safe reinforcement learning is achieved via RL-AR, an algorithm that combines a safe policy with an RL policy using a focus module, ensuring safety during training while achieving competitive performa…
Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic Modularity
·330 words·2 mins·
loading
·
loading
Reinforcement Learning
🏢 University of Michigan
This paper pioneers a modular framework for reinforcement learning, addressing the challenge of learning under complex observations and simpler latent dynamics, offering both statistical and algorithm…
Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer
·2290 words·11 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Nanjing University
Reinforcement learning refines existing macro placements, enhancing chip design by improving power, performance, and area (PPA) metrics and integrating the often-overlooked metric of regularity.
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
·4163 words·20 mins·
loading
·
loading
Reinforcement Learning
🏢 University of Illinois Urbana-Champaign
Boost online finetuning of Decision Transformers by adding TD3 gradients, especially when pretrained with low-reward data.
Regularized Q-Learning
·1497 words·8 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 KAIST
RegQ: A novel regularized Q-learning algorithm ensures convergence with linear function approximation, solving a long-standing instability problem in reinforcement learning.
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
·2209 words·11 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Institute of Artificial Intelligence (TeleAI), China Telecom
A novel regularized conditional diffusion model enables effective multi-task preference alignment in sequential decision-making by learning unified preference representations and maximizing mutual inf…
Recurrent Reinforcement Learning with Memoroids
·2207 words·11 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 University of Macau
Memoroids and Tape-Based Batching revolutionize recurrent RL, enabling efficient processing of long sequences and improving sample efficiency by eliminating segmentation.
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
·1896 words·9 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 UC Los Angeles
Reciprocators: AI agents that learn to cooperate by reciprocating influence, achieving prosocial outcomes in complex scenarios.
REBEL: Reinforcement Learning via Regressing Relative Rewards
·2652 words·13 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Cornell University
REBEL, a novel reinforcement learning algorithm, simplifies policy optimization by regressing relative rewards, achieving strong performance in language and image generation tasks with increased effic…
Real-Time Selection Under General Constraints via Predictive Inference
·2826 words·14 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Nankai University
II-COS: a novel online sample selection method effectively controls individual and interactive constraints in real-time via predictive inference, improving efficiency and addressing various practical …
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
·3400 words·16 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 University of Alberta
Recurrent Trace Units (RTUs) significantly enhance real-time recurrent learning in reinforcement learning, outperforming other methods with less computation.
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
·3372 words·16 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Duke University
Provably efficient randomized exploration in cooperative MARL is achieved via a novel unified algorithm framework, CoopTS, using Thompson Sampling with PHE and LMC exploration strategies.
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
·543 words·3 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Seoul National University
First provably efficient randomized RL algorithms using multinomial logistic function approximation are introduced, achieving superior performance and constant-time computational cost.
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
·1647 words·8 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 EPFL, Switzerland
This paper presents randomized algorithms with PAC bounds for solving inverse reinforcement learning problems in continuous state and action spaces, offering robust theoretical guarantees and practica…