Reinforcement Learning

Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment

26 September 2024·2568 words·13 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Boston University

PAGAR: a novel semi-supervised IRL framework prioritizing task alignment over data alignment, leveraging expert demonstrations as weak supervision to derive task-aligned reward functions for improved …

Rethinking Exploration in Reinforcement Learning with Effective Metric-Based Exploration Bonus

26 September 2024·2904 words·14 mins· loading · loading

Reinforcement Learning 🏢 University of Macau

Effective Metric-based Exploration Bonus (EME) enhances reinforcement learning exploration by using a robust metric for state discrepancy and a dynamically adjusted scaling factor based on reward mode…

Relating Hopfield Networks to Episodic Control

26 September 2024·3465 words·17 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Inria Centre of the University of Bordeaux

Neural Episodic Control’s differentiable dictionary is shown to be a Universal Hopfield Network, enabling improved performance and a novel evaluation criterion.

Reinforcement Learning with LTL and ⍵-Regular Objectives via Optimality-Preserving Translation to Average Rewards

26 September 2024·1651 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 NTU Singapore

Reinforcement learning with complex objectives made easy: This paper introduces an optimality-preserving translation to reduce problems with Linear Temporal Logic (LTL) objectives to standard average …

Reinforcement Learning with Lookahead Information

26 September 2024·333 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 FairPlay Joint Team, CREST, ENSAE Paris

Provably efficient RL algorithms are designed to utilize immediate reward or transition information, significantly improving reward collection in unknown environments.

Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control

26 September 2024·2663 words·13 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of South Carolina

Boosting RL data efficiency for continuous control, this paper advocates Euclidean data augmentation using limb-based state features, significantly improving performance across various tasks.

Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems

26 September 2024·2792 words·14 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Dyson School of Design Engineering

Safe reinforcement learning is achieved via RL-AR, an algorithm that combines a safe policy with an RL policy using a focus module, ensuring safety during training while achieving competitive performa…

Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic Modularity

26 September 2024·330 words·2 mins· loading · loading

Reinforcement Learning 🏢 University of Michigan

This paper pioneers a modular framework for reinforcement learning, addressing the challenge of learning under complex observations and simpler latent dynamics, offering both statistical and algorithm…

Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer

26 September 2024·2290 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Nanjing University

Reinforcement learning refines existing macro placements, enhancing chip design by improving power, performance, and area (PPA) metrics and integrating the often-overlooked metric of regularity.

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers

26 September 2024·4163 words·20 mins· loading · loading

Reinforcement Learning 🏢 University of Illinois Urbana-Champaign

Boost online finetuning of Decision Transformers by adding TD3 gradients, especially when pretrained with low-reward data.

Regularized Q-Learning

26 September 2024·1497 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 KAIST

RegQ: A novel regularized Q-learning algorithm ensures convergence with linear function approximation, solving a long-standing instability problem in reinforcement learning.

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

26 September 2024·2209 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Institute of Artificial Intelligence (TeleAI), China Telecom

A novel regularized conditional diffusion model enables effective multi-task preference alignment in sequential decision-making by learning unified preference representations and maximizing mutual inf…

Recurrent Reinforcement Learning with Memoroids

26 September 2024·2207 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Macau

Memoroids and Tape-Based Batching revolutionize recurrent RL, enabling efficient processing of long sequences and improving sample efficiency by eliminating segmentation.

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

26 September 2024·1896 words·9 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 UC Los Angeles

Reciprocators: AI agents that learn to cooperate by reciprocating influence, achieving prosocial outcomes in complex scenarios.

REBEL: Reinforcement Learning via Regressing Relative Rewards

26 September 2024·2652 words·13 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Cornell University

REBEL, a novel reinforcement learning algorithm, simplifies policy optimization by regressing relative rewards, achieving strong performance in language and image generation tasks with increased effic…

Real-Time Selection Under General Constraints via Predictive Inference

26 September 2024·2826 words·14 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Nankai University

II-COS: a novel online sample selection method effectively controls individual and interactive constraints in real-time via predictive inference, improving efficiency and addressing various practical …

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

26 September 2024·3400 words·16 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Alberta

Recurrent Trace Units (RTUs) significantly enhance real-time recurrent learning in reinforcement learning, outperforming other methods with less computation.

Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

26 September 2024·3372 words·16 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Duke University

Provably efficient randomized exploration in cooperative MARL is achieved via a novel unified algorithm framework, CoopTS, using Thompson Sampling with PHE and LMC exploration strategies.

Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation

26 September 2024·543 words·3 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Seoul National University

First provably efficient randomized RL algorithms using multinomial logistic function approximation are introduced, achieving superior performance and constant-time computational cost.

Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces

26 September 2024·1647 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 EPFL, Switzerland

This paper presents randomized algorithms with PAC bounds for solving inverse reinforcement learning problems in continuous state and action spaces, offering robust theoretical guarantees and practica…