🏢 University of Alberta

Unleashing Multispectral Video's Potential in Semantic Segmentation: A Semi-supervised Viewpoint and New UAV-View Benchmark

26 September 2024·2138 words·11 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 University of Alberta

New MVUAV dataset and SemiMV semi-supervised learning model significantly improve multispectral video semantic segmentation!

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q^π-Realizability and Concentrability

26 September 2024·479 words·3 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Alberta

Offline RL with trajectory data achieves statistically efficient learning under linear q*-realizability and concentrability, solving a previously deemed impossible problem.

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

26 September 2024·3400 words·16 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Alberta

Recurrent Trace Units (RTUs) significantly enhance real-time recurrent learning in reinforcement learning, outperforming other methods with less computation.

Learning Truncated Causal History Model for Video Restoration

26 September 2024·2473 words·12 mins· loading · loading

Computer Vision Video Understanding 🏢 University of Alberta

TURTLE: a novel video restoration framework that learns a truncated causal history model for efficient and high-performing video restoration, achieving state-of-the-art results on various benchmark ta…

Distributional Reinforcement Learning with Regularized Wasserstein Loss

26 September 2024·2196 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Alberta

Sinkhorn distributional RL (SinkhornDRL) uses a regularized Wasserstein loss to improve distributional reinforcement learning.

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

26 September 2024·3268 words·16 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Alberta

Deep RL excels in simulated robotics, but struggles with real-world limitations like limited computational resources. This paper introduces Action Value Gradient (AVG), a novel incremental deep polic…

Confident Natural Policy Gradient for Local Planning in q_π-realizable Constrained MDPs

26 September 2024·227 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Alberta

Confident-NPG-CMDP: First primal-dual algorithm achieving polynomial sample complexity for solving constrained Markov decision processes (CMDPs) using function approximation and local access model.

Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits

26 September 2024·359 words·2 mins· loading · loading

AI Theory Optimization 🏢 University of Alberta

Generalized linear bandits with subexponential reward distributions are self-concordant, enabling second-order regret bounds free of exponential dependence on problem parameters.

A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning

26 September 2024·1860 words·9 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Alberta

New empirical methodology quantifies how much reinforcement learning algorithm performance relies on per-environment hyperparameter tuning, enabling better algorithm design.