Machine Learning

Sample Selection via Contrastive Fragmentation for Noisy Label Regression

26 September 2024·6755 words·32 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Seoul National University

ConFrag, a novel approach to noisy label regression, leverages contrastive fragmentation and neighborhood agreement to select clean samples, significantly outperforming state-of-the-art baselines on s…

SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification

26 September 2024·2579 words·13 mins· loading · loading

Machine Learning Deep Learning 🏢 Singapore Management University

SampDetox uses diffusion models to purify poisoned machine learning samples by strategically adding noise to eliminate backdoors without compromising data integrity.

SAMPa: Sharpness-aware Minimization Parallelized

26 September 2024·2453 words·12 mins· loading · loading

Machine Learning Optimization 🏢 EPFL

SAMPa: Parallelizing gradient computations in Sharpness-Aware Minimization (SAM) achieves a 2x speedup and superior generalization.

Safety through feedback in Constrained RL

26 September 2024·2526 words·12 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Singapore Management University

Reinforcement Learning from Safety Feedback (RLSF) efficiently infers cost functions from trajectory-level feedback, enabling safe policy learning in complex environments.

SAFE: Slow and Fast Parameter-Efﬁcient Tuning for Continual Learning with Pre-Trained Models

26 September 2024·2317 words·11 mins· loading · loading

Machine Learning Continual Learning 🏢 Tencent AI Lab

SAFE, a novel parameter-efficient tuning framework, boosts pre-trained model performance in continual learning by balancing model stability and plasticity through slow and fast learning stages, signif…

Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage

26 September 2024·1556 words·8 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 ShanghaiTech University

A novel primal-dual method boosts offline safe reinforcement learning efficiency for convex CMDPs by using uncertainty parameters and achieving a sample complexity of O(1/(1-γ)√n).

S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning

26 September 2024·2415 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 Fudan University

S2HPruner bridges the discretization gap in neural network pruning via a novel soft-to-hard distillation framework, achieving superior performance across various benchmarks without fine-tuning.

S-MolSearch: 3D Semi-supervised Contrastive Learning for Bioactive Molecule Search

26 September 2024·1909 words·9 mins· loading · loading

Machine Learning Semi-Supervised Learning 🏢 Renmin University of China

S-MolSearch: a novel semi-supervised framework using 3D molecular data and contrastive learning achieves state-of-the-art in bioactive molecule search, outperforming existing methods.

Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures

26 September 2024·4237 words·20 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 University of Oxford

Rough Transformers: A lightweight continuous-time sequence modeling approach using path signatures to significantly reduce computational costs, improving efficiency and accuracy, particularly for long…

ROIDICE: Offline Return on Investment Maximization for Efficient Decision Making

26 September 2024·2572 words·13 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Korea University

ROIDICE: A novel offline reinforcement learning algorithm maximizes Return on Investment (ROI) by formulating the problem as linear fractional programming, yielding superior return-cost trade-offs.

Robust Reinforcement Learning with General Utility

26 September 2024·352 words·2 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Maryland College Park

This paper introduces robust reinforcement learning with general utility, providing novel algorithms with convergence guarantees for training robust policies under environmental uncertainty, significa…

Robust Reinforcement Learning from Corrupted Human Feedback

26 September 2024·2358 words·12 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Georgia Tech

R³M enhances reinforcement learning from human feedback by robustly handling corrupted preference labels, consistently learning the underlying reward and identifying outliers with minimal computationa…

Robust Offline Active Learning on Graphs

26 September 2024·1968 words·10 mins· loading · loading

AI Generated Machine Learning Active Learning 🏢 Pennsylvania State University

This paper introduces a novel offline active learning method for node-level tasks on graphs, incorporating network structure and node covariates to improve efficiency and robustness, especially in noi…

Robust group and simultaneous inferences for high-dimensional single index model

26 September 2024·473 words·3 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Beijing Normal University

This paper introduces robust group inference procedures for high-dimensional single index models, offering substantial efficiency gains for heavy-tailed errors and handling group testing effectively w…

Robust Gaussian Processes via Relevance Pursuit

26 September 2024·2238 words·11 mins· loading · loading

Machine Learning Robustness 🏢 Meta

Robust Gaussian Processes via Relevance Pursuit tackles noisy data by cleverly inferring data-point specific noise levels, leading to more accurate predictions.

Robust Contrastive Multi-view Clustering against Dual Noisy Correspondence

26 September 2024·2413 words·12 mins· loading · loading

AI Generated Machine Learning Unsupervised Learning 🏢 College of Computer Science, Sichuan University, China

CANDY refines contrastive multi-view clustering by cleverly using inter-view similarities to identify and correct false negatives and a spectral method to remove false positives, resulting in signific…

Robot Policy Learning with Temporal Optimal Transport Reward

26 September 2024·2204 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 McGill University

Temporal Optimal Transport (TemporalOT) reward enhances robot policy learning by incorporating temporal order information into Optimal Transport (OT)-based proxy rewards, leading to improved accuracy …

RMLR: Extending Multinomial Logistic Regression into General Geometries

26 September 2024·2575 words·13 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Trento

RMLR: A novel framework extends multinomial logistic regression to diverse geometries, overcoming limitations of existing methods by requiring minimal geometric properties for broad applicability.

RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation

26 September 2024·435 words·3 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Wisconsin-Madison

First sample-efficient algorithm for LMDPs without separation assumptions, achieving near-optimal guarantees via novel off-policy evaluation.

Risk-sensitive control as inference with Rényi divergence

26 September 2024·1494 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Tokyo

Risk-sensitive control is recast as inference using Rényi divergence, yielding new algorithms and revealing equivalences between seemingly disparate methods.