Skip to main content

Machine Learning

Sample Selection via Contrastive Fragmentation for Noisy Label Regression
·6755 words·32 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Seoul National University
ConFrag, a novel approach to noisy label regression, leverages contrastive fragmentation and neighborhood agreement to select clean samples, significantly outperforming state-of-the-art baselines on s…
SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification
·2579 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 Singapore Management University
SampDetox uses diffusion models to purify poisoned machine learning samples by strategically adding noise to eliminate backdoors without compromising data integrity.
SAMPa: Sharpness-aware Minimization Parallelized
·2453 words·12 mins· loading · loading
Machine Learning Optimization 🏢 EPFL
SAMPa: Parallelizing gradient computations in Sharpness-Aware Minimization (SAM) achieves a 2x speedup and superior generalization.
Safety through feedback in Constrained RL
·2526 words·12 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Singapore Management University
Reinforcement Learning from Safety Feedback (RLSF) efficiently infers cost functions from trajectory-level feedback, enabling safe policy learning in complex environments.
SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models
·2317 words·11 mins· loading · loading
Machine Learning Continual Learning 🏢 Tencent AI Lab
SAFE, a novel parameter-efficient tuning framework, boosts pre-trained model performance in continual learning by balancing model stability and plasticity through slow and fast learning stages, signif…
Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage
·1556 words·8 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 ShanghaiTech University
A novel primal-dual method boosts offline safe reinforcement learning efficiency for convex CMDPs by using uncertainty parameters and achieving a sample complexity of O(1/(1-γ)√n).
S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning
·2415 words·12 mins· loading · loading
Machine Learning Deep Learning 🏢 Fudan University
S2HPruner bridges the discretization gap in neural network pruning via a novel soft-to-hard distillation framework, achieving superior performance across various benchmarks without fine-tuning.
S-MolSearch: 3D Semi-supervised Contrastive Learning for Bioactive Molecule Search
·1909 words·9 mins· loading · loading
Machine Learning Semi-Supervised Learning 🏢 Renmin University of China
S-MolSearch: a novel semi-supervised framework using 3D molecular data and contrastive learning achieves state-of-the-art in bioactive molecule search, outperforming existing methods.
Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures
·4237 words·20 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 University of Oxford
Rough Transformers: A lightweight continuous-time sequence modeling approach using path signatures to significantly reduce computational costs, improving efficiency and accuracy, particularly for long…
ROIDICE: Offline Return on Investment Maximization for Efficient Decision Making
·2572 words·13 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Korea University
ROIDICE: A novel offline reinforcement learning algorithm maximizes Return on Investment (ROI) by formulating the problem as linear fractional programming, yielding superior return-cost trade-offs.
Robust Reinforcement Learning with General Utility
·352 words·2 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 University of Maryland College Park
This paper introduces robust reinforcement learning with general utility, providing novel algorithms with convergence guarantees for training robust policies under environmental uncertainty, significa…
Robust Reinforcement Learning from Corrupted Human Feedback
·2358 words·12 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Georgia Tech
R³M enhances reinforcement learning from human feedback by robustly handling corrupted preference labels, consistently learning the underlying reward and identifying outliers with minimal computationa…
Robust Offline Active Learning on Graphs
·1968 words·10 mins· loading · loading
AI Generated Machine Learning Active Learning 🏢 Pennsylvania State University
This paper introduces a novel offline active learning method for node-level tasks on graphs, incorporating network structure and node covariates to improve efficiency and robustness, especially in noi…
Robust group and simultaneous inferences for high-dimensional single index model
·473 words·3 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Beijing Normal University
This paper introduces robust group inference procedures for high-dimensional single index models, offering substantial efficiency gains for heavy-tailed errors and handling group testing effectively w…
Robust Gaussian Processes via Relevance Pursuit
·2238 words·11 mins· loading · loading
Machine Learning Robustness 🏢 Meta
Robust Gaussian Processes via Relevance Pursuit tackles noisy data by cleverly inferring data-point specific noise levels, leading to more accurate predictions.
Robust Contrastive Multi-view Clustering against Dual Noisy Correspondence
·2413 words·12 mins· loading · loading
AI Generated Machine Learning Unsupervised Learning 🏢 College of Computer Science, Sichuan University, China
CANDY refines contrastive multi-view clustering by cleverly using inter-view similarities to identify and correct false negatives and a spectral method to remove false positives, resulting in signific…
Robot Policy Learning with Temporal Optimal Transport Reward
·2204 words·11 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 McGill University
Temporal Optimal Transport (TemporalOT) reward enhances robot policy learning by incorporating temporal order information into Optimal Transport (OT)-based proxy rewards, leading to improved accuracy …
RMLR: Extending Multinomial Logistic Regression into General Geometries
·2575 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Trento
RMLR: A novel framework extends multinomial logistic regression to diverse geometries, overcoming limitations of existing methods by requiring minimal geometric properties for broad applicability.
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
·435 words·3 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 University of Wisconsin-Madison
First sample-efficient algorithm for LMDPs without separation assumptions, achieving near-optimal guarantees via novel off-policy evaluation.
Risk-sensitive control as inference with Rényi divergence
·1494 words·8 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 University of Tokyo
Risk-sensitive control is recast as inference using Rényi divergence, yielding new algorithms and revealing equivalences between seemingly disparate methods.