Machine Learning

Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback

26 September 2024·355 words·2 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Google Research

New algorithms conquer adversarial low-rank MDPs, improving regret bounds for unknown transitions and bandit feedback.

Bayesian Domain Adaptation with Gaussian Mixture Domain-Indexing

26 September 2024·2584 words·13 mins· loading · loading

AI Generated Machine Learning Transfer Learning 🏢 Sun Yat-Sen University

GMDI: a novel Bayesian domain adaptation algorithm significantly improves adaptation by dynamically modeling domain indices using Gaussian Mixture Models, outperforming state-of-the-art methods.

Bayesian Adaptive Calibration and Optimal Design

26 September 2024·1874 words·9 mins· loading · loading

Machine Learning Active Learning 🏢 CSIRO's Data61

BACON: a novel Bayesian adaptive calibration and optimal design algorithm maximizes information gain for data-efficient computer model calibration, significantly outperforming existing methods in synt…

Batched Energy-Entropy acquisition for Bayesian Optimization

26 September 2024·4543 words·22 mins· loading · loading

Machine Learning Optimization 🏢 Machine Intelligence

BEEBO: a novel acquisition function for Bayesian Optimization, offering superior explore-exploit balance and handling large batches efficiently, even with noisy data.

Bandits with Ranking Feedback

26 September 2024·1499 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Politecnico Di Milano

This paper introduces ‘bandits with ranking feedback,’ a novel bandit variation providing ranked feedback instead of numerical rewards. It proves instance-dependent cases require superlogarithmic reg…

Bandits with Preference Feedback: A Stackelberg Game Perspective

26 September 2024·1514 words·8 mins· loading · loading

Machine Learning Optimization 🏢 ETH Zurich

MAXMINLCB, a novel game-theoretic algorithm, efficiently solves bandit problems with preference feedback over continuous domains, providing anytime-valid, rate-optimal regret guarantees.

Bandits with Abstention under Expert Advice

26 September 2024·2058 words·10 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Alan Turing Institute

The Confidence-Rated Bandits with Abstentions (CBA) algorithm significantly improves reward bounds for prediction with expert advice by strategically leveraging an abstention action.

BAN: Detecting Backdoors Activated by Neuron Noise

26 September 2024·2897 words·14 mins· loading · loading

Machine Learning Deep Learning 🏢 Radboud University

BAN: a novel backdoor defense using adversarial neuron noise for efficient detection and mitigation.

Balancing Context Length and Mixing Times for Reinforcement Learning at Scale

26 September 2024·1724 words·9 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 IBM Research

Longer context in RL boosts generalization but slows down learning; this paper reveals the crucial tradeoff and offers theoretical insights.

B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous Data

26 September 2024·1511 words·8 mins· loading · loading

Machine Learning Deep Learning 🏢 Chinese University of Hong Kong, Shenzhen

B-ary Tree Push-Pull (BTPP) achieves linear speedup for distributed learning on heterogeneous data, significantly outperforming state-of-the-art methods with minimal communication.

Avoiding Undesired Future with Minimal Cost in Non-Stationary Environments

26 September 2024·2100 words·10 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 National Key Laboratory for Novel Software Technology, Nanjing University, China

AUF-MICNS: A novel sequential method efficiently solves the avoiding undesired future problem by dynamically updating influence relations in non-stationary environments while minimizing action costs.

Autoregressive Policy Optimization for Constrained Allocation Tasks

26 September 2024·2331 words·11 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Munich Center for Machine Learning

PASPO: a novel autoregressive policy optimization method for constrained allocation tasks guarantees constraint satisfaction and outperforms existing methods.

AUC Maximization under Positive Distribution Shift

26 September 2024·2247 words·11 mins· loading · loading

Machine Learning Semi-Supervised Learning 🏢 NTT

New method maximizes AUC under positive distribution shift using only positive and unlabeled training data, and unlabeled test data; improving imbalanced classification.

Attractor Memory for Long-Term Time Series Forecasting: A Chaos Perspective

26 September 2024·3387 words·16 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Hong Kong University of Science and Technology

Attraos: a novel long-term time series forecasting model leveraging chaos theory, significantly outperforms existing methods by utilizing attractor dynamics for efficient and accurate prediction.

Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential Families

26 September 2024·1536 words·8 mins· loading · loading

Machine Learning Optimization 🏢 Telecom Sud-Paris

This paper rigorously analyzes alpha-divergence variational inference, proving its convergence and providing convergence rates, thereby advancing the theoretical foundations of this increasingly impor…

Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning

26 September 2024·1937 words·10 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Oxford

Reinforcement learning agents achieve emergent cultural accumulation by balancing social and independent learning, outperforming single-lifetime agents.

AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields

26 September 2024·3676 words·18 mins· loading · loading

Machine Learning Deep Learning 🏢 Sorbonne Université

AROMA: Attentive Reduced Order Model with Attention enhances PDE modeling with local neural fields, offering efficient processing of diverse geometries and superior performance in simulating 1D and 2D…

Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?

26 September 2024·2360 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 MIT

Evidential deep learning’s uncertainty quantification is unreliable; this paper reveals its limitations, proposes model uncertainty incorporation for improved performance.

Are Self-Attentions Effective for Time Series Forecasting?

26 September 2024·3575 words·17 mins· loading · loading

Machine Learning Deep Learning 🏢 Seoul National University

Cross-Attention-only Time Series Transformer (CATS) outperforms existing models by removing self-attention, improving long-term forecasting accuracy, and reducing computational cost.

Are Multiple Instance Learning Algorithms Learnable for Instances?

26 September 2024·2369 words·12 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Graduate School of Data Science, Seoul National University of Science and Technology

Deep MIL algorithms’ instance-level learnability is theoretically proven, revealing crucial conditions for success and highlighting gaps in existing models.