Machine Learning
Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback
·355 words·2 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ Google Research
New algorithms conquer adversarial low-rank MDPs, improving regret bounds for unknown transitions and bandit feedback.
Bayesian Domain Adaptation with Gaussian Mixture Domain-Indexing
·2584 words·13 mins·
loading
·
loading
AI Generated
Machine Learning
Transfer Learning
π’ Sun Yat-Sen University
GMDI: a novel Bayesian domain adaptation algorithm significantly improves adaptation by dynamically modeling domain indices using Gaussian Mixture Models, outperforming state-of-the-art methods.
Bayesian Adaptive Calibration and Optimal Design
·1874 words·9 mins·
loading
·
loading
Machine Learning
Active Learning
π’ CSIRO's Data61
BACON: a novel Bayesian adaptive calibration and optimal design algorithm maximizes information gain for data-efficient computer model calibration, significantly outperforming existing methods in synt…
Batched Energy-Entropy acquisition for Bayesian Optimization
·4543 words·22 mins·
loading
·
loading
Machine Learning
Optimization
π’ Machine Intelligence
BEEBO: a novel acquisition function for Bayesian Optimization, offering superior explore-exploit balance and handling large batches efficiently, even with noisy data.
Bandits with Ranking Feedback
·1499 words·8 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
π’ Politecnico Di Milano
This paper introduces ‘bandits with ranking feedback,’ a novel bandit variation providing ranked feedback instead of numerical rewards. It proves instance-dependent cases require superlogarithmic reg…
Bandits with Preference Feedback: A Stackelberg Game Perspective
·1514 words·8 mins·
loading
·
loading
Machine Learning
Optimization
π’ ETH Zurich
MAXMINLCB, a novel game-theoretic algorithm, efficiently solves bandit problems with preference feedback over continuous domains, providing anytime-valid, rate-optimal regret guarantees.
Bandits with Abstention under Expert Advice
·2058 words·10 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ Alan Turing Institute
The Confidence-Rated Bandits with Abstentions (CBA) algorithm significantly improves reward bounds for prediction with expert advice by strategically leveraging an abstention action.
BAN: Detecting Backdoors Activated by Neuron Noise
·2897 words·14 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ Radboud University
BAN: a novel backdoor defense using adversarial neuron noise for efficient detection and mitigation.
Balancing Context Length and Mixing Times for Reinforcement Learning at Scale
·1724 words·9 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
π’ IBM Research
Longer context in RL boosts generalization but slows down learning; this paper reveals the crucial tradeoff and offers theoretical insights.
B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous Data
·1511 words·8 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ Chinese University of Hong Kong, Shenzhen
B-ary Tree Push-Pull (BTPP) achieves linear speedup for distributed learning on heterogeneous data, significantly outperforming state-of-the-art methods with minimal communication.
Avoiding Undesired Future with Minimal Cost in Non-Stationary Environments
·2100 words·10 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
π’ National Key Laboratory for Novel Software Technology, Nanjing University, China
AUF-MICNS: A novel sequential method efficiently solves the avoiding undesired future problem by dynamically updating influence relations in non-stationary environments while minimizing action costs.
Autoregressive Policy Optimization for Constrained Allocation Tasks
·2331 words·11 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ Munich Center for Machine Learning
PASPO: a novel autoregressive policy optimization method for constrained allocation tasks guarantees constraint satisfaction and outperforms existing methods.
AUC Maximization under Positive Distribution Shift
·2247 words·11 mins·
loading
·
loading
Machine Learning
Semi-Supervised Learning
π’ NTT
New method maximizes AUC under positive distribution shift using only positive and unlabeled training data, and unlabeled test data; improving imbalanced classification.
Attractor Memory for Long-Term Time Series Forecasting: A Chaos Perspective
·3387 words·16 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
π’ Hong Kong University of Science and Technology
Attraos: a novel long-term time series forecasting model leveraging chaos theory, significantly outperforms existing methods by utilizing attractor dynamics for efficient and accurate prediction.
Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential Families
·1536 words·8 mins·
loading
·
loading
Machine Learning
Optimization
π’ Telecom Sud-Paris
This paper rigorously analyzes alpha-divergence variational inference, proving its convergence and providing convergence rates, thereby advancing the theoretical foundations of this increasingly impor…
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning
·1937 words·10 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
π’ University of Oxford
Reinforcement learning agents achieve emergent cultural accumulation by balancing social and independent learning, outperforming single-lifetime agents.
AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields
·3676 words·18 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ Sorbonne UniversitΓ©
AROMA: Attentive Reduced Order Model with Attention enhances PDE modeling with local neural fields, offering efficient processing of diverse geometries and superior performance in simulating 1D and 2D…
Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?
·2360 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ MIT
Evidential deep learning’s uncertainty quantification is unreliable; this paper reveals its limitations, proposes model uncertainty incorporation for improved performance.
Are Self-Attentions Effective for Time Series Forecasting?
·3575 words·17 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ Seoul National University
Cross-Attention-only Time Series Transformer (CATS) outperforms existing models by removing self-attention, improving long-term forecasting accuracy, and reducing computational cost.
Are Multiple Instance Learning Algorithms Learnable for Instances?
·2369 words·12 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
π’ Graduate School of Data Science, Seoul National University of Science and Technology
Deep MIL algorithms’ instance-level learnability is theoretically proven, revealing crucial conditions for success and highlighting gaps in existing models.