Machine Learning

Streaming Bayes GFlowNets

26 September 2024·2088 words·10 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Getulio Vargas Foundation

SB-GFlowNets: Streaming Bayesian inference is now efficient and accurate using GFlowNets, enabling real-time model updates for large, sequential datasets.

Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting

26 September 2024·1460 words·7 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 CREST, ENSAE

Incentive-aware algorithm achieves low regret in strategic multi-armed bandits under debt-free reporting, establishing truthful equilibrium among arms.

Stopping Bayesian Optimization with Probabilistic Regret Bounds

26 September 2024·3802 words·18 mins· loading · loading

Machine Learning Optimization 🏢 Morgan Stanley

This paper presents a novel probabilistic regret bound (PRB) framework for Bayesian optimization, replacing the traditional fixed-budget stopping rule with a criterion based on the probability of find…

Stochastic Optimization Schemes for Performative Prediction with Nonconvex Loss

26 September 2024·2232 words·11 mins· loading · loading

AI Generated Machine Learning Optimization 🏢 Chinese University of Hong Kong

Bias-free performative prediction is achieved using a novel lazy deployment scheme with SGD, handling non-convex loss functions.

Stochastic Optimal Control for Diffusion Bridges in Function Spaces

26 September 2024·2194 words·11 mins· loading · loading

Machine Learning Deep Learning 🏢 KAIST

Researchers extended stochastic optimal control theory to infinite-dimensional spaces, enabling the creation of diffusion bridges for generative modeling in function spaces, demonstrating applications…

Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel Machines

26 September 2024·1434 words·7 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Bristol

Deep kernel machines now achieve 94.5% accuracy on CIFAR-10, matching neural networks, by using stochastic kernel regularization to improve generalization.

Stochastic contextual bandits with graph feedback: from independence number to MAS number

26 September 2024·289 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 New York University

Contextual bandits with graph feedback achieve near-optimal regret by leveraging a novel graph-theoretic quantity that interpolates between independence and maximum acyclic subgraph numbers, depending…

Stepping on the Edge: Curvature Aware Learning Rate Tuners

26 September 2024·2482 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 Google DeepMind

Adaptive learning rate tuners often underperform; Curvature Dynamics Aware Tuning (CDAT) prioritizes long-term curvature stabilization, outperforming tuned constant learning rates.

Stepping Forward on the Last Mile

26 September 2024·2832 words·14 mins· loading · loading

Machine Learning Few-Shot Learning 🏢 Qualcomm AI Research

On-device training with fixed-point forward gradients enables efficient model personalization on resource-constrained edge devices, overcoming backpropagation’s memory limitations.

State-free Reinforcement Learning

26 September 2024·357 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Boston University

State-free Reinforcement Learning (SFRL) framework eliminates the need for state-space information in RL algorithms, achieving regret bounds independent of the state space size and adaptive to the rea…

State Space Models on Temporal Graphs: A First-Principles Study

26 September 2024·1794 words·9 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Sun Yat-Sen University

GRAPHSSM: a novel graph state space model efficiently captures temporal graph dynamics, overcoming limitations of existing sequence models.

State Chrono Representation for Enhancing Generalization in Reinforcement Learning

26 September 2024·2535 words·12 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of California, Santa Barbara

State Chrono Representation (SCR) enhances reinforcement learning generalization by incorporating extensive temporal information and cumulative rewards into state representations, improving performanc…

ST$_k$: A Scalable Module for Solving Top-k Problems

26 September 2024·2467 words·12 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 School of Mathematical Sciences

STk: a novel, differentiable module solves Top-k problems in neural networks without extra time/GPU memory, boosting performance in long-tailed learning.

SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning

26 September 2024·3010 words·15 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Singapore Management University

SPRINQL: Sub-optimal Demonstrations for Offline Imitation Learning

SPO: Sequential Monte Carlo Policy Optimisation

26 September 2024·3026 words·15 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Amsterdam

SPO: A novel model-based RL algorithm leverages parallelisable Monte Carlo tree search for efficient and robust policy improvement in both discrete and continuous environments.

Spiking Token Mixer: A event-driven friendly Former structure for spiking neural networks

26 September 2024·1852 words·9 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Electronic Science and Technology of China

STMixer: a novel SNN architecture enabling high performance on both synchronous and asynchronous neuromorphic hardware, achieving comparable results to spiking transformers with drastically lower powe…

Spiking Graph Neural Network on Riemannian Manifolds

26 September 2024·3421 words·17 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 North China Electric Power University

Spiking Graph Neural Networks (SGNNs) on Riemannian Manifolds achieve superior performance and energy efficiency via a novel Manifold Spiking GNN (MSG).

Speculative Monte-Carlo Tree Search

26 September 2024·1943 words·10 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Pennsylvania State University

Speculative MCTS accelerates AlphaZero training by implementing speculative execution, enabling parallel processing of future moves and reducing latency by up to 5.8x.

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

26 September 2024·2502 words·12 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Seoul National University

SRCPO: a novel spectral risk measure-constrained RL algorithm guaranteeing convergence to a global optimum, outperforming existing methods in continuous control tasks.

Spectral Learning of Shared Dynamics Between Generalized-Linear Processes

26 September 2024·2151 words·11 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 University of Southern California

PGLDM, a novel algorithm, accurately identifies shared and private dynamics in two generalized-linear time series, improving model accuracy and enabling lower-dimensional latent state representations.