Skip to main content

Machine Learning

Streaming Bayes GFlowNets
·2088 words·10 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Getulio Vargas Foundation
SB-GFlowNets: Streaming Bayesian inference is now efficient and accurate using GFlowNets, enabling real-time model updates for large, sequential datasets.
Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting
·1460 words·7 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 CREST, ENSAE
Incentive-aware algorithm achieves low regret in strategic multi-armed bandits under debt-free reporting, establishing truthful equilibrium among arms.
Stopping Bayesian Optimization with Probabilistic Regret Bounds
·3802 words·18 mins· loading · loading
Machine Learning Optimization 🏢 Morgan Stanley
This paper presents a novel probabilistic regret bound (PRB) framework for Bayesian optimization, replacing the traditional fixed-budget stopping rule with a criterion based on the probability of find…
Stochastic Optimization Schemes for Performative Prediction with Nonconvex Loss
·2232 words·11 mins· loading · loading
AI Generated Machine Learning Optimization 🏢 Chinese University of Hong Kong
Bias-free performative prediction is achieved using a novel lazy deployment scheme with SGD, handling non-convex loss functions.
Stochastic Optimal Control for Diffusion Bridges in Function Spaces
·2194 words·11 mins· loading · loading
Machine Learning Deep Learning 🏢 KAIST
Researchers extended stochastic optimal control theory to infinite-dimensional spaces, enabling the creation of diffusion bridges for generative modeling in function spaces, demonstrating applications…
Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel Machines
·1434 words·7 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Bristol
Deep kernel machines now achieve 94.5% accuracy on CIFAR-10, matching neural networks, by using stochastic kernel regularization to improve generalization.
Stochastic contextual bandits with graph feedback: from independence number to MAS number
·289 words·2 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 New York University
Contextual bandits with graph feedback achieve near-optimal regret by leveraging a novel graph-theoretic quantity that interpolates between independence and maximum acyclic subgraph numbers, depending…
Stepping on the Edge: Curvature Aware Learning Rate Tuners
·2482 words·12 mins· loading · loading
Machine Learning Deep Learning 🏢 Google DeepMind
Adaptive learning rate tuners often underperform; Curvature Dynamics Aware Tuning (CDAT) prioritizes long-term curvature stabilization, outperforming tuned constant learning rates.
Stepping Forward on the Last Mile
·2832 words·14 mins· loading · loading
Machine Learning Few-Shot Learning 🏢 Qualcomm AI Research
On-device training with fixed-point forward gradients enables efficient model personalization on resource-constrained edge devices, overcoming backpropagation’s memory limitations.
State-free Reinforcement Learning
·357 words·2 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Boston University
State-free Reinforcement Learning (SFRL) framework eliminates the need for state-space information in RL algorithms, achieving regret bounds independent of the state space size and adaptive to the rea…
State Space Models on Temporal Graphs: A First-Principles Study
·1794 words·9 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Sun Yat-Sen University
GRAPHSSM: a novel graph state space model efficiently captures temporal graph dynamics, overcoming limitations of existing sequence models.
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
·2535 words·12 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 University of California, Santa Barbara
State Chrono Representation (SCR) enhances reinforcement learning generalization by incorporating extensive temporal information and cumulative rewards into state representations, improving performanc…
ST$_k$: A Scalable Module for Solving Top-k Problems
·2467 words·12 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 School of Mathematical Sciences
STk: a novel, differentiable module solves Top-k problems in neural networks without extra time/GPU memory, boosting performance in long-tailed learning.
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
·3010 words·15 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Singapore Management University
SPRINQL: Sub-optimal Demonstrations for Offline Imitation Learning
SPO: Sequential Monte Carlo Policy Optimisation
·3026 words·15 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 University of Amsterdam
SPO: A novel model-based RL algorithm leverages parallelisable Monte Carlo tree search for efficient and robust policy improvement in both discrete and continuous environments.
Spiking Token Mixer: A event-driven friendly Former structure for spiking neural networks
·1852 words·9 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Electronic Science and Technology of China
STMixer: a novel SNN architecture enabling high performance on both synchronous and asynchronous neuromorphic hardware, achieving comparable results to spiking transformers with drastically lower powe…
Spiking Graph Neural Network on Riemannian Manifolds
·3421 words·17 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 North China Electric Power University
Spiking Graph Neural Networks (SGNNs) on Riemannian Manifolds achieve superior performance and energy efficiency via a novel Manifold Spiking GNN (MSG).
Speculative Monte-Carlo Tree Search
·1943 words·10 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Pennsylvania State University
Speculative MCTS accelerates AlphaZero training by implementing speculative execution, enabling parallel processing of future moves and reducing latency by up to 5.8x.
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
·2502 words·12 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Seoul National University
SRCPO: a novel spectral risk measure-constrained RL algorithm guaranteeing convergence to a global optimum, outperforming existing methods in continuous control tasks.
Spectral Learning of Shared Dynamics Between Generalized-Linear Processes
·2151 words·11 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 University of Southern California
PGLDM, a novel algorithm, accurately identifies shared and private dynamics in two generalized-linear time series, improving model accuracy and enabling lower-dimensional latent state representations.