Skip to main content

Machine Learning

ARC: A Generalist Graph Anomaly Detector with In-Context Learning
·2432 words·12 mins· loading · loading
Machine Learning Few-Shot Learning 🏢 Griffith University
ARC: a novel generalist graph anomaly detector leveraging in-context learning for efficient, one-for-all anomaly detection across various datasets without retraining.
Approximation Rate of the Transformer Architecture for Sequence Modeling
·1599 words·8 mins· loading · loading
Machine Learning Deep Learning 🏢 CNRS@CREATE LTD
This paper unveils the Transformer’s approximation power, deriving explicit Jackson-type rates to reveal its strengths and limitations in handling various sequential relationships.
Approximately Pareto-optimal Solutions for Bi-Objective k-Clustering
·5572 words·27 mins· loading · loading
AI Generated Machine Learning Unsupervised Learning 🏢 Heinrich Heine University Düsseldorf
This paper presents novel algorithms for approximating Pareto-optimal solutions to bi-objective k-clustering problems, achieving provable approximation guarantees and demonstrating effectiveness throu…
Approximately Equivariant Neural Processes
·2389 words·12 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Cambridge
Boosting meta-learning, this paper introduces a novel, flexible approach to create approximately equivariant neural processes that outperform both non-equivariant and strictly equivariant counterparts…
ANT: Adaptive Noise Schedule for Time Series Diffusion Models
·4333 words·21 mins· loading · loading
Machine Learning Deep Learning 🏢 Yonsei University
ANT: An adaptive noise schedule automatically determines optimal noise schedules for time series diffusion models, significantly boosting performance across diverse tasks.
Analysis of Corrected Graph Convolutions
·1907 words·9 mins· loading · loading
AI Generated Machine Learning Semi-Supervised Learning 🏢 Cheriton School of Computer Science, University of Waterloo
Corrected graph convolutions prevent oversmoothing and exponentially improve GNN classification accuracy.
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
·2513 words·12 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Sun Yat-Sen University
This work introduces PDOA, an offline adaptation framework for constrained multi-objective RL, using demonstrations instead of manually designed preferences to infer optimal policies while satisfying …
An Information Theoretic Perspective on Conformal Prediction
·3559 words·17 mins· loading · loading
AI Generated Machine Learning Federated Learning 🏢 Qualcomm AI Research
This paper uses information theory to improve conformal prediction, proving new ways to bound uncertainty and creating better training methods and side-information incorporation.
An Improved Empirical Fisher Approximation for Natural Gradient Descent
·2632 words·13 mins· loading · loading
Machine Learning Optimization 🏢 University of Cambridge
Improved Empirical Fisher (iEF) approximation significantly boosts the performance of Natural Gradient Descent (NGD) optimizers, offering superior convergence and generalization.
An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem
·2679 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Oxford
A novel multilinear model analytically explains the emergence and scaling laws of skills in the multitask sparse parity problem, accurately predicting skill emergence in neural networks.
An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning
·2278 words·11 mins· loading · loading
Machine Learning Few-Shot Learning 🏢 Tsinghua University
Mecoin: a novel memory module for efficient graph few-shot class-incremental learning, tackles catastrophic forgetting by employing structured memory units and a memory representation adaptation modul…
An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning
·210 words·1 min· loading · loading
Machine Learning Reinforcement Learning 🏢 Artificial Intelligence Research Institute (IIIA-CSIC)
This paper provides novel theoretical analyses of utility functions in MORL, characterizing preferences and functions guaranteeing optimal policies.
An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints
·1703 words·8 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Seoul National University
Adaptive algorithm achieves tight regret bounds for infinitely many-armed bandits under generalized rotting constraints, addressing the challenge of decreasing rewards over time.
An Accelerated Gradient Method for Convex Smooth Simple Bilevel Optimization
·448 words·3 mins· loading · loading
Machine Learning Optimization 🏢 University of Texas at Austin
Accelerated Gradient Method for Bilevel Optimization (AGM-BiO) achieves state-of-the-art convergence rates for simple bilevel optimization problems, requiring fewer iterations than existing methods to…
An Accelerated Algorithm for Stochastic Bilevel Optimization under Unbounded Smoothness
·406 words·2 mins· loading · loading
Machine Learning Meta Learning 🏢 George Mason University
AccBO: A new accelerated algorithm achieves O(ε⁻³) oracle complexity for stochastic bilevel optimization with unbounded smoothness, significantly improving upon existing O(ε⁻⁴) methods.
Amortizing intractable inference in diffusion models for vision, language, and control
·5979 words·29 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Mila, Université De Montréal
Amortized sampling from complex posteriors using diffusion models is achieved via a novel data-free learning objective, Relative Trajectory Balance (RTB). RTB’s asymptotic correctness is proven, offe…
Amortized Planning with Large-Scale Transformers: A Case Study on Chess
·3346 words·16 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Google DeepMind
Large-scale transformers achieve grandmaster-level chess play via supervised learning on a new 10M game benchmark dataset, demonstrating impressive generalization beyond memorization.
Amortized Fourier Neural Operators
·2070 words·10 mins· loading · loading
Machine Learning Deep Learning 🏢 Qing Yuan Research Institute, SEIEE, Shanghai Jiao Tong University
Amortized Fourier Neural Operators (AM-FNOs) dramatically improve efficiency in solving PDEs by using neural networks for kernel parameterization, achieving up to 31% better accuracy compared to exist…
Amortized Bayesian Experimental Design for Decision-Making
·1746 words·9 mins· loading · loading
Machine Learning Active Learning 🏢 Aalto University
Amortized Decision-Aware BED prioritizes maximizing downstream decision utility by instantly proposing informative experimental designs and inferring decisions, facilitating accurate decision-making.
Amortized Active Causal Induction with Deep Reinforcement Learning
·3383 words·16 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 University of Oxford
CAASL: An amortized active intervention design policy trained via reinforcement learning, enabling adaptive, real-time causal graph inference without likelihood access.