Machine Learning

ARC: A Generalist Graph Anomaly Detector with In-Context Learning

26 September 2024·2432 words·12 mins· loading · loading

Machine Learning Few-Shot Learning 🏢 Griffith University

ARC: a novel generalist graph anomaly detector leveraging in-context learning for efficient, one-for-all anomaly detection across various datasets without retraining.

Approximation Rate of the Transformer Architecture for Sequence Modeling

26 September 2024·1599 words·8 mins· loading · loading

Machine Learning Deep Learning 🏢 CNRS@CREATE LTD

This paper unveils the Transformer’s approximation power, deriving explicit Jackson-type rates to reveal its strengths and limitations in handling various sequential relationships.

Approximately Pareto-optimal Solutions for Bi-Objective k-Clustering

26 September 2024·5572 words·27 mins· loading · loading

AI Generated Machine Learning Unsupervised Learning 🏢 Heinrich Heine University Düsseldorf

This paper presents novel algorithms for approximating Pareto-optimal solutions to bi-objective k-clustering problems, achieving provable approximation guarantees and demonstrating effectiveness throu…

Approximately Equivariant Neural Processes

26 September 2024·2389 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Cambridge

Boosting meta-learning, this paper introduces a novel, flexible approach to create approximately equivariant neural processes that outperform both non-equivariant and strictly equivariant counterparts…

ANT: Adaptive Noise Schedule for Time Series Diffusion Models

26 September 2024·4333 words·21 mins· loading · loading

Machine Learning Deep Learning 🏢 Yonsei University

ANT: An adaptive noise schedule automatically determines optimal noise schedules for time series diffusion models, significantly boosting performance across diverse tasks.

Analysis of Corrected Graph Convolutions

26 September 2024·1907 words·9 mins· loading · loading

AI Generated Machine Learning Semi-Supervised Learning 🏢 Cheriton School of Computer Science, University of Waterloo

Corrected graph convolutions prevent oversmoothing and exponentially improve GNN classification accuracy.

An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning

26 September 2024·2513 words·12 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Sun Yat-Sen University

This work introduces PDOA, an offline adaptation framework for constrained multi-objective RL, using demonstrations instead of manually designed preferences to infer optimal policies while satisfying …

An Information Theoretic Perspective on Conformal Prediction

26 September 2024·3559 words·17 mins· loading · loading

AI Generated Machine Learning Federated Learning 🏢 Qualcomm AI Research

This paper uses information theory to improve conformal prediction, proving new ways to bound uncertainty and creating better training methods and side-information incorporation.

An Improved Empirical Fisher Approximation for Natural Gradient Descent

26 September 2024·2632 words·13 mins· loading · loading

Machine Learning Optimization 🏢 University of Cambridge

Improved Empirical Fisher (iEF) approximation significantly boosts the performance of Natural Gradient Descent (NGD) optimizers, offering superior convergence and generalization.

An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem

26 September 2024·2679 words·13 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Oxford

A novel multilinear model analytically explains the emergence and scaling laws of skills in the multitask sparse parity problem, accurately predicting skill emergence in neural networks.

An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

26 September 2024·2278 words·11 mins· loading · loading

Machine Learning Few-Shot Learning 🏢 Tsinghua University

Mecoin: a novel memory module for efficient graph few-shot class-incremental learning, tackles catastrophic forgetting by employing structured memory units and a memory representation adaptation modul…

An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning

26 September 2024·210 words·1 min· loading · loading

Machine Learning Reinforcement Learning 🏢 Artificial Intelligence Research Institute (IIIA-CSIC)

This paper provides novel theoretical analyses of utility functions in MORL, characterizing preferences and functions guaranteeing optimal policies.

An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints

26 September 2024·1703 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Seoul National University

Adaptive algorithm achieves tight regret bounds for infinitely many-armed bandits under generalized rotting constraints, addressing the challenge of decreasing rewards over time.

An Accelerated Gradient Method for Convex Smooth Simple Bilevel Optimization

26 September 2024·448 words·3 mins· loading · loading

Machine Learning Optimization 🏢 University of Texas at Austin

Accelerated Gradient Method for Bilevel Optimization (AGM-BiO) achieves state-of-the-art convergence rates for simple bilevel optimization problems, requiring fewer iterations than existing methods to…

An Accelerated Algorithm for Stochastic Bilevel Optimization under Unbounded Smoothness

26 September 2024·406 words·2 mins· loading · loading

Machine Learning Meta Learning 🏢 George Mason University

AccBO: A new accelerated algorithm achieves O(ε⁻³) oracle complexity for stochastic bilevel optimization with unbounded smoothness, significantly improving upon existing O(ε⁻⁴) methods.

Amortizing intractable inference in diffusion models for vision, language, and control

26 September 2024·5979 words·29 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Mila, Université De Montréal

Amortized sampling from complex posteriors using diffusion models is achieved via a novel data-free learning objective, Relative Trajectory Balance (RTB). RTB’s asymptotic correctness is proven, offe…

Amortized Planning with Large-Scale Transformers: A Case Study on Chess

26 September 2024·3346 words·16 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Google DeepMind

Large-scale transformers achieve grandmaster-level chess play via supervised learning on a new 10M game benchmark dataset, demonstrating impressive generalization beyond memorization.

Amortized Fourier Neural Operators

26 September 2024·2070 words·10 mins· loading · loading

Machine Learning Deep Learning 🏢 Qing Yuan Research Institute, SEIEE, Shanghai Jiao Tong University

Amortized Fourier Neural Operators (AM-FNOs) dramatically improve efficiency in solving PDEs by using neural networks for kernel parameterization, achieving up to 31% better accuracy compared to exist…

Amortized Bayesian Experimental Design for Decision-Making

26 September 2024·1746 words·9 mins· loading · loading

Machine Learning Active Learning 🏢 Aalto University

Amortized Decision-Aware BED prioritizes maximizing downstream decision utility by instantly proposing informative experimental designs and inferring decisions, facilitating accurate decision-making.

Amortized Active Causal Induction with Deep Reinforcement Learning

26 September 2024·3383 words·16 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Oxford

CAASL: An amortized active intervention design policy trained via reinforcement learning, enabling adaptive, real-time causal graph inference without likelihood access.