Skip to main content

Deep Learning

The Feature Speed Formula: a flexible approach to scale hyper-parameters of deep neural networks
·1538 words·8 mins· loading · loading
Machine Learning Deep Learning 🏒 Institute of Mathematics, EPFL
New ‘Feature Speed Formula’ predicts & controls deep learning’s hierarchical feature learning by linking hyperparameter tuning to the angle between feature updates and backward pass.
The Best of Both Worlds: On the Dilemma of Out-of-distribution Detection
·2465 words·12 mins· loading · loading
Machine Learning Deep Learning 🏒 Tencent AI Lab
Researchers found that superior OOD detection performance comes at the cost of reduced generalization. Their novel Decoupled Uncertainty Learning (DUL) algorithm harmonizes OOD detection and generali…
Task Confusion and Catastrophic Forgetting in Class-Incremental Learning: A Mathematical Framework for Discriminative and Generative Modelings
·1620 words·8 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 Queen's University
Researchers unveil the Infeasibility Theorem, proving optimal class-incremental learning is impossible with discriminative models due to task confusion, and the Feasibility Theorem, showing generative…
Take A Shortcut Back: Mitigating the Gradient Vanishing for Training Spiking Neural Networks
·1272 words·6 mins· loading · loading
Machine Learning Deep Learning 🏒 Peking University
Shortcut back-propagation and an evolutionary training framework conquer gradient vanishing in spiking neural networks, drastically improving training and achieving state-of-the-art accuracy.
Symmetry-Informed Governing Equation Discovery
·2189 words·11 mins· loading · loading
Machine Learning Deep Learning 🏒 UC San Diego
Leveraging symmetry in automated equation discovery improves accuracy and simplicity of learned governing equations, enhancing robustness against noise and achieving higher success rates across divers…
Swift Sampler: Efficient Learning of Sampler by 10 Parameters
·2624 words·13 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 University of Washington
Swift Sampler (SS) automates the learning of efficient data samplers for deep learning, achieving significant performance gains (e.g., 1.5% on ImageNet) with minimal computational cost using only 10 p…
Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling
·2206 words·11 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 Tencent Hunyuan
Deep learning’s Adam-style optimizers exhibit a surprising surge phenomenon: optimal learning rates initially increase, then decrease, before converging to a non-zero value as batch size grows.
Supra-Laplacian Encoding for Transformer on Dynamic Graphs
·2790 words·14 mins· loading · loading
Machine Learning Deep Learning 🏒 Conservatoire National Des Arts Et Métiers
SLATE: Supra-Laplacian encoding for spatio-temporal Transformers achieves state-of-the-art dynamic link prediction by innovatively using a multi-layer graph representation and a unique cross-attention…
Super Consistency of Neural Network Landscapes and Learning Rate Transfer
·3859 words·19 mins· loading · loading
Machine Learning Deep Learning 🏒 ETH Zurich
Neural network hyperparameter transferability across vastly different model sizes is achieved via a newly discovered property called ‘Super Consistency’ of loss landscapes.
Structured Matrix Basis for Multivariate Time Series Forecasting with Interpretable Dynamics
·2313 words·11 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 Harbin Institute of Technology
Sumba: a novel forecasting model achieves up to 8.5% improvement by using a structured matrix basis to generate dynamic spatial structures with lower variance and better interpretability.
Structural Inference of Dynamical Systems with Conjoined State Space Models
·2686 words·13 mins· loading · loading
Machine Learning Deep Learning 🏒 University of Luxembourg
SICSM, a novel framework, integrates selective SSMs and GFNs to accurately infer complex dynamical system structures from irregularly sampled, partially observed trajectories.
Stochastic Optimal Control for Diffusion Bridges in Function Spaces
·2194 words·11 mins· loading · loading
Machine Learning Deep Learning 🏒 KAIST
Researchers extended stochastic optimal control theory to infinite-dimensional spaces, enabling the creation of diffusion bridges for generative modeling in function spaces, demonstrating applications…
Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel Machines
·1434 words·7 mins· loading · loading
Machine Learning Deep Learning 🏒 University of Bristol
Deep kernel machines now achieve 94.5% accuracy on CIFAR-10, matching neural networks, by using stochastic kernel regularization to improve generalization.
Stepping on the Edge: Curvature Aware Learning Rate Tuners
·2482 words·12 mins· loading · loading
Machine Learning Deep Learning 🏒 Google DeepMind
Adaptive learning rate tuners often underperform; Curvature Dynamics Aware Tuning (CDAT) prioritizes long-term curvature stabilization, outperforming tuned constant learning rates.
State Space Models on Temporal Graphs: A First-Principles Study
·1794 words·9 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 Sun Yat-Sen University
GRAPHSSM: a novel graph state space model efficiently captures temporal graph dynamics, overcoming limitations of existing sequence models.
ST$_k$: A Scalable Module for Solving Top-k Problems
·2467 words·12 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 School of Mathematical Sciences
STk: a novel, differentiable module solves Top-k problems in neural networks without extra time/GPU memory, boosting performance in long-tailed learning.
Spiking Token Mixer: A event-driven friendly Former structure for spiking neural networks
·1852 words·9 mins· loading · loading
Machine Learning Deep Learning 🏒 University of Electronic Science and Technology of China
STMixer: a novel SNN architecture enabling high performance on both synchronous and asynchronous neuromorphic hardware, achieving comparable results to spiking transformers with drastically lower powe…
Spiking Graph Neural Network on Riemannian Manifolds
·3421 words·17 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 North China Electric Power University
Spiking Graph Neural Networks (SGNNs) on Riemannian Manifolds achieve superior performance and energy efficiency via a novel Manifold Spiking GNN (MSG).
Spectral Learning of Shared Dynamics Between Generalized-Linear Processes
·2151 words·11 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏒 University of Southern California
PGLDM, a novel algorithm, accurately identifies shared and private dynamics in two generalized-linear time series, improving model accuracy and enabling lower-dimensional latent state representations.
Spatio-Spectral Graph Neural Networks
·4413 words·21 mins· loading · loading
Machine Learning Deep Learning 🏒 Technical University of Munich
Spatio-Spectral GNNs synergistically combine spatial and spectral graph filters for efficient, global information propagation, overcoming limitations of existing methods.