Skip to main content

Deep Learning

Even Sparser Graph Transformers
·2059 words·10 mins· loading · loading
Machine Learning Deep Learning 🏢 University of British Columbia
Spexphormer achieves significant memory reduction in graph Transformers by leveraging a two-stage training process that leverages attention score consistency across network widths to effectively spars…
Evaluating the design space of diffusion-based generative models
·378 words·2 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 UC Berkeley
This paper provides the first complete error analysis for diffusion models, theoretically justifying optimal training and sampling strategies and design choices for enhanced generative capabilities.
Euclidean distance compression via deep random features
·1451 words·7 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 UC Davis
Deep random features enable efficient Euclidean distance compression, offering improved bit storage compared to linear methods for specific parameter ranges, thus significantly advancing high-dimensio…
Estimating Epistemic and Aleatoric Uncertainty with a Single Model
·2171 words·11 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Maryland
HyperDM accurately estimates both epistemic and aleatoric uncertainty using a single model, overcoming the computational limitations of existing ensemble methods.
Error Correction Output Codes for Robust Neural Networks against Weight-errors: A Neural Tangent Kernel Point of View
·1996 words·10 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Lehigh University
Boosting neural network robustness against weight errors, this research leverages neural tangent kernels to theoretically explain and optimize error-correcting output codes (ECOCs), achieving superior…
Equivariant Neural Diffusion for Molecule Generation
·1916 words·9 mins· loading · loading
Machine Learning Deep Learning 🏢 Technical University of Denmark
Equivariant Neural Diffusion (END) revolutionizes 3D molecule generation with a learnable forward process, achieving state-of-the-art results and enhanced controllability.
Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters
·2665 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 Viterbi Faculty of Electrical and Computer Engineering, Technion
Nonlinear spectral filters (NLSFs) enable fully equivariant graph neural networks, improving accuracy and generalization.
Equivariant Blurring Diffusion for Hierarchical Molecular Conformer Generation
·2848 words·14 mins· loading · loading
Machine Learning Deep Learning 🏢 Texas A&M University
Equivariant Blurring Diffusion (EBD) generates 3D molecular conformers hierarchically, first creating coarse-grained fragments then refining atomic details, significantly outperforming existing method…
EnOF-SNN: Training Accurate Spiking Neural Networks via Enhancing the Output Feature
·1417 words·7 mins· loading · loading
Machine Learning Deep Learning 🏢 Peking University
EnOF-SNN boosts spiking neural network (SNN) accuracy by enhancing output feature representation using a novel knowledge distillation method and ReLU activation, outperforming current state-of-the-art…
Enhancing Protein Mutation Effect Prediction through a Retrieval-Augmented Framework
·1980 words·10 mins· loading · loading
Machine Learning Deep Learning 🏢 Tsinghua University
Revolutionizing protein mutation effect prediction, this work introduces a retrieval-augmented framework achieving state-of-the-art accuracy by efficiently incorporating similar local structure inform…
Enhancing Diversity in Bayesian Deep Learning via Hyperspherical Energy Minimization of CKA
·3150 words·15 mins· loading · loading
Machine Learning Deep Learning 🏢 Oregon State University
Boosting Bayesian deep learning’s diversity and uncertainty quantification, this study proposes hyperspherical energy minimization of CKA to generate diverse and reliable neural network ensembles and …
Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces
·2907 words·14 mins· loading · loading
Machine Learning Deep Learning 🏢 Imperial College London
Train discrete EBMs efficiently with Energy Discrepancy, a novel loss function that eliminates the need for Markov Chain Monte Carlo, using diffusion processes on structured spaces.
Energy-based Hopfield Boosting for Out-of-Distribution Detection
·4678 words·22 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Institute for Machine Learning
Hopfield Boosting, a novel energy-based boosting approach, achieves state-of-the-art OOD detection by leveraging Hopfield energy to sharpen the decision boundary between in-distribution and out-of-dis…
ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer
·2198 words·11 mins· loading · loading
Machine Learning Deep Learning 🏢 Microsoft Research
ElasTST: A novel time-series transformer enables robust forecasting across various horizons without per-horizon training, enhancing adaptability and accuracy.
einspace: Searching for Neural Architectures from Fundamental Operations
·4610 words·22 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 School of Engineering
Einspace: A novel neural architecture search space built from fundamental operations, enabling discovery of diverse high-performing network architectures and surpassing existing NAS methods.
EGonc : Energy-based Open-Set Node Classification with substitute Unknowns
·1692 words·8 mins· loading · loading
Machine Learning Deep Learning 🏢 Shenzhen University
EGonc, a novel energy-based open-set node classification method, leverages substitute unknowns and energy scores to achieve superior accuracy and robustness in classifying nodes from known classes whi…
Dynamic Rescaling for Training GNNs
·1913 words·9 mins· loading · loading
Machine Learning Deep Learning 🏢 CISPA
Dynamic rescaling boosts GNN training by controlling layer learning speeds and balancing networks, leading to faster training and improved generalization, especially on heterophilic graphs.
Dynamic Neural Regeneration: Enhancing Deep Learning Generalization on Small Datasets
·2806 words·14 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Eindhoven University of Technology
Dynamic Neural Regeneration (DNR) enhances deep learning generalization on small datasets using a data-aware dynamic masking scheme inspired by neurogenesis.
Dynamic Conditional Optimal Transport through Simulation-Free Flows
·2192 words·11 mins· loading · loading
Machine Learning Deep Learning 🏢 UC Irvine
Simulation-free flow generates conditional distributions via dynamic conditional optimal transport.
Dual Cone Gradient Descent for Training Physics-Informed Neural Networks
·3668 words·18 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Artificial Intelligence Graduate School UNIST
Dual Cone Gradient Descent (DCGD) enhances Physics-Informed Neural Network (PINN) training by resolving gradient imbalance issues, leading to more accurate and stable solutions for complex partial dif…