Machine Learning
Model Sensitivity Aware Continual Learning
·2062 words·10 mins·
loading
·
loading
Machine Learning
Continual Learning
🏢 University of Maryland College Park
Model Sensitivity Aware Continual Learning (MACL) tackles the CL challenge by optimizing model performance based on parameter distribution, achieving superior old knowledge retention and new task perf…
Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks
·2425 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Zhejiang University
Model LEGO (MDA) revolutionizes deep learning by enabling the creation of new models by assembling and disassembling task-aware components from pre-trained models, eliminating the need for retraining.
Model Based Inference of Synaptic Plasticity Rules
·2178 words·11 mins·
loading
·
loading
Machine Learning
Meta Learning
🏢 Janelia Research Campus
New computational method infers complex brain learning rules from experimental data, revealing active forgetting in reward learning.
Mixture of Link Predictors on Graphs
·3247 words·16 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Shanghai Jiao Tong University
Link-MoE boosts link prediction accuracy by strategically selecting the best model for each node pair, surpassing single-model approaches.
Mixture of Experts Meets Prompt-Based Continual Learning
·1840 words·9 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 VinAI Research
Non-linear Residual Gates (NoRGa) boosts prompt-based continual learning by theoretically framing prefix tuning as adding new experts to a pre-trained Mixture-of-Experts model, achieving state-of-the-…
Mitigating Partial Observability in Decision Processes via the Lambda Discrepancy
·2495 words·12 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 UC Berkeley
New metric, λ-discrepancy, precisely detects & mitigates partial observability in sequential decision processes, significantly boosting reinforcement learning agent performance.
Mitigating Covariate Shift in Behavioral Cloning via Robust Stationary Distribution Correction
·2650 words·13 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 KAIST
DrilDICE robustly tackles covariate shift in offline imitation learning by using a stationary distribution correction and a distributionally robust objective, significantly improving performance.
Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor
·3713 words·18 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 School of Data Science
Proactive Defensive Backdoor (PDB) thwarts malicious backdoors by injecting a hidden defensive backdoor during training, suppressing attacks while maintaining model utility.
MiSO: Optimizing brain stimulation to create neural activity states
·2684 words·13 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
🏢 Carnegie Mellon University
MiSO: a novel closed-loop brain stimulation framework optimizes stimulation parameters to achieve desired neural population activity states, overcoming limitations of current methods by merging data a…
Minimizing UCB: a Better Local Search Strategy in Local Bayesian Optimization
·1728 words·9 mins·
loading
·
loading
Machine Learning
Optimization
🏢 Academy of Mathematics and Systems Science, Chinese Academy of Sciences
MinUCB and LA-MinUCB, novel local Bayesian optimization algorithms, replace gradient descent with UCB minimization for efficient, theoretically-sound local search.
Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning
·1835 words·9 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Duke University
Minimax-optimal, computationally efficient algorithms are proposed for distributionally robust offline reinforcement learning, addressing challenges posed by function approximation and model uncertain…
Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Games
·1956 words·10 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Singapore Management University
IMAX-PPO: A novel multi-agent RL algorithm leveraging imitation learning to predict opponent actions, achieving superior performance in complex games.
Metric Flow Matching for Smooth Interpolations on the Data Manifold
·2425 words·12 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
🏢 University of Oxford
METRIC FLOW MATCHING (MFM) generates smooth interpolations on data manifolds by minimizing kinetic energy, outperforming Euclidean methods and achieving state-of-the-art results in single-cell traject…
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
·362 words·2 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Inria
MetaCURL: First algorithm for non-stationary Concave Utility Reinforcement Learning (CURL), achieving near-optimal dynamic regret by using a meta-algorithm and sleeping experts framework.
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
·1665 words·8 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Pennsylvania State University
Provable near-optimality in meta-RL is achieved using a novel bilevel optimization framework and universal policy adaptation algorithm.
Meta-Learning Universal Priors Using Non-Injective Change of Variables
·2169 words·11 mins·
loading
·
loading
AI Generated
Machine Learning
Meta Learning
🏢 University of Minnesota
MetaNCoV: Learn data-driven priors via non-injective change of variables for enhanced few-shot learning.
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
·4081 words·20 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Nanjing University
Meta-DT: Offline meta-RL masters unseen tasks via conditional sequence modeling and world model disentanglement, showcasing superior few-shot and zero-shot generalization.
Meta-Controller: Few-Shot Imitation of Unseen Embodiments and Tasks in Continuous Control
·3389 words·16 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 School of Computing, KAIST
Meta-Controller: A novel few-shot behavior cloning framework enables robots to generalize to unseen embodiments and tasks using only a few reward-free demonstrations, showcasing superior few-shot gene…
Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization
·3095 words·15 mins·
loading
·
loading
AI Generated
Machine Learning
Meta Learning
🏢 National University of Singapore
FG²U: a novel memory-efficient algorithm for unbiased stochastic approximation of meta-gradients in large-scale bi-level optimization, showing superior performance across diverse tasks.
Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration
·2042 words·10 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Xi'an Jiaotong University
MADPO, a novel MARL framework, uses mutual policy divergence maximization with conditional Cauchy-Schwarz divergence to enhance exploration and agent heterogeneity in sequential updating, outperformin…