Machine Learning

Model Sensitivity Aware Continual Learning

26 September 2024·2062 words·10 mins· loading · loading

Machine Learning Continual Learning 🏢 University of Maryland College Park

Model Sensitivity Aware Continual Learning (MACL) tackles the CL challenge by optimizing model performance based on parameter distribution, achieving superior old knowledge retention and new task perf…

Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks

26 September 2024·2425 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 Zhejiang University

Model LEGO (MDA) revolutionizes deep learning by enabling the creation of new models by assembling and disassembling task-aware components from pre-trained models, eliminating the need for retraining.

Model Based Inference of Synaptic Plasticity Rules

26 September 2024·2178 words·11 mins· loading · loading

Machine Learning Meta Learning 🏢 Janelia Research Campus

New computational method infers complex brain learning rules from experimental data, revealing active forgetting in reward learning.

Mixture of Link Predictors on Graphs

26 September 2024·3247 words·16 mins· loading · loading

Machine Learning Deep Learning 🏢 Shanghai Jiao Tong University

Link-MoE boosts link prediction accuracy by strategically selecting the best model for each node pair, surpassing single-model approaches.

Mixture of Experts Meets Prompt-Based Continual Learning

26 September 2024·1840 words·9 mins· loading · loading

Machine Learning Deep Learning 🏢 VinAI Research

Non-linear Residual Gates (NoRGa) boosts prompt-based continual learning by theoretically framing prefix tuning as adding new experts to a pre-trained Mixture-of-Experts model, achieving state-of-the-…

Mitigating Partial Observability in Decision Processes via the Lambda Discrepancy

26 September 2024·2495 words·12 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 UC Berkeley

New metric, λ-discrepancy, precisely detects & mitigates partial observability in sequential decision processes, significantly boosting reinforcement learning agent performance.

Mitigating Covariate Shift in Behavioral Cloning via Robust Stationary Distribution Correction

26 September 2024·2650 words·13 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 KAIST

DrilDICE robustly tackles covariate shift in offline imitation learning by using a stationary distribution correction and a distributionally robust objective, significantly improving performance.

Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor

26 September 2024·3713 words·18 mins· loading · loading

Machine Learning Deep Learning 🏢 School of Data Science

Proactive Defensive Backdoor (PDB) thwarts malicious backdoors by injecting a hidden defensive backdoor during training, suppressing attacks while maintaining model utility.

MiSO: Optimizing brain stimulation to create neural activity states

26 September 2024·2684 words·13 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Carnegie Mellon University

MiSO: a novel closed-loop brain stimulation framework optimizes stimulation parameters to achieve desired neural population activity states, overcoming limitations of current methods by merging data a…

Minimizing UCB: a Better Local Search Strategy in Local Bayesian Optimization

26 September 2024·1728 words·9 mins· loading · loading

Machine Learning Optimization 🏢 Academy of Mathematics and Systems Science, Chinese Academy of Sciences

MinUCB and LA-MinUCB, novel local Bayesian optimization algorithms, replace gradient descent with UCB minimization for efficient, theoretically-sound local search.

Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

26 September 2024·1835 words·9 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Duke University

Minimax-optimal, computationally efficient algorithms are proposed for distributionally robust offline reinforcement learning, addressing challenges posed by function approximation and model uncertain…

Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Games

26 September 2024·1956 words·10 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Singapore Management University

IMAX-PPO: A novel multi-agent RL algorithm leveraging imitation learning to predict opponent actions, achieving superior performance in complex games.

Metric Flow Matching for Smooth Interpolations on the Data Manifold

26 September 2024·2425 words·12 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 University of Oxford

METRIC FLOW MATCHING (MFM) generates smooth interpolations on data manifolds by minimizing kinetic energy, outperforming Euclidean methods and achieving state-of-the-art results in single-cell traject…

MetaCURL: Non-stationary Concave Utility Reinforcement Learning

26 September 2024·362 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Inria

MetaCURL: First algorithm for non-stationary Concave Utility Reinforcement Learning (CURL), achieving near-optimal dynamic regret by using a meta-algorithm and sleeping experts framework.

Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator

26 September 2024·1665 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Pennsylvania State University

Provable near-optimality in meta-RL is achieved using a novel bilevel optimization framework and universal policy adaptation algorithm.

Meta-Learning Universal Priors Using Non-Injective Change of Variables

26 September 2024·2169 words·11 mins· loading · loading

AI Generated Machine Learning Meta Learning 🏢 University of Minnesota

MetaNCoV: Learn data-driven priors via non-injective change of variables for enhanced few-shot learning.

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

26 September 2024·4081 words·20 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Nanjing University

Meta-DT: Offline meta-RL masters unseen tasks via conditional sequence modeling and world model disentanglement, showcasing superior few-shot and zero-shot generalization.

Meta-Controller: Few-Shot Imitation of Unseen Embodiments and Tasks in Continuous Control

26 September 2024·3389 words·16 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 School of Computing, KAIST

Meta-Controller: A novel few-shot behavior cloning framework enables robots to generalize to unseen embodiments and tasks using only a few reward-free demonstrations, showcasing superior few-shot gene…

Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization

26 September 2024·3095 words·15 mins· loading · loading

AI Generated Machine Learning Meta Learning 🏢 National University of Singapore

FG²U: a novel memory-efficient algorithm for unbiased stochastic approximation of meta-gradients in large-scale bi-level optimization, showing superior performance across diverse tasks.

Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration

26 September 2024·2042 words·10 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Xi'an Jiaotong University

MADPO, a novel MARL framework, uses mutual policy divergence maximization with conditional Cauchy-Schwarz divergence to enhance exploration and agent heterogeneity in sequential updating, outperformin…