🏢 Pennsylvania State University

Speculative Monte-Carlo Tree Search

26 September 2024·1943 words·10 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Pennsylvania State University

Speculative MCTS accelerates AlphaZero training by implementing speculative execution, enabling parallel processing of future moves and reducing latency by up to 5.8x.

Robust Offline Active Learning on Graphs

26 September 2024·1968 words·10 mins· loading · loading

AI Generated Machine Learning Active Learning 🏢 Pennsylvania State University

This paper introduces a novel offline active learning method for node-level tasks on graphs, incorporating network structure and node covariates to improve efficiency and robustness, especially in noi…

pFedClub: Controllable Heterogeneous Model Aggregation for Personalized Federated Learning

26 September 2024·1986 words·10 mins· loading · loading

Machine Learning Federated Learning 🏢 Pennsylvania State University

pFedClub: Controllable heterogeneous model aggregation boosts personalized federated learning by generating reasonable-sized, personalized models, significantly cutting computational costs.

Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization

26 September 2024·4408 words·21 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Pennsylvania State University

Bi-directional Preference Optimization (BiPO) generates superior steering vectors for personalized LLM control, improving upon existing methods by directly influencing the generation probability of hu…

Newton Informed Neural Operator for Computing Multiple Solutions of Nonlinear Partials Differential Equations

26 September 2024·2011 words·10 mins· loading · loading

AI Generated AI Applications Healthcare 🏢 Pennsylvania State University

Newton Informed Neural Operator efficiently solves nonlinear PDEs with multiple solutions by learning the Newton solver, enabling faster computation and the discovery of new solutions with limited dat…

Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator

26 September 2024·1665 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Pennsylvania State University

Provable near-optimality in meta-RL is achieved using a novel bilevel optimization framework and universal policy adaptation algorithm.

Learn more, but bother less: parameter efficient continual learning

26 September 2024·2442 words·12 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Pennsylvania State University

LB-CL: A novel parameter-efficient continual learning method for LLMs that boosts performance and reduces forgetting by leveraging parametric knowledge transfer and maintaining orthogonal low-rank sub…

In-Trajectory Inverse Reinforcement Learning: Learn Incrementally From An Ongoing Trajectory

26 September 2024·1427 words·7 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Pennsylvania State University

MERIT-IRL: First in-trajectory IRL framework learns reward & policy incrementally from ongoing trajectories, guaranteeing sub-linear regret.

Implicit Regularization of Decentralized Gradient Descent for Sparse Regression

26 September 2024·1887 words·9 mins· loading · loading

AI Generated AI Theory Optimization 🏢 Pennsylvania State University

Decentralized Gradient Descent achieves statistically optimal sparse model learning via implicit regularization, even with communication-efficient truncation.

Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups

26 September 2024·1475 words·7 mins· loading · loading

Machine Learning Federated Learning 🏢 Pennsylvania State University

This paper presents novel algorithms achieving speed-ups in differentially private federated online prediction from experts, addressing both stochastic and oblivious adversaries, with rigorous theoret…

Automated Multi-Task Learning for Joint Disease Prediction on Electronic Health Records

26 September 2024·2109 words·10 mins· loading · loading

AI Generated AI Applications Healthcare 🏢 Pennsylvania State University

AutoDP automates multi-task learning for joint disease prediction on EHRs, significantly improving performance via automated task grouping and architecture search.