Skip to main content

🏢 Pennsylvania State University

Speculative Monte-Carlo Tree Search
·1943 words·10 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Pennsylvania State University
Speculative MCTS accelerates AlphaZero training by implementing speculative execution, enabling parallel processing of future moves and reducing latency by up to 5.8x.
Robust Offline Active Learning on Graphs
·1968 words·10 mins· loading · loading
AI Generated Machine Learning Active Learning 🏢 Pennsylvania State University
This paper introduces a novel offline active learning method for node-level tasks on graphs, incorporating network structure and node covariates to improve efficiency and robustness, especially in noi…
pFedClub: Controllable Heterogeneous Model Aggregation for Personalized Federated Learning
·1986 words·10 mins· loading · loading
Machine Learning Federated Learning 🏢 Pennsylvania State University
pFedClub: Controllable heterogeneous model aggregation boosts personalized federated learning by generating reasonable-sized, personalized models, significantly cutting computational costs.
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
·4408 words·21 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Pennsylvania State University
Bi-directional Preference Optimization (BiPO) generates superior steering vectors for personalized LLM control, improving upon existing methods by directly influencing the generation probability of hu…
Newton Informed Neural Operator for Computing Multiple Solutions of Nonlinear Partials Differential Equations
·2011 words·10 mins· loading · loading
AI Generated AI Applications Healthcare 🏢 Pennsylvania State University
Newton Informed Neural Operator efficiently solves nonlinear PDEs with multiple solutions by learning the Newton solver, enabling faster computation and the discovery of new solutions with limited dat…
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
·1665 words·8 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Pennsylvania State University
Provable near-optimality in meta-RL is achieved using a novel bilevel optimization framework and universal policy adaptation algorithm.
Learn more, but bother less: parameter efficient continual learning
·2442 words·12 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Pennsylvania State University
LB-CL: A novel parameter-efficient continual learning method for LLMs that boosts performance and reduces forgetting by leveraging parametric knowledge transfer and maintaining orthogonal low-rank sub…
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally From An Ongoing Trajectory
·1427 words·7 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Pennsylvania State University
MERIT-IRL: First in-trajectory IRL framework learns reward & policy incrementally from ongoing trajectories, guaranteeing sub-linear regret.
Implicit Regularization of Decentralized Gradient Descent for Sparse Regression
·1887 words·9 mins· loading · loading
AI Generated AI Theory Optimization 🏢 Pennsylvania State University
Decentralized Gradient Descent achieves statistically optimal sparse model learning via implicit regularization, even with communication-efficient truncation.
Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups
·1475 words·7 mins· loading · loading
Machine Learning Federated Learning 🏢 Pennsylvania State University
This paper presents novel algorithms achieving speed-ups in differentially private federated online prediction from experts, addressing both stochastic and oblivious adversaries, with rigorous theoret…
Automated Multi-Task Learning for Joint Disease Prediction on Electronic Health Records
·2109 words·10 mins· loading · loading
AI Generated AI Applications Healthcare 🏢 Pennsylvania State University
AutoDP automates multi-task learning for joint disease prediction on EHRs, significantly improving performance via automated task grouping and architecture search.