↓Skip to main content

🏢 California Institute of Technology

Universality in Transfer Learning for Linear Models

26 September 2024·1460 words·7 mins· loading · loading

AI Generated Machine Learning Transfer Learning 🏢 California Institute of Technology

Linear model transfer learning achieves universal generalization error improvements, depending only on first and second-order target statistics, and defying Gaussian assumptions.

Understanding Model Selection for Learning in Strategic Environments

26 September 2024·394 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 California Institute of Technology

Larger machine learning models don’t always mean better performance; strategic interactions can reverse this trend, as this research shows, prompting a new paradigm for model selection in games.

Practical Bayesian Algorithm Execution via Posterior Sampling

26 September 2024·2028 words·10 mins· loading · loading

AI Generated Machine Learning Active Learning 🏢 California Institute of Technology

PS-BAX, a novel Bayesian algorithm execution method using posterior sampling, efficiently selects evaluation points for complex tasks, outperforming existing methods in speed and scalability.

Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training

26 September 2024·2712 words·13 mins· loading · loading

Natural Language Processing Large Language Models 🏢 California Institute of Technology

MINI-SEQUENCE TRANSFORMER (MST) drastically reduces memory usage in LLM training by processing mini-sequences iteratively, enabling training with 12-24x longer sequences than conventional methods with…