🏢 RIKEN AIP
SLTrain: a sparse plus low rank approach for parameter and memory efficient pretraining
·4422 words·21 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 RIKEN AIP
SLTrain: Sparsity+low-rank pretraining boosts LLM efficiency by up to 73% memory reduction without performance loss!
On the Comparison between Multi-modal and Single-modal Contrastive Learning
·455 words·3 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
🏢 RIKEN AIP
Multi-modal contrastive learning surpasses single-modal by leveraging inter-modal correlations to improve feature learning and downstream task performance, as demonstrated through a novel theoretical …
Generalized Tensor Decomposition for Understanding Multi-Output Regression under Combinatorial Shifts
·1493 words·8 mins·
loading
·
loading
Machine Learning
Multi-Output Regression
🏢 RIKEN AIP
This paper proposes Functional t-SVD and ERM-DS to solve multi-output regression under Combinatorial Distribution Shift (CDS), providing robust performance guarantees.
A Framework for Bilevel Optimization on Riemannian Manifolds
·1520 words·8 mins·
loading
·
loading
Machine Learning
Meta Learning
🏢 RIKEN AIP
This paper introduces a novel framework for bilevel optimization on Riemannian manifolds, providing efficient hypergradient estimation strategies and convergence analysis, with successful applications…