Skip to main content

🏢 Carnegie Mellon University

Linear Causal Representation Learning from Unknown Multi-node Interventions
·418 words·2 mins· loading · loading
AI Theory Causality 🏢 Carnegie Mellon University
Unlocking Causal Structures: New algorithms identify latent causal relationships from interventions, even when multiple variables are affected simultaneously.
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
·2514 words·12 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Carnegie Mellon University
Boosting complex visual reasoning, a new Iterative and Parallel Reasoning Mechanism (IPRM) outperforms existing methods by combining step-by-step and simultaneous computations, improving accuracy and …
Learning Social Welfare Functions
·2157 words·11 mins· loading · loading
AI Theory Optimization 🏢 Carnegie Mellon University
Learning social welfare functions from past decisions is possible! This paper shows how to efficiently learn power mean functions, a widely used family, using both cardinal and pairwise welfare compar…
Learning Discrete Latent Variable Structures with Tensor Rank Conditions
·2759 words·13 mins· loading · loading
AI Generated AI Theory Causality 🏢 Carnegie Mellon University
This paper introduces a novel tensor rank condition for identifying causal structures among discrete latent variables, advancing causal discovery in complex scenarios.
Learning Discrete Concepts in Latent Hierarchical Models
·2302 words·11 mins· loading · loading
AI Theory Interpretability 🏢 Carnegie Mellon University
This paper introduces a novel framework for learning discrete concepts from high-dimensional data, establishing theoretical conditions for identifying underlying hierarchical causal structures and pro…
John Ellipsoids via Lazy Updates
·311 words·2 mins· loading · loading
AI Theory Optimization 🏢 Carnegie Mellon University
Faster John ellipsoid computation achieved via lazy updates and fast matrix multiplication, improving efficiency and enabling low-space streaming algorithms.
Interventional Causal Discovery in a Mixture of DAGs
·1892 words·9 mins· loading · loading
AI Generated AI Theory Causality 🏢 Carnegie Mellon University
This study presents CADIM, an adaptive algorithm using interventions to learn true causal relationships from mixtures of DAGs, achieving near-optimal intervention sizes and providing quantifiable opti…
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
·2821 words·14 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Carnegie Mellon University
IsCiL: a novel adapter-based continual imitation learning framework that efficiently adapts to new tasks by incrementally learning and retrieving reusable skills.
In-Context Learning with Representations: Contextual Generalization of Trained Transformers
·1880 words·9 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Carnegie Mellon University
Transformers learn contextual information for generalization to unseen examples and tasks, even with limited training data, converging linearly to a global minimum.
Improving the Training of Rectified Flows
·4681 words·22 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Carnegie Mellon University
Researchers significantly boosted the efficiency and quality of rectified flow, a method for generating samples from diffusion models, by introducing novel training techniques that surpass state-of-th…
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations
·2869 words·14 mins· loading · loading
AI Generated Machine Learning Semi-Supervised Learning 🏢 Carnegie Mellon University
Unified framework for imprecise label learning handles noisy, partial, and semi-supervised data, improving model training efficiency and accuracy.
Implicit Regularization Paths of Weighted Neural Representations
·1797 words·9 mins· loading · loading
AI Theory Generalization 🏢 Carnegie Mellon University
Weighted pretrained features implicitly regularize models, and this paper reveals equivalent paths between weighting schemes and ridge regularization, enabling efficient hyperparameter tuning.
Identifying Selections for Unsupervised Subtask Discovery
·3702 words·18 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Carnegie Mellon University
This paper introduces seq-NMF, a novel method for unsupervised subtask discovery in reinforcement learning that leverages selection variables to enhance generalization and data efficiency.
Identifying Latent State-Transition Processes for Individualized Reinforcement Learning
·2375 words·12 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Carnegie Mellon University
This study introduces a novel framework for individualized reinforcement learning, guaranteeing the identifiability of latent factors influencing state transitions and providing a practical method for…
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers
·2522 words·12 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Carnegie Mellon University
Hydra: Bidirectional sequence modeling redefined with quasiseparable matrix mixers, outperforming existing models on various benchmarks!
Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba
·3671 words·18 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Carnegie Mellon University
Hamba: a novel graph-guided framework for single-view 3D hand reconstruction, significantly outperforms existing methods by efficiently modeling spatial relationships between joints using a fraction o…
Global Rewards in Restless Multi-Armed Bandits
·2031 words·10 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Carnegie Mellon University
Restless multi-armed bandits with global rewards (RMAB-G) are introduced, extending the model to handle non-separable rewards and offering novel index-based and adaptive policies that outperform exist…
GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration
·2622 words·13 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Carnegie Mellon University
GL-NeRF accelerates NeRF rendering by using Gauss-Laguerre quadrature, drastically reducing MLP calls without needing additional networks or data structures.
From Causal to Concept-Based Representation Learning
·1733 words·9 mins· loading · loading
AI Theory Representation Learning 🏢 Carnegie Mellon University
This paper introduces a novel geometric approach to concept-based representation learning, provably recovering interpretable concepts from diverse data without strict causal assumptions or many interv…
Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning
·1640 words·8 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Carnegie Mellon University
This paper introduces federated natural policy gradient and actor-critic methods achieving near dimension-free global convergence for decentralized multi-task reinforcement learning, a significant bre…