Robotics
MeMo: Meaningful, Modular Controllers via Noise Injection
·3100 words·15 mins·
AI Applications
Robotics
🏢 MIT
MeMo: a novel framework for pretraining meaningful, modular robot controllers via noise injection, enabling efficient transfer learning across different robot morphologies and tasks.
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
·3025 words·15 mins·
AI Generated
AI Applications
Robotics
🏢 Tsinghua University
Make-An-Agent generates high-performing robotic control policies from single behavioral demonstrations using behavior-prompted diffusion, showcasing impressive generalization and real-world applicabil…
Long-Horizon Planning for Multi-Agent Robots in Partially Observable Environments
·2334 words·11 mins·
AI Applications
Robotics
🏢 MIT
LLaMAR: LM-based planner for multi-agent robots excels in long-horizon, partially observable tasks, achieving 30% higher success than existing methods.
LLM-based Skill Diffusion for Zero-shot Policy Adaptation
·3701 words·18 mins·
AI Generated
AI Applications
Robotics
🏢 SungKyunKwan University
LLM-based Skill Diffusion (LDuS) enables zero-shot robotic policy adaptation to various contexts specified in natural language by generating controllable skill trajectories via loss-guided diffusion a…
Learning rigid-body simulators over implicit shapes for large-scale scenes and vision
·2932 words·14 mins·
AI Applications
Robotics
🏢 Google DeepMind
SDF-Sim: A novel learned rigid-body simulator that leverages SDFs to achieve unprecedented scalability, enabling simulations with hundreds of objects and millions of nodes.
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training
·2188 words·11 mins·
AI Applications
Robotics
🏢 Hong Kong University of Science and Technology
Actionable AI agents are trained efficiently via a novel framework, VPDD, which uses discrete diffusion to pre-train on massive human videos, and fine-tunes on limited robot data for superior multi-ta…
iVideoGPT: Interactive VideoGPTs are Scalable World Models
·3466 words·17 mins·
AI Applications
Robotics
🏢 Tsinghua University
iVideoGPT: A scalable, interactive world model trained on millions of human & robot manipulation videos, enabling efficient video prediction and model-based reinforcement learning.
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
·2643 words·13 mins·
AI Applications
Robotics
🏢 Princeton University
Robots using LLMs for task planning often make unsafe or wrong decisions due to LLM hallucination and ambiguity in instructions. This paper introduces ‘introspective planning,’ a novel method that us…
Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent Space
·4386 words·21 mins·
AI Applications
Robotics
🏢 Delft University of Technology
Stable closed-loop control in latent space is achieved using a novel Coupled Oscillator Network, offering efficient model-based control for complex nonlinear systems directly from image data.
Humanoid Locomotion as Next Token Prediction
·1485 words·7 mins·
AI Applications
Robotics
🏢 University of California, Berkeley
Humanoid robots now walk in San Francisco zero-shot, thanks to a novel ‘next token prediction’ approach trained on diverse sensorimotor data, enabling real-world generalization and data efficiency.
Grasp as You Say: Language-guided Dexterous Grasp Generation
·2373 words·12 mins·
AI Applications
Robotics
🏢 Stanford University
Robots can now dexterously grasp objects based on natural language commands thanks to DexGYS, a new language-guided dexterous grasp generation framework and dataset.
Graph Learning for Numeric Planning
·2258 words·11 mins·
AI Applications
Robotics
🏢 LAAS-CNRS
GOOSE: a novel framework using graph learning for efficient and interpretable numeric planning, outperforming existing methods in many benchmarks.
GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation
·2482 words·12 mins·
AI Applications
Robotics
🏢 Peking University
GarmentLab: A new benchmark and simulation platform tackles garment manipulation challenges by offering realistic simulations, diverse assets, and tasks bridging the sim-to-real gap for more robust AI…
FactorSim: Generative Simulation via Factorized Representation
·2722 words·13 mins·
AI Applications
Robotics
🏢 Stanford University
FactorSim generates full, coded simulations from natural language descriptions, outperforming existing methods in accuracy and zero-shot transfer learning by using a factored POMDP representation.
EGODE: An Event-attended Graph ODE Framework for Modeling Rigid Dynamics
·1865 words·9 mins·
AI Applications
Robotics
🏢 Peking University
EGODE, a novel framework, leverages coupled graph ODEs and an event module to accurately model continuous and instantaneous changes in rigid body dynamics, outperforming existing methods.
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control
·2471 words·12 mins·
AI Applications
Robotics
🏢 New York University
DynaMo: a novel self-supervised method significantly boosts visuo-motor control by learning in-domain dynamics from limited expert demonstrations, improving policy performance across various environme…
Disentangling Linear Quadratic Control with Untrusted ML Predictions
·1894 words·9 mins·
AI Applications
Robotics
🏢 Chinese University of Hong Kong, Shenzhen
DISC, a novel control policy, disentangles untrusted ML predictions to achieve near-optimal performance when accurate, while guaranteeing competitive ratio bounds even with significant prediction erro…
Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies
·2633 words·13 mins·
AI Applications
Robotics
🏢 Georgia Institute of Technology
DP-Attacker unveils diffusion-based policy vulnerabilities by crafting effective adversarial attacks, significantly impacting robot safety and paving the way for more robust AI.
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
·2643 words·13 mins·
AI Applications
Robotics
🏢 MIT CSAIL
Diffusion Forcing merges next-token prediction and full-sequence diffusion for superior sequence generation.
DiffuserLite: Towards Real-time Diffusion Planning
·1923 words·10 mins·
AI Applications
Robotics
🏢 UC San Diego
DiffuserLite: a super-fast diffusion planning framework achieving real-time performance (122Hz).