Robotics

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

9 December 2024·3880 words·19 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Westlake University

CARP: A novel visuomotor policy learning paradigm achieves high accuracy and 10x faster inference than state-of-the-art by combining autoregressive efficiency and diffusion model precision through a c…

Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment

6 December 2024·2984 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Robotics 🏢 UC Berkeley

RAPL efficiently aligns robots with human preferences using minimal feedback by aligning visual representations before reward learning.

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

5 December 2024·3555 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 University of Hong Kong

Moto: Bridging language for robot manipulation using latent motion tokens, achieving superior performance with limited data.

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

5 December 2024·6193 words·30 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Peking University

Code-as-Monitor (CaM) uses vision-language models and constraint-aware visual programming to achieve both reactive and proactive robotic failure detection in real-time, improving success rates and red…

WildLMa: Long Horizon Loco-Manipulation in the Wild

22 November 2024·2396 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 UC San Diego

WildLMa enables robots to perform complex, long-horizon manipulation tasks in unstructured environments by combining language-conditioned imitation learning, a whole-body controller for efficient tele…

Soft Robotic Dynamic In-Hand Pen Spinning

19 November 2024·2419 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Carnegie Mellon University

SWIFT, a new system, enables a soft robotic hand to learn dynamic pen spinning via real-world trial-and-error, achieving 100% success across diverse pen properties without explicit object modeling.

DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

7 November 2024·2203 words·11 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 New York University

DynaMem empowers robots with online dynamic spatio-semantic memory, achieving a 2x improvement in pick-and-drop success rate on non-stationary objects compared to static systems.

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

4 November 2024·3111 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Tsinghua University

DeeR-VLA dynamically adjusts the size of a multimodal large language model based on task difficulty, significantly reducing computational cost and memory usage in robotic control without compromising …