Robotics

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
·3880 words·19 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Westlake University
CARP: A novel visuomotor policy learning paradigm achieves high accuracy and 10x faster inference than state-of-the-art by combining autoregressive efficiency and diffusion model precision through a c…
Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment
·2984 words·15 mins
AI Generated 🤗 Daily Papers Computer Vision Robotics 🏢 UC Berkeley
RAPL efficiently aligns robots with human preferences using minimal feedback by aligning visual representations before reward learning.
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation
·3555 words·17 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 University of Hong Kong
Moto uses latent motion tokens as a bridging language for robot manipulation, achieving superior performance with limited data.
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
·6193 words·30 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Peking University
Code-as-Monitor (CaM) uses vision-language models and constraint-aware visual programming to achieve both reactive and proactive robotic failure detection in real-time, improving success rates and red…
WildLMa: Long Horizon Loco-Manipulation in the Wild
·2396 words·12 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 UC San Diego
WildLMa enables robots to perform complex, long-horizon manipulation tasks in unstructured environments by combining language-conditioned imitation learning, a whole-body controller for efficient tele…
Soft Robotic Dynamic In-Hand Pen Spinning
·2419 words·12 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Carnegie Mellon University
SWIFT, a new system, enables a soft robotic hand to learn dynamic pen spinning via real-world trial-and-error, achieving 100% success across diverse pen properties without explicit object modeling.
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation
·2203 words·11 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 New York University
DynaMem empowers robots with online dynamic spatio-semantic memory, achieving a 2x improvement in pick-and-drop success rate on non-stationary objects compared to static systems.
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
·3111 words·15 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Tsinghua University
DeeR-VLA dynamically adjusts the size of a multimodal large language model based on task difficulty, significantly reducing computational cost and memory usage in robotic control without compromising …