AI Applications
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
·2924 words·14 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Autonomous Vehicles
🏢 Institute of Artificial Intelligence, Huazhong University of Science and Technology
DiffusionDrive: a novel truncated diffusion model achieves real-time, high-quality end-to-end autonomous driving by leveraging multi-mode action distributions and significantly reducing computational …
Soft Robotic Dynamic In-Hand Pen Spinning
·2419 words·12 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Robotics
🏢 Carnegie Mellon University
SWIFT, a new system, enables a soft robotic hand to learn dynamic pen spinning via real-world trial-and-error, achieving 100% success across diverse pen properties without explicit object modeling.
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use
·614 words·3 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Human-AI Interaction
🏢 Show Lab, National University of Singapore
Claude 3.5 Computer Use: A groundbreaking AI model offering public beta graphical user interface (GUI) agent for computer use is comprehensively analyzed in this research. This study provides an out-o…
Hermes: A Large Language Model Framework on the Journey to Autonomous Networks
·1636 words·8 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Autonomous Vehicles
🏢 Paris Research Center, Huawei Technologies
Hermes, a novel LLM-based framework, automates cellular network modeling by generating explainable ‘blueprints’ for constructing Network Digital Twins (NDTs), paving the way for fully autonomous netwo…
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation
·2203 words·11 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Robotics
🏢 New York University
DynaMem empowers robots with online dynamic spatio-semantic memory, achieving a 2x improvement in pick-and-drop success rate on non-stationary objects compared to static systems.
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
·3111 words·15 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Robotics
🏢 Tsinghua University
DeeR-VLA dynamically adjusts the size of a multimodal large language model based on task difficulty, significantly reducing computational cost and memory usage in robotic control without compromising …
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
·6756 words·32 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Human-AI Interaction
🏢 Southeast University
Collaborative Assistant for Personalized Exploration (CARE) enhances LLM chatbots for exploratory tasks by combining a multi-agent framework with a structured interface, delivering tailored solutions …