Skip to main content

AI Applications

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving
·2247 words·11 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Education 🏢 Yale University
PHYSICS: A new benchmark reveals foundation models struggle with university-level physics, highlighting needs for improved reasoning and knowledge integration.
ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems
·2349 words·12 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Autonomous Vehicles 🏢 Zhejiang University
ADS-Edit: Empowering autonomous driving with multimodal knowledge editing!
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
·3847 words·19 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Shanghai AI Lab
Dita: Scales a diffusion transformer for generalist robot policies, enabling 10-shot learning in complex, real-world tasks.
PathoHR: Breast Cancer Survival Prediction on High-Resolution Pathological Images
·1466 words·7 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 XJLTU
PathoHR: Boost breast cancer survival prediction with high-resolution pathology images!
AgentRxiv: Towards Collaborative Autonomous Research
·1858 words·9 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 Johns Hopkins University
AgentRxiv enables collaborative autonomous research via LLM agent preprint sharing, boosting performance and discovery.
Position: Interactive Generative Video as Next-Generation Game Engine
·1964 words·10 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Gaming 🏢 Hong Kong University of Science and Technology
Interactive Generative Video (IGV) can revolutionize game creation by using AI to generate endless, novel content for next-gen game engines.
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow
·1815 words·9 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 National University of Singapore
MedAgent-Pro: An evidence-based reasoning agentic system for reliable multi-modal medical diagnosis.
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
·2985 words·15 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Finance 🏢 Shanghai University of Finance and Economics
Fin-R1: Financial reasoning via RL.
AIMI: Leveraging Future Knowledge and Personalization in Sparse Event Forecasting for Treatment Adherence
·2151 words·11 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 Arizona State University
AIMI: A system leveraging future knowledge & personalized data for accurate treatment adherence forecasting, paving the way for timely mobile interventions.
ELTEX: A Framework for Domain-Driven Synthetic Data Generation
·2296 words·11 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Security 🏢 Distributed Networks Institute (DNI)
ELTEX: Domain-driven synthetic data generation framework improves LLM performance in cybersecurity with less resources.
API Agents vs. GUI Agents: Divergence and Convergence
·2038 words·10 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Microsoft
API vs. GUI Agents: Understanding the divergence and convergence in LLM-based automation.
Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning
·2655 words·13 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Shanghai Jiao Tong University
ADC: Human-robot collaboration revolutionizes data collection, slashing data needs and boosting robot learning!
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
·1710 words·9 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Tsinghua University
KUDA unifies dynamics learning and visual prompting with keypoints for open-vocabulary robot manipulation.
AI-native Memory 2.0: Second Me
·1327 words·7 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Human-AI Interaction 🏢 Mindverse.ai
AI-native memory 2.0 presents second me, an AI system for personal knowledge management.
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
·2900 words·14 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 Yale University
MEDAGENTSBENCH: a new benchmark for assessing complex medical reasoning in LLMs, revealing performance gaps and cost-effective strategies.
Multi Agent based Medical Assistant for Edge Devices
·2191 words·11 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 Samsung Research
On-device multi-agent system overcomes privacy/latency issues in healthcare, enabling personalized, scalable AI assistance.
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving
·3004 words·15 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Autonomous Vehicles 🏢 School of Artificial Intelligence, University of Chinese Academy of Sciences
GoalFlow: A novel approach to enhance multimodal trajectory generation for autonomous driving using goal-driven flow matching.
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities
·5279 words·25 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Stanford University
BRS: Streamlining real-world whole-body manipulation for household activities. It introduces a robot suite tackling robot dexterity with bimanual coordination, navigation, and end-effector reach.
LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding
·2588 words·13 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Software Engineering 🏢 Peking University
LONGCODEU: A new benchmark to challenge & enhance long code understanding in language models for software engineering!
SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models
·2619 words·13 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Healthcare 🏢 HistAI
SPIDER: A comprehensive pathology dataset boosts AI diagnostic models.