↓Skip to main content

🏢 Vivo AI Lab

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

27 March 2025·2964 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Vivo AI Lab

UI-R1 enhances GUI agents’ action prediction using reinforcement learning.

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

16 November 2024·3633 words·18 mins· loading · loading

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Vivo AI Lab

BlueLM-V-3B: Algorithm and system co-design enables efficient, real-time multimodal language model deployment on mobile devices.