
🏢 Microsoft

API Agents vs. GUI Agents: Divergence and Convergence
·2038 words·10 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Microsoft
API vs. GUI Agents: Understanding the divergence and convergence in LLM-based automation.
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
·3130 words·15 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Microsoft
Phi-4: Compact Multimodal Language Models via Mixture-of-LoRAs
LongRoPE2: Near-Lossless LLM Context Window Scaling
·3732 words·18 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Microsoft
LongRoPE2: Extends LLM context windows while preserving performance and reducing training costs!
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance
·3383 words·16 mins
AI Generated 🤗 Daily Papers Machine Learning Reinforcement Learning 🏢 Microsoft
DVPO: A lean RLHF framework that decouples value and policy optimization with global value guidance, cutting GPU use by 40% and training time by 35%.
Large Action Models: From Inception to Implementation
·2938 words·14 mins
AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Microsoft
From language models to action models: building AI that does things.