🏢 Microsoft
API Agents vs. GUI Agents: Divergence and Convergence
·2038 words·10 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Robotics
🏢 Microsoft
API vs. GUI Agents: Understanding the divergence and convergence in LLM-based automation.
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
·3130 words·15 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Microsoft
Phi-4: Compact Multimodal Language Models via Mixture-of-LoRAs
LongRoPE2: Near-Lossless LLM Context Window Scaling
·3732 words·18 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Microsoft
LongRoPE2: Extends LLM context windows while preserving performance and reducing training costs!
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance
·3383 words·16 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Machine Learning
Reinforcement Learning
🏢 Microsoft
DVPO: A lean RLHF framework that decouples value & policy optimization with global value guidance, cutting GPU use by 40% and training time by 35%.
Large Action Models: From Inception to Implementation
·2938 words·14 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
AI Applications
Robotics
🏢 Microsoft
From language models to action models: building AI that does things.