🏢 Shanghai AI Laboratory
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
3628 words · 18 mins
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
OS-Atlas is an open-source toolkit and model that improves GUI agent performance by providing a large-scale dataset and new training methods, enabling stronger generalization to unseen int…