Skip to main content

🏢 Shanghai AI Laboratory

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
·3628 words·18 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Shanghai AI Laboratory
OS-Atlas: A new open-source toolkit and model dramatically improves GUI agent performance by providing a massive dataset and innovative training methods, enabling superior generalization to unseen int…