Skip to main content

🏢 Show Lab, National University of Singapore

ROICtrl: Boosting Instance Control for Visual Generation
·3855 words·19 mins· loading · loading
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Show Lab, National University of Singapore
ROICtrl boosts visual generation’s instance control by using regional instance control via ROI-Align and a new ROI-Unpool operation, resulting in precise regional control and high efficiency.
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
·5469 words·26 mins· loading · loading
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Show Lab, National University of Singapore
ShowUI, a novel vision-language-action model, efficiently manages high-resolution GUI screenshots and diverse task needs via UI-guided token selection and interleaved streaming, achieving state-of-the…
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use
·614 words·3 mins· loading · loading
AI Generated 🤗 Daily Papers AI Applications Human-AI Interaction 🏢 Show Lab, National University of Singapore
Claude 3.5 Computer Use: A groundbreaking AI model offering public beta graphical user interface (GUI) agent for computer use is comprehensively analyzed in this research. This study provides an out-o…