Skip to main content

🏢 Institute of Automation, Chinese Academy of Sciences

OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
·2912 words·14 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Institute of Automation, Chinese Academy of Sciences
OneRef: Unified one-tower model surpasses existing methods in visual grounding and segmentation by leveraging a novel Mask Referring Modeling paradigm.
Neuro-Vision to Language: Enhancing Brain Recording-based Visual Reconstruction and Language Interaction
·2574 words·13 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Institute of Automation, Chinese Academy of Sciences
Researchers enhanced brain recording-based visual reconstruction using a novel Vision Transformer 3D framework integrated with LLMs, achieving superior performance in visual reconstruction, captioning…
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
·2358 words·12 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Institute of Automation, Chinese Academy of Sciences
RL-instructed language models excel at strategic communication in One Night Ultimate Werewolf, demonstrating the importance of discussion tactics in complex games.
Latent Neural Operator for Solving Forward and Inverse PDE Problems
·2797 words·14 mins· loading · loading
AI Theory Optimization 🏢 Institute of Automation, Chinese Academy of Sciences
Latent Neural Operator (LNO) dramatically improves solving PDEs by using a latent space, boosting accuracy and reducing computation costs.
Happy: A Debiased Learning Framework for Continual Generalized Category Discovery
·2362 words·12 mins· loading · loading
Computer Vision Image Classification 🏢 Institute of Automation, Chinese Academy of Sciences
Happy: a novel debiased learning framework, excels at continually discovering new categories from unlabeled data while retaining knowledge of previously learned ones, overcoming existing bias issues a…
Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization
·3205 words·16 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Institute of Automation, Chinese Academy of Sciences
CP3ER, a novel consistency policy with prioritized proximal experience regularization, significantly boosts sample efficiency and stability in visual reinforcement learning, achieving state-of-the-art…