🏢 Institute of Automation, Chinese Academy of Sciences

OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling

26 September 2024·2912 words·14 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Institute of Automation, Chinese Academy of Sciences

OneRef: Unified one-tower model surpasses existing methods in visual grounding and segmentation by leveraging a novel Mask Referring Modeling paradigm.

Neuro-Vision to Language: Enhancing Brain Recording-based Visual Reconstruction and Language Interaction

26 September 2024·2574 words·13 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Institute of Automation, Chinese Academy of Sciences

Researchers enhanced brain recording-based visual reconstruction using a novel Vision Transformer 3D framework integrated with LLMs, achieving superior performance in visual reconstruction, captioning…

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

26 September 2024·2358 words·12 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Institute of Automation, Chinese Academy of Sciences

RL-instructed language models excel at strategic communication in One Night Ultimate Werewolf, demonstrating the importance of discussion tactics in complex games.

Latent Neural Operator for Solving Forward and Inverse PDE Problems

26 September 2024·2797 words·14 mins· loading · loading

AI Theory Optimization 🏢 Institute of Automation, Chinese Academy of Sciences

Latent Neural Operator (LNO) dramatically improves solving PDEs by using a latent space, boosting accuracy and reducing computation costs.

Happy: A Debiased Learning Framework for Continual Generalized Category Discovery

26 September 2024·2362 words·12 mins· loading · loading

Computer Vision Image Classification 🏢 Institute of Automation, Chinese Academy of Sciences

Happy: a novel debiased learning framework, excels at continually discovering new categories from unlabeled data while retaining knowledge of previously learned ones, overcoming existing bias issues a…

Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization

26 September 2024·3205 words·16 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Institute of Automation, Chinese Academy of Sciences

CP3ER, a novel consistency policy with prioritized proximal experience regularization, significantly boosts sample efficiency and stability in visual reinforcement learning, achieving state-of-the-art…