Skip to main content

🏢 Shanghai AI Laboratory

SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge
·2158 words·11 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Shanghai AI Laboratory
SearchLVLMs: A plug-and-play framework efficiently augments large vision-language models with up-to-date internet knowledge via hierarchical filtering, significantly improving accuracy on visual quest…
Goal Conditioned Reinforcement Learning for Photo Finishing Tuning
·3405 words·16 mins· loading · loading
Computer Vision Image Generation 🏢 Shanghai AI Laboratory
This paper introduces a goal-conditioned reinforcement learning approach that efficiently tunes photo finishing pipelines, achieving high-quality results in fewer iterations than optimization-based me…
AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection
·3853 words·19 mins· loading · loading
AI Generated Computer Vision Object Detection 🏢 Shanghai AI Laboratory
AdaptiveISP uses reinforcement learning to create a scene-adaptive ISP pipeline that dynamically optimizes for object detection, surpassing existing methods in accuracy and efficiency.