🏢 Shanghai AI Laboratory
SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge
·2158 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Shanghai AI Laboratory
SearchLVLMs: A plug-and-play framework efficiently augments large vision-language models with up-to-date internet knowledge via hierarchical filtering, significantly improving accuracy on visual quest…
Goal Conditioned Reinforcement Learning for Photo Finishing Tuning
·3405 words·16 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Shanghai AI Laboratory
This paper introduces a goal-conditioned reinforcement learning approach that efficiently tunes photo finishing pipelines, achieving high-quality results in fewer iterations than optimization-based me…
AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection
·3853 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 Shanghai AI Laboratory
AdaptiveISP uses reinforcement learning to create a scene-adaptive ISP pipeline that dynamically optimizes for object detection, surpassing existing methods in accuracy and efficiency.