🏢 OPPO Research Institute

OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning
2043 words · 10 mins
AI Generated 🤗 Daily Papers Multimodal Learning Multimodal Reasoning 🏢 OPPO Research Institute
OThink-MR1 enhances the reasoning of multimodal large language models (MLLMs) via dynamic reinforcement learning, improving cross-task generalization.