🏢 Shanghai Artificial Intelligence Laboratory
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
·3977 words·19 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 Shanghai Artificial Intelligence Laboratory
VBench 2.0: A new benchmark suite advancing video generation evaluation with intrinsic faithfulness metrics.
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition
·3118 words·15 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Shanghai Artificial Intelligence Laboratory
ResearchBench: Benchmarking LLMs for Scientific Discovery via Inspiration-Based Task Decomposition.
Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation
·2233 words·11 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Shanghai Artificial Intelligence Laboratory
FakeVLM: A multimodal model & artifact-annotated dataset for detecting synthetic images with interpretable explanations, setting a new benchmark.
Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation
·2576 words·13 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
3D Vision
🏢 Shanghai Artificial Intelligence Laboratory
Infinite Mobility: Procedural generation of high-fidelity articulated objects for scalable embodied AI training.
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing
·2729 words·13 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Multimodal Generation
🏢 Shanghai Artificial Intelligence Laboratory
SURVEYFORGE automates survey generation, improving quality and evaluation.
Iterative Value Function Optimization for Guided Decoding
·2523 words·12 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Natural Language Processing
Text Generation
🏢 Shanghai Artificial Intelligence Laboratory
IVO: Iterative Value Function Optimization for Guided Decoding
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
·2690 words·13 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Shanghai Artificial Intelligence Laboratory
InternLM-XComposer2.5-Reward: A novel multi-modal reward model boosting Large Vision Language Model performance.
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
·4676 words·22 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Shanghai Artificial Intelligence Laboratory
Introducing Evaluation Agent, a faster, more flexible human-like framework for evaluating visual generative AI.
Chimera: Improving Generalist Model with Domain-Specific Experts
·4776 words·23 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Shanghai Artificial Intelligence Laboratory
Chimera boosts large multimodal models’ performance on specialized tasks by cleverly integrating domain-specific expert models, achieving state-of-the-art results on multiple benchmarks.
VLSBench: Unveiling Visual Leakage in Multimodal Safety
·5131 words·25 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Shanghai Artificial Intelligence Laboratory
VLSBench exposes visual leakage in MLLM safety benchmarks, creating a new, leak-free benchmark to evaluate true multimodal safety.