Skip to main content

🏢 Shanghai Artificial Intelligence Laboratory

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
·2463 words·12 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Shanghai Artificial Intelligence Laboratory
Align LLMs efficiently via test-time search using smaller models!
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
·2325 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Shanghai Artificial Intelligence Laboratory
AdaptiveDiffusion accelerates diffusion model inference by adaptively skipping noise prediction steps, achieving 2-5x speedup without quality loss.
ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention
·2439 words·12 mins· loading · loading
Machine Learning Deep Learning 🏢 Shanghai Artificial Intelligence Laboratory
ProSST, a novel protein language model, integrates protein sequences and structures using quantized structure representation and disentangled attention, achieving state-of-the-art performance in zero-…
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
·2786 words·14 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Shanghai Artificial Intelligence Laboratory
MxDNA: Model learns optimal DNA tokenization via gradient descent, outperforming existing methods.
MindMerger: Efficiently Boosting LLM Reasoning in non-English Languages
·2639 words·13 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Shanghai Artificial Intelligence Laboratory
MindMerger efficiently boosts LLM reasoning in non-English languages by merging LLMs with external multilingual language understanding capabilities, achieving significant accuracy improvements, especi…
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
·2071 words·10 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Shanghai Artificial Intelligence Laboratory
InternLM-XComposer2-4KHD pioneers high-resolution image understanding in LVLMs, scaling processing from 336 pixels to 4K HD and beyond, achieving state-of-the-art results on multiple benchmarks.
GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction
·2215 words·11 mins· loading · loading
Computer Vision 3D Vision 🏢 Shanghai Artificial Intelligence Laboratory
GSDF: A novel dual-branch neural scene representation elegantly resolves the rendering-reconstruction trade-off by synergistically combining 3D Gaussian Splatting and Signed Distance Fields via mutual…
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
·2781 words·14 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Shanghai Artificial Intelligence Laboratory
Director3D generates realistic 3D scenes and camera trajectories from text descriptions using a three-stage pipeline: Cinematographer, Decorator, and Detailer.