🏢 Shanghai Artificial Intelligence Laboratory
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
·2463 words·12 mins·
Natural Language Processing
Large Language Models
🏢 Shanghai Artificial Intelligence Laboratory
Align LLMs efficiently via test-time search using smaller models!
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
·2325 words·11 mins·
Computer Vision
Image Generation
🏢 Shanghai Artificial Intelligence Laboratory
AdaptiveDiffusion accelerates diffusion model inference by adaptively skipping noise prediction steps, achieving 2-5x speedup without quality loss.
ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention
·2439 words·12 mins·
Machine Learning
Deep Learning
🏢 Shanghai Artificial Intelligence Laboratory
ProSST, a novel protein language model, integrates protein sequences and structures using quantized structure representation and disentangled attention, achieving state-of-the-art performance in zero-…
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
·2786 words·14 mins·
AI Generated
Natural Language Processing
Large Language Models
🏢 Shanghai Artificial Intelligence Laboratory
MxDNA lets the model learn its own DNA tokenization via gradient descent, outperforming existing methods.
MindMerger: Efficiently Boosting LLM Reasoning in non-English Languages
·2639 words·13 mins·
Natural Language Processing
Large Language Models
🏢 Shanghai Artificial Intelligence Laboratory
MindMerger efficiently boosts LLM reasoning in non-English languages by merging LLMs with external multilingual language understanding capabilities, achieving significant accuracy improvements, especi…
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
·2071 words·10 mins·
Multimodal Learning
Vision-Language Models
🏢 Shanghai Artificial Intelligence Laboratory
InternLM-XComposer2-4KHD pioneers high-resolution image understanding in LVLMs, scaling processing from 336 pixels to 4K HD and beyond, achieving state-of-the-art results on multiple benchmarks.
GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction
·2215 words·11 mins·
Computer Vision
3D Vision
🏢 Shanghai Artificial Intelligence Laboratory
GSDF: A novel dual-branch neural scene representation elegantly resolves the rendering-reconstruction trade-off by synergistically combining 3D Gaussian Splatting and Signed Distance Fields via mutual…
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
·2781 words·14 mins·
AI Generated
Computer Vision
3D Vision
🏢 Shanghai Artificial Intelligence Laboratory
Director3D generates realistic 3D scenes and camera trajectories from text descriptions using a three-stage pipeline: Cinematographer, Decorator, and Detailer.