🏢 ByteDance
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
·2348 words·12 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 ByteDance
PeRFlow accelerates diffusion models by straightening their sampling trajectories using a piecewise reflow operation, enabling fast and high-quality image generation with minimal computational cost.
PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition
·2356 words·12 mins·
loading
·
loading
Natural Language Processing
Named Entity Recognition
🏢 ByteDance
PaDeLLM-NER massively accelerates LLM-based NER inference by up to 10x, enabling near real-time performance without accuracy loss.
Image Understanding Makes for A Good Tokenizer for Image Generation
·2230 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 ByteDance
Leveraging image understanding models for image tokenizer training dramatically boosts image generation quality, achieving state-of-the-art results.
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
·2714 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 ByteDance
Hyper-SD boosts diffusion model speed by using trajectory segmented consistency distillation and human feedback, achieving state-of-the-art performance.
HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors
·2066 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 ByteDance
HumanSplat: single image-based 3D human reconstruction using Gaussian Splatting with structural priors, achieving state-of-the-art quality and speed.
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
·2740 words·13 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 ByteDance
DeTeCtive: a novel multi-task contrastive learning framework, achieves state-of-the-art AI-generated text detection by distinguishing diverse writing styles instead of simple binary classification.
An Image is Worth 32 Tokens for Reconstruction and Generation
·2076 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 ByteDance
Image generation gets a speed boost with TiTok, a novel 1D image tokenizer that uses just 32 tokens for high-quality image reconstruction and generation, achieving up to 410x faster processing than st…
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization
·2692 words·13 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 ByteDance
DiMR boosts image generation fidelity by cleverly combining multi-resolution networks with time-dependent layer normalization in diffusion models, achieving state-of-the-art results on ImageNet.