🏢 ByteDance

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

26 September 2024·2348 words·12 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 ByteDance

PeRFlow accelerates diffusion models by straightening their sampling trajectories using a piecewise reflow operation, enabling fast and high-quality image generation with minimal computational cost.

PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition

26 September 2024·2356 words·12 mins· loading · loading

Natural Language Processing Named Entity Recognition 🏢 ByteDance

PaDeLLM-NER massively accelerates LLM-based NER inference by up to 10x, enabling near real-time performance without accuracy loss.

Image Understanding Makes for A Good Tokenizer for Image Generation

26 September 2024·2230 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 ByteDance

Leveraging image understanding models for image tokenizer training dramatically boosts image generation quality, achieving state-of-the-art results.

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

26 September 2024·2714 words·13 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 ByteDance

Hyper-SD boosts diffusion model speed by using trajectory segmented consistency distillation and human feedback, achieving state-of-the-art performance.

HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

26 September 2024·2066 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 ByteDance

HumanSplat: single image-based 3D human reconstruction using Gaussian Splatting with structural priors, achieving state-of-the-art quality and speed.

DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning

26 September 2024·2740 words·13 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 ByteDance

DeTeCtive: a novel multi-task contrastive learning framework, achieves state-of-the-art AI-generated text detection by distinguishing diverse writing styles instead of simple binary classification.

An Image is Worth 32 Tokens for Reconstruction and Generation

26 September 2024·2076 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 ByteDance

Image generation gets a speed boost with TiTok, a novel 1D image tokenizer that uses just 32 tokens for high-quality image reconstruction and generation, achieving up to 410x faster processing than st…

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization

26 September 2024·2692 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 ByteDance

DiMR boosts image generation fidelity by cleverly combining multi-resolution networks with time-dependent layer normalization in diffusion models, achieving state-of-the-art results on ImageNet.