🏢 ByteDance Seed
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
·3814 words·18 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Machine Learning
Reinforcement Learning
🏢 ByteDance Seed
This paper enhances Reinforcement Learning from Human Feedback (RLHF) by tackling reward hacking and response diversity issues through improved data construction methods.
Synthetic Video Enhances Physical Fidelity in Video Synthesis
·4236 words·20 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 ByteDance Seed
Synthetic data can enhance the physical realism of video synthesis, paving the way for more believable generated content.
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
·4277 words·21 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 ByteDance Seed
Expert Race: A flexible routing strategy for scaling diffusion transformer with mixture of experts.
Frac-Connections: Fractional Extension of Hyper-Connections
·1945 words·10 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Machine Learning
Deep Learning
🏢 ByteDance Seed
Frac-Connections: An efficient alternative to Hyper-Connections that divides hidden states into fractions.
FlowTok: Flowing Seamlessly Across Text and Image Tokens
·2984 words·15 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Multimodal Generation
🏢 ByteDance Seed
FlowTok: Seamlessly flows across text and image tokens!
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
·3696 words·18 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 ByteDance Seed
VideoWorld shows AI can learn complex reasoning and planning skills from unlabeled videos alone, achieving professional-level performance in Go and robotics.