Skip to main content

Music Generation

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
·1645 words·8 mins· loading · loading
AI Generated 🤗 Daily Papers Speech and Audio Music Generation 🏢 Northwestern Polytechnical University
DiffRhythm: Fast & Simple End-to-End Song Generation via Latent Diffusion, creating full songs (4+ mins) with vocal & accompaniment in seconds!
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
·2399 words·12 mins· loading · loading
AI Generated 🤗 Daily Papers Speech and Audio Music Generation 🏢 Beihang University
SongGen: Single-stage autoregressive transformer for controllable text-to-song generation, simplifying the process and improving control.
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework
·3087 words·15 mins· loading · loading
AI Generated 🤗 Daily Papers Speech and Audio Music Generation 🏢 Tencent AI Lab
XMusic: A new framework generates high-quality, emotionally controllable symbolic music from various prompts (images, videos, text, tags, humming).