↓Skip to main content

Music Generation

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

3 March 2025·1645 words·8 mins· loading · loading

AI Generated 🤗 Daily Papers Speech and Audio Music Generation 🏢 Northwestern Polytechnical University

DiffRhythm: Fast & Simple End-to-End Song Generation via Latent Diffusion, creating full songs (4+ mins) with vocal & accompaniment in seconds!

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

18 February 2025·2399 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Speech and Audio Music Generation 🏢 Beihang University

SongGen: Single-stage autoregressive transformer for controllable text-to-song generation, simplifying the process and improving control.

XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework

15 January 2025·3087 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers Speech and Audio Music Generation 🏢 Tencent AI Lab

XMusic: A new framework generates high-quality, emotionally controllable symbolic music from various prompts (images, videos, text, tags, humming).