TechCrunch — AI · · 3 min read

Stability AI releases a new audio model that can create six-minute songs

Mirrored from TechCrunch — AI for archival readability. Support the source by reading on the original site.

Stability AI, the company behind Stable Diffusion, is releasing a new family of audio models, called Stability Audio 3.0. The top model can generate professional-grade music of more than six minutes long, the company claimed.

The company is releasing four new models under the Stable Audio 3.0 name: small SFX (459M parameters), small (459M parameters), medium (1.4B parameters), and large (2.7B parameters). The duo of small models is suitable for on-device sound and music generation of up to two minutes.

Both medium and large models can create full compositions of 6 minutes 20 seconds long that can maintain musical structure and melodic tone. This is more than double the length of what Stable Audio 2.0, released in 2024, was capable of generating.

Stability AI is making small SFX, small, and medium models available with open weights for anyone to use and modify. In 2024, the company released Stable Audio Open, which allowed for music generation of up to 47 seconds. The new family of models is a big step up from the previous open versions.

Image Credits: Stability AIImage Credits:Stability AI

The large model is available only through the API and self-hosting paid services. Plus, companies with more than $1 million in revenue would need to get an enterprise license.

Many companies, including Google and ElevenLabs, are releasing models and tooling around music generation. However, as Suno and Udio’s ongoing court battles have proved, licensing of data and partnerships with music labels could become a key part of the long-term survival of these services.

Last year, Stability AI inked deals with Warner Music Group and Universal Music Group to develop models and music creation tools. The company said that its latest set of audio models is built on fully licensed data.

The AI startup is developing a new suite of products for professional musicians, but didn’t give more details on its features. Ethan Kaplan, former chief digital officer at Universal Audio and Fender, is joining the company to lead Stability’s professional music offering.

A number of AI companies are trying to bolster their credentials by hiring music execs. Earlier this year, Suno hired former Merlin CEO Jeremy Sirota as chief commercial officer. ElevenLabs has also hired Derek Cournoyer from indie music publisher Kobalt as a strategy lead for its music business.

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

Ivan Mehta
Ivan Mehta

Ivan covers global consumer tech developments at TechCrunch. He is based out of India and has previously worked at publications including Huffington Post and The Next Web.

You can contact or verify outreach from Ivan by emailing [email protected] or via encrypted message at ivan.42 on Signal.

Event Logo
May 27
Athens, Greece


StrictlyVC Athens is up next. Hear unfiltered insights straight from Europe’s tech leaders and connect with the people shaping what’s ahead. Lock in your spot before it’s gone.

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from TechCrunch — AI