


A year ago, Stability AI, the London-based startup behind the open source image-generating AI model Stable Diffusion, quietly released Dance Diffusion, a model that can generate songs and sound effects given a text description of the songs and sound effects in question.

Dance Diffusion was Stability AI’s first foray into generative audio, and it signaled a meaningful investment - and acute interest, seemingly - from the company in the nascent field of AI music creation tools. But for nearly a year after Dance Diffusion was announced, all seemed quiet on the generative audio front - at least as far as it concerned Stability’s efforts.

The research organization Stability funded to create the model, Harmonai, stopped updating Dance Diffusion sometime last year. (Historically, Stability has provided resources and compute to outside groups rather than build models entirely in-house.) And Dance Diffusion never gained a more polished release; even today, installing it requires working directly with the source code, as there’s no user interface to speak of.

Now, under pressure from investors to translate over $100 million in capital into revenue-generating products, Stability is recommitting to audio in a big way.

Today marks the release of Stable Audio, a tool that Stability claims is the first capable of creating “high-quality,” 44.1 kHz music for commercial use via a technique called latent diffusion. Stability says that Audio Diffusion’s underlying, roughly 1.2-billion-parameter model, trained on audio metadata as well as audio files’ durations and start times, affords greater control over the content and length of synthesized audio than the generative music tools released before it.

“Stability AI is on a mission to unlock humanity’s potential by building foundational AI models across a number of content types or ‘modalities,’” Ed Newton-Rex, VP of audio for Stability AI, told TechCrunch in an email interview. “We started with Stable Diffusion and have grown to include languages, code and now music. We believe the future of generative AI is multimodality.”
