Stability AI, a leading AI technology company, has announced the release of Stable Audio Open 1.0, expanding its generative AI efforts into the realm of audio. Known for its stable diffusion text-to-image generation AI technology, Stability AI has been making waves in the industry with its innovative solutions.
In September 2023, Stability AI introduced Stable Audio, a text-to-audio generative AI tool. The recent release of Stable Audio 2.0 on April 3 brought improvements in clarity and length to the generated audio. Now, with the launch of Stable Audio Open, the company is offering a more limited version of its audio generation tool, focusing on shorter pieces such as sound effects.
Stable Audio Open is designed for audio researchers and producers, providing access to a specialized model optimized for creating drum beats, instrument riffs, ambient sounds, and other audio samples for music production and sound design. Unlike the commercial version of Stable Audio, which can produce longer musical tracks up to three minutes in length, Stable Audio Open generates high-quality audio data up to 47 seconds long using text prompts.
One of the key features of Stable Audio Open is the ability for users to fine-tune the model on their own custom audio data. This allows for unique and personalized creations, such as generating new beats from custom drum recordings. The model weights for Stable Audio Open are now available on Hugging Face, enabling users to further enhance their creative output.
Zach Evans, head of audio research at Stability AI, expressed excitement about the release, stating, “Our goal with Stable Audio Open is to accelerate research, adoption, and practical creative use of these incredible new tools.” With a responsible approach to training the model on non-copyrighted audio data, Stability AI is paving the way for innovative advancements in generative audio technology.