Renowned artificial intelligence company Stability AI has officially released its latest-generation large audio model.
The newly released model family spans a range of sizes, from small to large, covering needs such as music creation and sound-effect production. Notably, the models support variable-length audio generation and introduce an audio editing feature based on inpainting, giving creators considerably more flexibility.

Innovative Architecture Breaks Hardware Limitations
Thanks to its efficient compression mechanism, the model can smoothly run long-duration, large-scale audio generation tasks even on ordinary consumer-grade hardware. This significantly lowers the technical barrier to high-quality audio creation and makes professional-level audio and video production at home feasible for individual creators.

Extreme Efficiency Enables Near-Instant Rendering
With variable-length generation, the new model's computational cost scales dynamically with the audio duration the user requests, eliminating the computing power previously wasted on fixed-length output. In tests on high-performance hardware, the model rendered 20 seconds of audio in about 0.62 seconds and generated a 380-second piece of music in just 1.31 seconds.
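To put the two reported timings in perspective, the short sketch below converts them into a speed-up over real time (seconds of audio produced per second of rendering). It uses only the figures quoted above; the `speedup` helper and the labels are illustrative names, not part of any Stability AI tooling.

```python
# Figures quoted in the article: (audio duration in s, render time in s).
benchmarks = {
    "20-second clip": (20.0, 0.62),
    "380-second track": (380.0, 1.31),
}


def speedup(audio_seconds: float, render_seconds: float) -> float:
    """Seconds of audio produced per second of rendering (hypothetical helper)."""
    return audio_seconds / render_seconds


for label, (audio_s, render_s) in benchmarks.items():
    ratio = speedup(audio_s, render_s)
    print(f"{label}: roughly {ratio:.0f}x faster than real time")

# How the two data points compare with each other.
duration_growth = 380.0 / 20.0   # the audio is 19x longer
render_growth = 1.31 / 0.62      # the render takes about 2.1x longer
print(f"Duration grows {duration_growth:.0f}x, render time grows {render_growth:.1f}x")
```

On these two data points alone, the longer track is 19 times the length of the short clip but takes only about 2.1 times as long to render, which suggests that a fixed overhead accounts for much of the short render. Two measurements are too few to draw firm conclusions about how cost scales in general.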
Additionally, through an innovative three-stage training process,
