On September 12, MiniMax released its new music generation model, Music1.5, which marks a new breakthrough in AI music creation and opens up a new era of "one person is a band." Music1.5 not only extends the duration of generated music to 4 minutes but also features four major breakthroughs: strong controllability, natural and full vocals, rich orchestration layers, and clear song structures.
Music1.5 supports music creation up to 4 minutes, producing finished works instead of just demo samples. It allows users to customize and have strong control over the style, mood, and scene of a song. Users need only provide a simple natural language description, and Music1.5 will deliver a highly completed work. In advanced mode, users can also arrange specific lyrics for different parts of the song, such as the intro, verse, and chorus. Music1.5 supports customizing music features with 16 styles × 11 moods × 10 scenes, generating different vocal tones and singing styles. The sound is more transparent and realistic, with natural and full tone, smooth transitions without breaks, greatly enriching the emotional expression of songs.

In terms of orchestration, Music1.5 models instruments with fine-grained modeling, making the orchestration rich, the instrument layers clear, and the playing techniques varied. It also supports the generation of less common and ethnic Chinese instruments, such as multiple traditional Chinese folk instruments in the song "Jiangnan Rain and Poetry." In addition, Music1.5 achieves clear distinctions between the intro, verse, and chorus sections, with a clear chorus climax and a natural ending, providing a true "narrative-level" auditory experience.
These breakthroughs are based on MiniMax's self-developed capabilities in multi-modal areas such as text, speech, and vision. Music1.5 uses the power of text models to have a stronger understanding and control over text descriptions. It not only allows overall control over the style, emotional color, and applicable scenarios of a song, but also enables fine-grained control over vocal characteristics, generating vocal tones with different qualities.
Music1.5 is suitable for various scenarios, including using AI to inspire professional music creation, quickly customizing background music for films, games, and short videos, creating songs and MVs for virtual idols, and generating exclusive audio content for corporate brands. Additionally, Music1.5 also provides APIs for global developers, continuing to offer the highest cost-performance globally. Whether it's an application, tool, or creative workflow, it can be easily integrated, making AI become a "24-hour creative partner."
The release of Music1.5 not only lowers the barriers to music creation but also returns to the essence of hearing, allowing "good-sounding" music to naturally happen. The model is now available worldwide. Users can log in to minimaxi.com/audio/music immediately to experience it and create their next hit single.
