Kling 2.6 Will Be Released: Native Audio + 10-Second 1080P AI Video Enter the Era of Audio

Kling AI, a subsidiary of Kuaishou, launched version 2.6 on the first day of the Omni Ecosystem Week. This version introduces built-in audio generation for the first time, supporting bilingual dialogue, singing, and synchronized sound effects, achieving an "text ⇄ video ⇄ audio" one-click loop. The official slogan "See the Sound, Hear the Visual" highlights its multimodal synchronization capabilities.

Regarding technical specifications, version 2.6 maintains 10 seconds of 1080P high-definition output, requiring only 25 points every 5 seconds (a 30% reduction from the previous version). The diffusion transformer plus 3D spatiotemporal joint attention architecture brings three improvements: compliance with complex instructions increased by 15%, cross-shot character consistency reaches SOTA, and it outperforms Seedance 1.0 by 285% in blind testing.

In terms of the market, Kling 2.6 will be launched first on professional platforms such as Artlist, offering scene expansion and multi-element editing APIs, targeting film, short dramas, advertisements, and MV production. Kuaishou stated that by Q1 2026, it will release a 4K/60fps version and open a custom voice library, continuing to lower the barriers to "AI filmmaking."

Industry observers believe that audio synchronization has filled the last gap in AI video, and post-production editing processes are expected to be shortened by over 50%. With the launch of Kling 2.6, competition in AI creation tools is expanding from "visuals" to "sound," potentially leading to a new wave of supply in audio-based short videos.

New Trend in AI Music Video Creation: Instant MV 1.1 Version Achieves One-Click Production Across the Board

AI music video platform 'Liko MV' releases version 1.1, launching web and iPhone versions. The key upgrade introduces an AI video generation module, producing dynamic videos directly, replacing traditional 'image slideshow' processing. This significantly enhances video expressiveness and flexibility, lowering the barrier to MV creation.....

ByteDance Volcano Engine Seedance 2.0 Officially Opens Application for General API Customers

ByteDance's Volcano Engine opened public API applications for the Seedance2.0 multimodal video generation model on April 2, transitioning from limited testing to broader availability. The model supports text, image, audio, and video inputs, enabling character consistency, director-level shot control, and physical simulation.....

Aishike Technology Completes Series C Funding and Unveils Its First Real-Time World Model

AI video generation advances from content creation to real-time interaction. A leading company secured Series C funding led by CDH Investments, with support from notable institutions. It also launched PixVerse R1, the world's first real-time world model, marking a new phase in AI video technology.....

Kling 2.6 Will Be Released: Native Audio + 10-Second 1080P AI Video Enter the Era of Audio

Related Recommendations

New Trend in AI Music Video Creation: Instant MV 1.1 Version Achieves One-Click Production Across the Board

Volcano Engine Seedance 2.0 Series API Goes Live, Opening Up Global SOTA-Level Video Generation Capabilities

ByteDance Volcano Engine Seedance 2.0 Officially Opens Application for General API Customers

Aishike Technology Completes Series C Funding and Unveils Its First Real-Time World Model

ByteDance Launches Seedance 2.0: AI Video Enters the Era of Personal Production Teams