Shengshu Technology Launches General Multi-Modal Large Model, Introducing PixWeaver and VoxCraft Tools


The Tsinghua University TSAIL Lab and Shengshu Technology have jointly open-sourced TurboDiffusion, a video generation acceleration framework that speeds up inference for AI video diffusion models by 100 to 200 times with almost no loss in visual quality. The framework deeply optimizes existing open-source models, cutting generation time from minutes to seconds on a single RTX 5090 graphics card, approaching real-time generation and marking a new era in AI video creation.
Shengshu Technology has launched the 'Vidu Q2 Image Creation Suite', which integrates three main functions: reference-based image generation, text-to-image generation, and image editing. The new version was used more than 500,000 times on its first day of release, indicating strong user demand. Vidu Q2 improves control over image generation, allowing users to specify position, action, and composition precisely, and outputs images at 4K quality. The new image editing features include local retouching and material replacement, and the suite performed strongly in international evaluations.
Shengshu Technology, a leading multimodal AI company, recently announced the completion of a Series A funding round worth several billion yuan. The round was led by Bohua Capital, with existing investors such as Baidu's strategic investment division and the Beijing Artificial Intelligence Industry Investment Fund continuing to participate, a sign of strong market recognition of the company. Shengshu Technology plans to use the funds to advance model R&D and technological innovation, explore the potential of multimodal large models, and accelerate product expansion and user services. Multimodal technology, especially video generation, is currently developing rapidly.
On September 19, 2025, Shengshu Technology announced that it had completed a new Series A funding round worth billions of yuan. The round was led by Bohua Capital, with continued participation from existing shareholders including Baidu Strategic Investment, Beijing Artificial Intelligence Industry Investment Fund, Qiming Venture Partners, Datayi Capital, and BV Baidu Ventures. Industrial partners such as Jianfa Emerging Investment also increased their investment. Founded in 2023, Shengshu Technology is led by a core team of technical talent from world-renowned universities such as Tsinghua University, Peking University, Imperial College London, and Carnegie Mellon University.
At the Baidu Cloud Intelligence Conference held today, Tang Jiayu, co-founder and CEO of Shengshu Technology, announced that Vidu, the first domestic video large model, has officially opened its API and been integrated into Baidu Intelligent Cloud's Qianfan Large Model Platform, becoming the first video large model on the platform. Vidu's self-developed video generation technology offers high dynamism, diverse styles, and exceptional reasoning. Vidu has also launched its 'Subject Reference' feature globally, which effectively addresses the problem of keeping subjects consistent in generated video. Since its launch at the end of July, thousands of enterprise users have integrated Vidu.