At the 2025 World Artificial Intelligence Conference (WAIC 2025), Shengshu Technology officially launched the "Reference Video" feature for Vidu Q1, reworking the traditional video production process through algorithmic innovation and marking a breakthrough for the field of video generation.
Say Goodbye to Storyboarding, Create Videos in One Click
The biggest highlight of "Reference Video" is that it skips the complex pre-production storyboarding stage. Users simply upload reference images of characters, props, and scenes together with a text prompt, and the system generates complete video content directly. The production pipeline shortens from the traditional "storyboard → video generation → editing → final video" to "reference images → video generation → editing → final video".
For example, given the prompt "Zhuge Liang discussing with Churchill and Napoleon in a meeting room" plus reference images of the three historical figures and a meeting-room scene, the system generates a complete video showing the three in conversation.
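To make the workflow concrete, here is a minimal sketch of what such a reference-to-video request might look like over HTTP. The endpoint, parameter names, and response fields below are illustrative assumptions, not Vidu's published API.

```python
# Hypothetical sketch of a reference-to-video request. The endpoint,
# parameter names, and response fields are illustrative assumptions,
# not Vidu's actual API.
import requests

API_URL = "https://example.com/v1/reference2video"  # placeholder endpoint

payload = {
    "model": "vidu-q1",
    "prompt": "Zhuge Liang discussing with Churchill and Napoleon in a meeting room",
    # Reference Video reportedly keeps up to seven subjects consistent.
    "reference_images": [
        "zhuge_liang.png",
        "churchill.png",
        "napoleon.png",
        "meeting_room.png",
    ],
}

response = requests.post(API_URL, json=payload, timeout=600)
response.raise_for_status()
print(response.json()["video_url"])  # assumed response field
```

The point of the sketch is the shape of the input: subject and scene references plus one prompt replace an entire storyboard.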

Cracking the Core Challenges of Commercialization
The core advantage of this feature lies in solving a key bottleneck in commercializing video models: subject consistency. Vidu Q1's Reference Video currently supports up to seven subjects as simultaneous inputs while keeping each of them consistent, which, according to Shengshu Technology, is sufficient for most creative scenarios.
Lu Yihang, CEO of Shengshu Technology, said that this general-purpose creation method will better serve diverse commercial scenarios such as advertising, animation, film and television, cultural tourism, and education, enabling a fundamental shift from offline shooting to online AI creation.
Technical Path and Industrial Orientation
Shengshu Technology builds on the U-ViT architecture, which combines diffusion models with a Transformer backbone, and optimizes its algorithm modules on that foundation. The Vidu model has built-in multimodal understanding capabilities that have been applied successfully to video generation.
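As a rough illustration of the U-ViT idea (a sketch of the published architecture, not Shengshu's production code), the snippet below shows its two defining traits: the diffusion timestep, the condition, and the noisy-image patches are all treated as tokens of a single Transformer, and long skip connections link shallow and deep blocks, U-Net style.

```python
# Simplified U-ViT-style denoiser (illustrative sketch only).
# Time, condition, and noisy-image patches are all tokens of one
# Transformer; long skips connect shallow and deep blocks, U-Net style.
import torch
import torch.nn as nn

class UViTSketch(nn.Module):
    def __init__(self, dim=512, depth=8, n_patches=256, heads=8):
        super().__init__()
        assert depth % 2 == 0
        self.patch_embed = nn.Linear(16 * 16 * 3, dim)    # flattened 16x16 RGB patches
        self.time_embed = nn.Linear(1, dim)               # diffusion timestep as a token
        self.cond_embed = nn.Linear(dim, dim)             # text/reference condition as a token
        self.pos = nn.Parameter(torch.zeros(1, n_patches + 2, dim))
        make_block = lambda: nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.in_blocks = nn.ModuleList([make_block() for _ in range(depth // 2)])
        self.out_blocks = nn.ModuleList([make_block() for _ in range(depth // 2)])
        # Long skips: concatenate shallow and deep features, project back to dim.
        self.skips = nn.ModuleList([nn.Linear(2 * dim, dim) for _ in range(depth // 2)])
        self.head = nn.Linear(dim, 16 * 16 * 3)           # predict noise per patch

    def forward(self, patches, t, cond):
        x = self.patch_embed(patches)                     # (B, N, dim)
        tok_t = self.time_embed(t[:, None, None].float()) # (B, 1, dim)
        tok_c = self.cond_embed(cond)[:, None, :]         # (B, 1, dim)
        x = torch.cat([tok_t, tok_c, x], dim=1) + self.pos
        stack = []
        for blk in self.in_blocks:                        # shallow half
            x = blk(x)
            stack.append(x)
        for blk, proj in zip(self.out_blocks, self.skips):  # deep half with long skips
            x = proj(torch.cat([x, stack.pop()], dim=-1))
            x = blk(x)
        return self.head(x[:, 2:])                        # drop time/cond tokens
```

The long skip connections are what put the "U" in U-ViT; the rest is a standard ViT denoiser that a diffusion sampler would call at every step.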
Lu Yihang emphasized that the team prioritizes industrial application and has not made the integration of understanding and generation its top priority, saying, "Industry clients care more about content quality than technical approaches."
Expanding the Field of Embodied Intelligence
On July 25, Tsinghua University and Shengshu Technology jointly released Vidar, an embodied-intelligence model that achieves low-cost, few-shot generalization through a "video large model + embodied intelligence" approach.
Lu Yihang explained that video models and embodied intelligence both fundamentally process spatiotemporal information and share the same input-and-decision logic. Building on the Vidu video large model, the team trains with a small number of robot-operation videos and can then convert generated virtual videos into corresponding robotic-arm movements, effectively easing the data-scarcity problem of traditional VLA (vision-language-action) approaches.
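One plausible reading of that pipeline, sketched below with entirely hypothetical names (the article does not publish Vidar's interfaces): a video model fine-tuned on a few robot-operation clips predicts future frames for a task, and an inverse-dynamics model decodes the arm action between each pair of consecutive frames.

```python
# One plausible "video model + inverse dynamics" pipeline. Every name
# here is hypothetical; Vidar's real interfaces are not public.

def vidar_style_control(video_model, inverse_dynamics, instruction, observation):
    """Generate a task video, then decode robot-arm actions frame by frame."""
    # 1. A video model, lightly fine-tuned on a small set of robot-operation
    #    clips, predicts future frames for the instructed task.
    frames = video_model.generate(prompt=instruction, first_frame=observation)
    # 2. An inverse-dynamics model maps each consecutive frame pair to the
    #    arm action that would produce that visual transition.
    return [inverse_dynamics(prev, nxt) for prev, nxt in zip(frames, frames[1:])]
```

Under this reading, the expensive spatiotemporal reasoning lives in the pre-trained video model, so only a small amount of robot data is needed to close the loop to actions.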
For now, Vidu continues to prioritize improving its video generation capabilities while treating embodied intelligence as an ongoing exploration direction that opens up potential commercial markets for the field.
