Runway Launches Gen-2, AI Video Generation Quality Soars


The Zhipu team has open-sourced four core video generation technologies, including GLM-4.6V visual understanding, AutoGLM device control, GLM-ASR speech recognition, and GLM-TTS speech synthesis models, showcasing their latest progress in the multimodal field and laying the foundation for the development of video generation technology.
Tencent Yuanbao launches a new feature, allowing users to generate high-definition videos with just one sentence or one image. Based on the open-source model HunyuanVideo1.5, it uses a DiT architecture, with 830 million parameters, and supports video generation of 5-10 seconds, simplifying the content creation process.
The Wan team under Alibaba officially open-sourced the Wan2.2-Animate-14B (referred to as Wan-Animate) model, which has quickly become a focus in the AI video field. This high-fidelity character animation generation framework addresses two major pain points of 'character animation generation' and 'character replacement' with a single-model architecture. It allows users to upload a single image or video, enabling accurate transfer of expressions and actions, as well as environmental integration, greatly lowering the barrier to video creation. The model weights and inference code have been uploaded to the Hugging Face platform,
Recently, Shengshu Technology, a leading company in the field of multimodal AI, announced the successful completion of an A-round funding round worth several billion yuan. This round was led by Bohua Capital, with existing investors such as Baidu's strategic investment division and the Beijing Artificial Intelligence Industry Investment Fund continuing to participate, demonstrating strong market recognition of Shengshu Technology. The company plans to use the funds to further advance model R&D and technological innovation, explore the potential of multimodal large models, and accelerate product expansion and user services. Multimodal technology, especially in the field of video generation, is currently experiencing rapid development.
Microsoft Bing officially launched a new "Bing Video Creator" feature this Monday. This function is based on the OpenAI Sora model, marking the first free availability of video creation capabilities; ordinary users can also easily create videos through text prompts. The launch of the Bing Video Creator allows users to create their own short videos with simple text descriptions. It is worth noting that this feature currently only supports mobile devices and has not yet been released for desktop use.