Article

Multimodal Agent Receives Major Upgrade! Alibaba Officially Releases Qwen3.7-Plus with Comprehensive Enhancements in Vision and Workflow

Published in Latest AI News

Time :Jun 2, 2026

Read :3minute

The global advancement of large model technology towards "embodied intelligence" and advanced agents is accelerating rapidly. On June 2, Alibaba announced through the official channel of Qwen Large Model that it has officially launched a new multimodal agent model - Qwen3.7-Plus. This not only represents another technological breakthrough for the Tongyi Qianwen series in the multimodal field, but also marks the iteration of the core foundation for domestic large models in edge-side and complex workflow applications.

As the core highlight of this upgrade, Qwen3.7-Plus builds on the powerful native text processing capabilities of Qwen3.7 and undergoes a comprehensive and advanced evolution in vision-language (Vision-Language) capabilities. This means the model can not only better "understand" complex image and video content, but also transform this refined visual perception into deep logical reasoning, greatly expanding the practical application boundaries of multimodal interaction.

In addition to the transformation of visual capabilities, the model still maintains its top-tier hard-core strengths in the core chain of agents (Agent). In areas such as programming code generation, complex tool use (Tool-use), and high-level productivity workflows (Productivity Workflows), Qwen3.7-Plus demonstrates high task continuity and decision robustness, allowing it to adapt more smoothly to enterprise-level automation tasks and long-term intelligent scheduling scenarios.

Industry analysts point out that the competition in the second half of the large model era has clearly shifted toward multimodal and agent-based solutions. By deeply integrating visual understanding with agent action planning, Alibaba's Qwen3.7-Plus not only further raises the performance ceiling of open-source and commercial models, but also provides a more imaginative computing foundation for subsequent broader industrial intelligence and embodied robot applications.

Related Recommendations

Wang He, Founder of Galaxy General-Purpose Robot: The ChatGPT Moment of Embodied Intelligence Will Arrive by 2028!

Galaxy General Robot CTO Wang He predicted at the 2026 World AI Conference that embodied intelligence will achieve a major breakthrough before 2028, with performance comparable to ChatGPT. The foundational model, trained on massive data, can reach a 70%-80% success rate on tasks not specifically trained for, similar to early digital models.....

Jul 17, 2026

224.9k

Others Ring the Bell, We Reset: Zhipu Reveals Its Stretching Plan, Betting on a Fully Automated Intelligent Ecosystem

Zhipu founder Tang Jie announced the "Gaokao" plan, a two-year strategic investment in four core engines: long-context tasks, autonomous agents, fully self-training, and ultimate safety governance, aiming for next-gen AGI. Meanwhile, GLM-5.2 was released as open source under MIT license, supporting million-token context and leading in long-range tasks.....

Jul 13, 2026

216.4k

Pre-train from Scratch: Ant Lingbo Releases the Embodied Native World Action Model LingBot-VA 2.0

On July 10, Ant Lingbo unveiled LingBot-VA2.0, the first embodied native world action model. It shifts robot foundation models from digital grafting to native physical-world design, creating a 'brain' from primal interaction needs like dynamic modeling and causal prediction.....

Jul 10, 2026

279.1k

Step Star's First AI Intelligent Phone Will Be Released, Leading Ahead of OpenAI

Step Star announced a new AI agent device, launching its AI terminal brand, agent system, and first AI agent phone. It becomes the first native agent phone from a major model company, with market release ahead of OpenAI's plan. Manufactured by Huaqin Technology, the two have formed a deep partnership.....

Jul 9, 2026

289.5k

MiniMax to Launch New Generation Large Model with 2.7 Trillion Parameters

MiniMax reportedly prepares to launch a next-gen large model with 2.7 trillion parameters, targeting complex task handling and logical reasoning. This underscores its deep investment in core technology and reflects the industry's ambition to pursue higher intelligence through parameter scaling.....

Jul 9, 2026

283.8k

Intelligent Future, Your Artificial Intelligence Solution Think Tank

English 简体中文繁體中文にほんご