Zhipu AI Open-Source Visual Language Model CogAgent Supports GUI Graphic Interface Q&A


Alibaba Cloud BaiLian announced on May 29, 2026, that it is fully CLI-enabled and open-sources its CLI project. This move drives a full-stack integration transformation for AI Agent access and development. The CLI encapsulates core capabilities such as mainstream models, workflows, knowledge bases, memory management, web search, and multimodal file processing into a lightweight command-line interface, allowing developers to efficiently use them after installation and authentication.
Xiaomi released the MiMo-V2.5 series of large models on April 23 and initiated public testing. The series includes four models, with the core models MiMo-V2.5-Pro and MiMo-V2.5 being open-sourced globally, demonstrating its commitment to promoting an open AI ecosystem. This update is not only a product iteration but also a comprehensive upgrade of the technology foundation, featuring flagship performance that supports a context length of up to one million and complex task processing.
Ant Forest LingBot Technology opens a large-scale RGB-D depth dataset called LingBot-Depth-Dataset, containing 3 million high-quality samples, of which 2 million are collected from real scenes and 1 million are rendered. The total size reaches 2.71 TB, covering 6 mainstream depth cameras. It is currently the largest real-scene RGB-D dataset in the open-source community, providing richer data support for embodied intelligence, spatial perception, and 3D vision fields.
Microsoft has recruited top research teams from the Allen Institute for Artificial Intelligence and the University of Washington, led by Ali Farhadi, former CEO of Ai2, who joined Microsoft's newly established 'Super Intelligence' department, aiming to strengthen its general artificial intelligence strategy.
Tokyo startup InfiniMind secures $5.8 million in seed funding, founded by a former Google employee, dedicated to developing AI infrastructure that transforms massive unused video and audio dark data into searchable structured business intelligence to address enterprise data processing challenges.