Zhipu AI Open-Source Visual Language Model CogAgent Supports GUI Graphic Interface Q&A


Tencent releases open-source Hunyuan Translator 1.5, supporting 33 languages and optimized for mobile. Available in 1.8B and 7B versions, the 1.8B model uses only 1GB memory after quantization, enabling offline real-time translation on devices like smartphones with excellent inference speed.....
The Allen Artificial Intelligence Institute has released the open-source video language model Molmo2 series, including 4B and 8B versions based on Alibaba's Qwen3, and a fully open-source 7B version based on Ai2Olmo, while also making the training data publicly available, demonstrating its commitment to open source.
Meta plans to release an AI model codenamed "Avocado" in the spring of 2026, which may shift to being closed-source and was trained using Alibaba's open-source model Qwen. The news has attracted market attention, causing Alibaba's stock price to rise.
Microsoft open-sources the real-time speech model VibeVoice-Realtime-0.5B, which offers extremely low latency and near-human voice performance. The model takes an average of only 300 milliseconds from text input to voice output, far less than traditional TTS models (1-3 seconds), achieving almost zero latency real-time speech synthesis.
The Weibo AI department has launched the open-source large model VibeThinker-1.5B, which has 1.5 billion parameters. The model is optimized based on Alibaba's Qwen2.5-Math-1.5B and performs well in math and code tasks. It is now freely available on platforms such as Hugging Face, and it follows the MIT license, supporting commercial use.