Shanghai AI Lab Releases Open Source 'Shusheng・Wanjuan' 1.0 Multi-Modal Pre-trained Dataset


The Allen Artificial Intelligence Institute has released the open-source video language model Molmo2 series, including 4B and 8B versions based on Alibaba's Qwen3, and a fully open-source 7B version based on Ai2Olmo, while also making the training data publicly available, demonstrating its commitment to open source.
Meta plans to release an AI model codenamed "Avocado" in the spring of 2026, which may shift to being closed-source and was trained using Alibaba's open-source model Qwen. The news has attracted market attention, causing Alibaba's stock price to rise.
Microsoft open-sources the real-time speech model VibeVoice-Realtime-0.5B, which offers extremely low latency and near-human voice performance. The model takes an average of only 300 milliseconds from text input to voice output, far less than traditional TTS models (1-3 seconds), achieving almost zero latency real-time speech synthesis.
The Weibo AI department has launched the open-source large model VibeThinker-1.5B, which has 1.5 billion parameters. The model is optimized based on Alibaba's Qwen2.5-Math-1.5B and performs well in math and code tasks. It is now freely available on platforms such as Hugging Face, and it follows the MIT license, supporting commercial use.
Ant Group opensources the trillion-parameter inference large model Ring-1T-preview, the world's first open-source trillion-parameter inference model. The preview version shows outstanding performance in natural language reasoning, achieving a score of 92.6 on AIME25, surpassing all known open-source models such as Gemini 2.5 Pro, and approaching GPT-5's score of 94.6; it also performed well on CodeForces tests.