SenseTime Releases Upgraded Medical Health Large Model 'Da Yi' Supporting High-Quality Training and Low-Threshold Deployment


Baidu released its new language model Ernie5.1 on May 11, 2026, built on the pre-trained foundation of the 2.4-trillion-parameter Ernie5.0. Using a 'one-time elastic training framework', a single training run optimizes multiple model sizes, bringing pre-training cost down to only 6% of that of similar models. As of May 9, the model ranked fourth globally and first in China on the Arena Search ranking with 1223 points, reflecting a balance between resource utilization and performance.
Apple and The Ohio State University jointly launched the FS-DFM model, which can generate long text comparable in quality to that of traditional models after only 8 iterations, making generation up to 128 times faster and breaking through the efficiency bottleneck of long text generation. The model uses discrete flow matching, unlike autoregressive models such as ChatGPT, which generate text token by token.
Alibaba launched Qwen3-Max-Preview, a trillion-parameter language model, setting a new AI benchmark. Available via Qwen Chat and the Alibaba Cloud API, it outperforms its predecessors in knowledge, dialogue, tasks, and execution...
NVIDIA launched the Jet-Nemotron language models (2B and 4B parameters), which generate text up to 53.6 times faster than state-of-the-art models with equal or higher accuracy, using 'post-neural architecture search', a technique that modifies pre-trained models...
Factorio, a complex video game centered around building and resource management, has emerged as a novel tool for researchers to evaluate artificial intelligence capabilities. The game allows for testing the abilities of language models in planning and constructing complex systems while managing multiple resources and production chains. To this end, a research team developed a system called the "Factorio Learning Environment" (FLE), offering two distinct testing modes. The "Experiment Mode" contains 24 structured challenges with specific goals and limited resources, with tasks ranging from simple two-machine setups...