SenseTime Releases Upgraded Medical Health Large Model 'Da Yi' Supporting High-Quality Training and Low-Threshold Deployment


Apple and The Ohio State University have jointly introduced FS-DFM, a model that can generate long text of quality comparable to traditional models in only 8 iterations, writing up to 128 times faster and easing the efficiency bottleneck of long-text generation. The model uses discrete flow matching, unlike autoregressive models such as ChatGPT, which generate text one token at a time.
Alibaba has launched Qwen3-Max-Preview, a trillion-parameter language model, setting a new AI benchmark. Available via Qwen Chat and the Alibaba Cloud API, it outperforms its predecessors in knowledge, dialogue, tasks, and execution...
NVIDIA has launched the Jet-Nemotron language models (2B and 4B parameters), which achieve up to 53.6x faster generation than state-of-the-art models with equal or higher accuracy, using 'post-neural architecture search' to modify pre-trained models...
Factorio, a complex video game centered around building and resource management, has emerged as a novel tool for researchers to evaluate artificial intelligence capabilities. The game allows for testing the abilities of language models in planning and constructing complex systems while managing multiple resources and production chains. To this end, a research team developed a system called the "Factorio Learning Environment" (FLE), offering two distinct testing modes. The "Experiment Mode" contains 24 structured challenges with specific goals and limited resources, with tasks ranging from simple two-machine setups...
Researchers from the ELLIS unit in Tübingen, the University of Maryland, and Lawrence Livermore National Laboratory have developed Huginn, a novel language model with a recurrent architecture that significantly enhances reasoning capabilities. Unlike traditional models, Huginn doesn't require specialized chain-of-thought training; it can reason autonomously within the neural network's latent space before outputting results...