CAS Institute of Physics Releases Material Synthesis AI Large Language Model MatChat


On April 8th, 2025, NVIDIA launched Llama 3.1 Nemotron Ultra 253B, an open-source model optimized from Llama-3.1-405B. With 25.3 billion parameters, it surpasses Meta's Llama 4 Behemoth and Maverick, becoming a focal point in the AI field. This model demonstrates superior performance in benchmarks such as GPQA-Diamond, AIME 2024/25, and LiveCodeBench, achieving inference throughput comparable to DeepSeek.
SenseTime Technology, in collaboration with Thailand's DTGO Group and Quinnnova, jointly launched the Thai Language Large Model (DTLM) named "Dongfeng". This is the world's first AI large language model that can operate efficiently in Thai, Chinese, and English. The model combines SenseTime's base model and computational advantages with DTGO's deep understanding of Thai language and culture, aiming to provide localized generative AI experiences, including text comprehension and natural, fluent r
Recently at the Google I/O Berlin conference, Google announced the open-sourcing of its latest language model Gemma2, which has made significant breakthroughs in performance and efficiency. Gemma2 offers two parameter sizes: 9B and 27B, with the 27B version's performance approaching that of the 70B Llama3 model, but with a model size that is only about 40% of the latter.
On April 3, Qwen APP launched the Wanxiang 2.7 video generation model, adding video editing, continuation, and action imitation features. Users can easily replace objects, modify scenes, switch styles, and apply creative styles like animation, 3D, and clay with natural lighting details.....
Meituan launches LongCat-Next, a native multimodal AI model that uses DiNA technology to unify images, audio, and text into discrete tokens, enabling deep integration of multimodal modeling for enhanced perception of the physical world.....