Tsinghua Team Leads the Development of the First Systematic Benchmark Test for AI Agents


OpenAI launches the enterprise-level platform Frontier, aiming to help businesses build, deploy, and manage AI agents that can perform real-world tasks, promoting the evolution of AI from conversation assistants to digital colleagues. The platform is dedicated to addressing common challenges enterprises face when deploying AI, such as data silos, complex permissions, and lack of business context, bridging the gap between large models and business applications.
In IDC's latest report, "Vendor Evaluation of Chinese AI Agent Development Platforms 2025," Ant Group successfully entered the 'Leaders' quadrant by leveraging the architectural completeness, product maturity, and industry implementation effectiveness of its Agentar platform, demonstrating its leading position in the field of AI agent development in China.
Tsinghua University releases the "Guidelines for the Application of Artificial Intelligence in Education", systematically regulating the use of AI on campus, covering core scenarios such as teaching and academic research. The content is divided into three parts: General Provisions, Teaching Section, and Section on Thesis and Practical Achievements. It emphasizes positive guidance and tiered management, aiming to promote the proper application of AI in the field of education.
Tsinghua University published a study in "Nature Machine Intelligence", introducing the new concept of "ability density", challenging traditional AI evaluation standards. The research emphasizes that attention should not only be paid to the number of model parameters, but also to the level of intelligence within each parameter, questioning the scale rule that larger models are necessarily more capable.
Tsinghua University and the Kuaishou Kirin team have collaborated to launch an SVG model that replaces VAE, solving the issue of semantic entanglement, with a 6200% improvement in training efficiency and a 3500% increase in generation speed, marking the gradual phase-out of VAE in the field of image generation.