Tsinghua University Develops New Visual Language Model CogAgent to Enhance GUI Understanding and Navigation


Tsinghua University has released the "Guidelines for the Application of Artificial Intelligence in Education", which systematically regulate the use of AI on campus and cover core scenarios such as teaching and academic research. The guidelines are divided into three parts: General Provisions, a Teaching Section, and a Section on Thesis and Practical Achievements. They emphasize positive guidance and tiered management, with the aim of promoting the proper application of AI in education.
Tsinghua University has published a study in "Nature Machine Intelligence" that introduces the concept of "ability density" and challenges traditional AI evaluation standards. The research argues that attention should be paid not only to the number of model parameters but also to the level of intelligence carried by each parameter, questioning the scaling assumption that larger models are necessarily more capable.
Tsinghua University and the Kuaishou Kirin team have jointly launched SVG, a model that replaces the VAE and addresses the issue of semantic entanglement, with a reported 6200% improvement in training efficiency and a 3500% increase in generation speed. The release is described as marking the gradual phase-out of VAEs in image generation.
Tencent Charity recently launched its "Ask AI" function, the first time large artificial intelligence models have been applied to charity on the platform. The feature lets users ask questions about Tencent Charity's projects and organizations, aiming to improve interaction and transparency between the public and charitable organizations. The launch marks another breakthrough for Tencent in the field of charity. Users simply type in a question, and the system instantly returns relevant information, helping them better understand and take part in charitable activities. This convenient communication method...
On January 23, 2025, GLM-PC, described as the world's first publicly accessible, ready-to-use computer agent, was upgraded again, attracting widespread attention. GLM-PC is built on the multimodal large model CogAgent and can "observe" and "operate" a computer as a human would, helping users complete a variety of computer tasks efficiently.