On April 28, 2026, researchers from Imperial College London, the Internet Archive, and Stanford University released a joint study of AI-generated content on the web.
The technical analysis identifies two effects in AI-generated text: "semantic contraction" and a "positivity shift." Because language models tend to converge toward the average of their training data, the measured semantic similarity of AI-generated content is 33% higher than that of human-written content; over time, this could narrow the range of ideas circulating online. At the same time, the positive-sentiment score of AI-generated text is 107% higher than that of human text, an artificially optimistic tone the researchers attribute to the models' "excessive compliance." They argue this tonal shift could quietly marginalize dissenting and distinctive human viewpoints before anyone notices. Notably, while the public widely worries that AI will amplify factual errors or erase individual writing styles, no significant negative correlation was found for either concern in the data.
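The semantic-contraction finding rests on comparing average pairwise similarity across two corpora. The study's actual embedding model and corpora are not specified here, so the sketch below uses synthetic vectors as illustrative stand-ins: a tight cluster mimics homogeneous AI text, a dispersed cloud mimics more varied human text.

```python
import numpy as np

def mean_pairwise_cosine(embeddings: np.ndarray) -> float:
    """Mean cosine similarity over all distinct pairs of row vectors."""
    normed = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = normed @ normed.T
    n = len(embeddings)
    # Sum all entries, subtract the n diagonal 1s, average over ordered pairs.
    return (sims.sum() - n) / (n * (n - 1))

# Toy corpora (hypothetical, not the study's data):
rng = np.random.default_rng(0)
center = rng.normal(size=8)
ai_like = center + 0.1 * rng.normal(size=(50, 8))   # tightly clustered
human_like = rng.normal(size=(50, 8))               # widely spread

print(mean_pairwise_cosine(ai_like), mean_pairwise_cosine(human_like))
```

On this toy data the clustered "AI" corpus scores far higher mean similarity than the dispersed "human" one, which is the shape of the 33% gap the researchers report.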
The researchers warn that the homogenization and forced optimism of online content are inducing a public "apathy toward reality": unable to tell genuine content from synthetic, users may come to doubt the credibility of online information as a whole. The high proportion of AI content also sharply raises the risk of "model collapse," in which later models degrade because they are trained on the outputs of earlier ones. The trend is pushing the industry to rethink search and recommendation algorithms, with likely future emphasis on detecting semantic diversity and on cryptographic provenance standards for content.
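The model-collapse feedback loop can be illustrated with a minimal toy simulation (not the study's methodology): a Gaussian "model" repeatedly refit to samples drawn from its own previous fit. The slight downward bias of the sample variance compounds across generations, and the distribution's spread collapses.

```python
import numpy as np

# Each generation "trains" only on the previous generation's output.
rng = np.random.default_rng(42)
mu, sigma = 0.0, 1.0          # generation-0 "model"
variances = []
for generation in range(300):
    samples = rng.normal(mu, sigma, size=10)  # small synthetic "corpus"
    mu, sigma = samples.mean(), samples.std() # refit on own output
    variances.append(sigma ** 2)

print(variances[0], variances[-1])
```

Running this shows the fitted variance shrinking toward zero: the analogue of a model family that, fed its own text, loses the tails of the original distribution.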
