Tencent Reveals: The More Agents, the Better the Performance of Large Language Models

Researchers at Tencent have discovered that the performance of large language models improves with the increase in the number of instantiated agents, without the need for a complex multi-LLM agents collaboration framework. Experimental results show that an ensemble of multiple smaller LMs can outperform larger LMs in performance. The paper explores the relationship between performance enhancement and the difficulty of the problem, and proposes two optimization strategies: progressive sampling and voting, and hierarchical sampling and voting.

Gu Quanquan Confirms Departure from ByteDance's Seed Team, Previously Led the Development of SeedFold and Seed2.0 Training System

Gu Quanquan, a core researcher at ByteDance's Seed team, has confirmed her departure. She shared her research achievements in AI drug discovery and pre-training of large language models over the past three years on a social platform. The bio-molecular structure prediction model SeedFold, which she led the development of, performed excellently in multiple public benchmark tests. This departure comes as ByteDance's AI business accelerates its commercialization, drawing attention to the emerging trend of AI for Science startups.

GPT-5.5 Takes the Lead in Utilization Efficiency, DeepSeek V4 Pro Wins the Title of Best Cost-Performance! Real-World Cybersecurity Attack and Defense Report on Large Models Released

The reasoning capabilities of large language models in the field of cybersecurity are facing a serious test. Security researcher Kasra Rahjerdi conducted simulated hacker attack tests on mainstream large models by building an APK with core vulnerabilities in book review data, revealing their true level of security reasoning and vulnerability exploitation. The test lasted 2 hours with a single budget of $10, intuitively demonstrating the performance of each model in complex logical challenges.

Betting on People Rather than Code: The Zig Project's Strict Policy Prohibiting LLM-Assisted Contributions Sparks Debate

As Generative AI sweeps through the programming field, the Zig open-source project has introduced a strict policy in the opposite direction: completely prohibiting the use of code or comments generated by large language models for contributions. After Simon Willison's interpretation, it sparked a discussion within the community about the trade-off between technical efficiency and talent development. The core conflict lies in the choice between code production and talent growth. The Zig maintainers redefined 'contributions,' emphasizing originality and the learning process.

Moonshot AI Collaborates with Tsinghua University to Launch PrfaaS Architecture, Breaking the Bottleneck of Large Model Computing Power

The efficiency of large language model inference has made a breakthrough. Tsinghua University and Moonshot AI jointly proposed a new architecture called "Prefill-as-a-Service," which splits the inference process into two stages: prefilling and decoding, and optimizes the allocation of computing resources, effectively solving hardware limitations and significantly improving model service performance.

Tencent Reveals: The More Agents, the Better the Performance of Large Language Models

Related Recommendations

Gu Quanquan Confirms Departure from ByteDance's Seed Team, Previously Led the Development of SeedFold and Seed2.0 Training System

GPT-5.5 Takes the Lead in Utilization Efficiency, DeepSeek V4 Pro Wins the Title of Best Cost-Performance! Real-World Cybersecurity Attack and Defense Report on Large Models Released

Betting on People Rather than Code: The Zig Project's Strict Policy Prohibiting LLM-Assisted Contributions Sparks Debate

Moonshot AI Collaborates with Tsinghua University to Launch PrfaaS Architecture, Breaking the Bottleneck of Large Model Computing Power

AI Medicine Enters the Deep Waters: Research Indicates Generative Models Still Struggle to Independently Bear the Burden of Clinical Reasoning