Google DeepMind Uses Large Language Models to Solve Unresolved Mathematical Problems


The reasoning capabilities of large language models in the field of cybersecurity are facing a serious test. Security researcher Kasra Rahjerdi conducted simulated hacker attack tests on mainstream large models by building an APK with core vulnerabilities in book review data, revealing their true level of security reasoning and vulnerability exploitation. The test lasted 2 hours with a single budget of $10, intuitively demonstrating the performance of each model in complex logical challenges.
Google DeepMind CEO Hassabis predicts that Artificial General Intelligence (AGI) could appear as early as 2029-2030, with key breakthroughs potentially achieved within the next three years. He points out that increased investment by tech companies is accelerating the maturity of core technologies such as multimodal understanding, autonomous decision-making, and AI agents.
As Generative AI sweeps through the programming field, the Zig open-source project has introduced a strict policy in the opposite direction: completely prohibiting the use of code or comments generated by large language models for contributions. After Simon Willison's interpretation, it sparked a discussion within the community about the trade-off between technical efficiency and talent development. The core conflict lies in the choice between code production and talent growth. The Zig maintainers redefined 'contributions,' emphasizing originality and the learning process.
The efficiency of large language model inference has made a breakthrough. Tsinghua University and Moonshot AI jointly proposed a new architecture called "Prefill-as-a-Service," which splits the inference process into two stages: prefilling and decoding, and optimizes the allocation of computing resources, effectively solving hardware limitations and significantly improving model service performance.
Boston Dynamics and Google DeepMind have collaborated to integrate the Gemini Robotics-ER1.6 AI model into the Spot robot, significantly enhancing its capabilities in industrial inspection, particularly in leak detection and reading instrument data, thereby improving the robot's autonomous decision-making performance.