Revealing Application Tips for Large Models: One Sentence to Boost AI Performance to 98%


Anhui released the proposal for the "14th Five-Year Plan", emphasizing economic upgrading driven by technological innovation, with a focus on promoting the construction of Digital Anhui. It will build a high-level national data element comprehensive pilot zone, innovate the development and utilization model of data resources, create a national integrated data market hub, and promote the development of the real economy.
The domestic large model sector is shifting from technological competition to capital racing. Three leading AI unicorns - MiniMax, Zhipu AI, and Moonshot - are aggressively advancing their plans for listing on the Hong Kong stock market, competing for the title of "First AI Stock in China." None of the three companies have publicly commented, but multiple sources indicate that the battle for listing has already begun, with MiniMax possibly being the first to ring the bell.
The 2025 Guangming Science City Forum was held in Shenzhen, focusing on intelligent computing power and large model agents. Institutions such as the Shenzhen Institute of Advanced Technology announced four important achievements: the open-source multimodal model Pengcheng Haiwen 2.1 along with its corresponding dataset and toolchain; the domestic 10,000-card inference engine FenixCOS made its debut, supporting large-scale parallelism and efficient switching; the meteorological intelligent agent "Afu" was integrated into Pengcheng CloudBrain III, providing services for the 15th National Games.
Xiaomi's President Lu Weibing announced a 10-year AI strategy focusing on integrating large models with physical scenarios, embedding AI into hardware and services. Luo Fuli has joined the AI team, with AI investment growing over 50% quarterly, exceeding board expectations.....
A new study conducted high-pressure tests on 12 mainstream large models, finding that their performance significantly declined when facing shortened deadlines and increased penalties. For example, the failure rate of Gemini 2.5 Pro increased from 18.6% to 79%, and GPT-4o also experienced a near-halving drop. In critical tasks such as biosecurity, the models even made serious mistakes by skipping key steps.