Google Releases Stable Version of New Gemini 2.5 Flash-Lite: The Perfect Balance of Speed and Cost

Recently, Google officially announced that its latest Gemini 2.5 Flash-Lite model has entered the general availability (GA) stage. This version is considered the fastest and most cost-effective model, marking another important advancement in Google's artificial intelligence field. Gemini 2.5 Flash-Lite achieves a good balance between performance and cost, and it natively supports up to 1 million tokens of context, bringing many advanced features.

The pricing strategy of Gemini 2.5 Flash-Lite is also quite notable: the cost is only $0.10 per million input tokens, and $0.40 per million output tokens, which is comparable to the price of the competitor GPT-4.1 Nano. In addition, compared to the previous preview version, the pricing for audio input has been reduced by 40%, showing its sensitivity to user needs and response to market competition.

In various benchmark tests, the performance of Gemini 2.5 Flash-Lite surpassed the previous 2.0 version, covering areas such as coding, mathematics, reasoning, and multimodal understanding. The model supports a context window of 1 million tokens, has controllable thinking budgets, and offers multiple native tools, such as integration with Google search, code execution, and URL context functionality.

Developers can use the Gemini 2.5 Flash-Lite model through simple code instructions, specifically by specifying the model as gemini-2.5-flash-lite. It should be noted that the original preview version alias plan will be removed on August 25th, and developers should adapt to the new version as soon as possible.

The release of Gemini 2.5 Flash-Lite marks Google's determination to continuously innovate and optimize in the field of artificial intelligence technology, providing developers with a more efficient and cost-effective option. It is undoubtedly going to play a greater role in various application scenarios in the future.

Key points:
🌟 Gemini 2.5 Flash-Lite is Google's latest AI model, the fastest and most cost-effective, and has now entered the general availability (GA) stage.
💰 The model is priced at $0.10 per million input tokens and $0.40 per million output tokens, with a 40% reduction in the price of audio input compared to the preview version.
🔧 Developers can use the new version by specifying the model name as gemini-2.5-flash-lite. The original preview version alias will be removed on August 25th.

Study: Global AI Chipset Market to Exceed $700B with 31.8% CAGR

According to TMR Research, the global artificial intelligence chipset market size is expected to exceed $700 billion, with a compound annual growth rate of 31.8% from 2022 to 2031. The article discusses the development trends, application areas, and key players in the artificial intelligence chipset market, which is highly timely and valuable for readers interested in the artificial intelligence chipset market.

IBM Research: How AI & Automation Protect Businesses from Data Breaches

IBM's report provides sufficient evidence that artificial intelligence, automation, and threat intelligence can address data breaches throughout the lifecycle, reduce costs, and provide stronger evidence. The research found that integrating artificial intelligence and automation into security operations teams can reduce the lifecycle of data breaches by 33% and costs by 33.6%. However, currently, only 28% of enterprises widely apply artificial intelligence and automation. Many enterprises rely on legacy systems, which are easily bypassed by attackers. The significance of this article lies in emphasizing the effectiveness of artificial intelligence and automation in improving cybersecurity and calling on enterprises to widely adopt these technologies to protect data security.

Google's AGI Robot Breakthrough: 54 - Member Team's 7 - Month Work, High Generalization and Reasoning 解释：核心关键词为“谷歌AGI机器人”（Google's AGI Robot）和“新成果”（Breakthrough），标题简洁地概括了主要内容，以动词开头，符合英文习惯，且长度在规定范围内。

The robotics research team at Google DeepMind recently released a robotics project called RT-2. This project took 7 months to develop and uses a large model for training. RT-2 has capabilities such as symbol understanding, reasoning, and human recognition, and can think and complete tasks based on human instructions. By combining the large model with the robot's operational capabilities, RT-2 can accomplish tasks that involve logical leaps, such as from 'extinct animals' to 'plastic dinosaurs'. The results of this project performed well in various sub - category tests, with performance up to three times that of the previous generation of robot models. This research result demonstrates the potential of large models in robotics research and is expected to drive the development of robots in the future.

RWKV: Small Team Aims to Be Android of AI Era with Big Model

Meta Intelligence OS is a startup founded by Bloomberg. It has developed a series of large models based on the open-source model RWKV and aims to become the Android in the era of large models. The RWKV model has superior performance and low cost in inference tasks, thus attracting customers from industries such as finance, law firms, and smart hardware. The business model of Meta Intelligence OS is model customization based on private data and internal AI Agent development. The company hopes to solve the problems of API call latency and data security by deploying large models on terminal devices. Currently, RWKV versions are available on Windows, Mac, and Linux computers, and Android and iOS versions are also in development. Meta Intelligence OS is raising funds and collaborating with chip companies and computing power platforms to create benchmark customers. Luo Xuan said that the decisive battlefield for large models is on hardware, and both terminal devices and the cloud require dedicated chips.

Xunfei Xinhuo X1 Upgrade Version to Be Released Soon, Deep Reasoning Capabilities Reach New Heights

iFlytek launches Spark X1 upgrade on July 25, a Chinese AI model excelling in math, translation, and logic with fewer parameters but comparable to OpenAI/DeepSeek. Features improved accuracy, multilingual support (130+ languages), and precise responses via 'slow thinking' mode, marking a breakthrough in domestic AI tech.....

Google Releases Stable Version of New Gemini 2.5 Flash-Lite: The Perfect Balance of Speed and Cost

Related Recommendations

Study: Global AI Chipset Market to Exceed $700B with 31.8% CAGR

IBM Research: How AI & Automation Protect Businesses from Data Breaches

RWKV: Small Team Aims to Be Android of AI Era with Big Model

Xunfei Xinhuo X1 Upgrade Version to Be Released Soon, Deep Reasoning Capabilities Reach New Heights