Beijing, August 19, 2025 — DeepSeek officially released DeepSeek-V3.1, the latest iteration of its flagship large language model. The update doubles the context window from 64k to 128k tokens while maintaining full API compatibility, marking another milestone in the company's open-source AI work.

Technical Upgrades and Market Positioning

DeepSeek-V3.1 arrives only five months after its predecessor, DeepSeek-V3-0324. That March release delivered significant gains in core metrics such as reasoning, programming efficiency, and mathematical computation, laying the groundwork for the further optimizations in V3.1.

Amid intensifying global competition in AI, DeepSeek's rapid iteration pace demonstrates both its technical capability and its responsiveness to the market. Despite export restrictions limiting its access to high-end computing hardware, the company has maintained a strong competitive position in open-source large language models through efficient training methods and optimization strategies.

Core Technical Features

Expanded Context Processing Capability: The headline upgrade in V3.1 is the doubling of the context window from 64k to 128k tokens, which lets the model handle longer multi-turn conversations and more complex queries and gives users a more coherent, in-depth interactive experience.
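For developers, the practical question is whether a given input will actually fit in the enlarged window. The pre-flight check below is a rough illustrative sketch: the 4-characters-per-token ratio is a common English-text heuristic rather than an exact tokenizer count, and the file name is hypothetical.

```python
# Rough pre-flight check that a document will fit in the 128k-token window.
# The 4-chars-per-token ratio is a common heuristic for English text, not
# an exact tokenizer count; real token counts vary by language and content.
CONTEXT_WINDOW = 128_000     # V3.1's advertised window, in tokens
CHARS_PER_TOKEN = 4          # heuristic assumption
RESPONSE_BUDGET = 4_000      # tokens reserved for the model's reply

def fits_in_context(text: str) -> bool:
    estimated_tokens = len(text) / CHARS_PER_TOKEN
    return estimated_tokens + RESPONSE_BUDGET <= CONTEXT_WINDOW

# "long_report.txt" is a hypothetical input file used for illustration.
with open("long_report.txt", encoding="utf-8") as f:
    document = f.read()

print(f"Estimated tokens: {len(document) / CHARS_PER_TOKEN:,.0f}")
print("Fits in 128k window:", fits_in_context(document))
```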

Optimized Output Structure: Responding to early user feedback, V3.1 refines its language style and now favors structured output, such as tables and lists, making information easier to read and use.

Enhanced Physical Understanding Ability: The new version shows marked improvements in simulating physical phenomena and understanding related concepts, strengthening support for scientific computing and engineering applications.

Architecture Continuation and Optimization: V3.1 retains the proven Mixture of Experts (MoE) architecture and is specifically optimized for routine reasoning tasks. In these scenarios the model does not need to activate the "DeepThink" deep-thinking mode, preserving processing efficiency without sacrificing output quality.
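To make the MoE idea concrete, here is a toy sketch of top-k expert routing: a small gating network scores every expert for each token, and only the highest-scoring experts run. All dimensions, weights, and the numpy implementation are illustrative assumptions, not DeepSeek's actual design.

```python
import numpy as np

# Toy illustration of Mixture-of-Experts (MoE) routing. A gating network
# scores each expert for a token and only the top-k experts are evaluated.
# This is a generic sketch of the technique, not DeepSeek's architecture.
rng = np.random.default_rng(0)

NUM_EXPERTS, TOP_K, DIM = 8, 2, 16

# Each "expert" is just a small linear map here.
experts = [rng.standard_normal((DIM, DIM)) * 0.1 for _ in range(NUM_EXPERTS)]
gate = rng.standard_normal((DIM, NUM_EXPERTS)) * 0.1  # router weights

def moe_layer(token: np.ndarray) -> np.ndarray:
    scores = token @ gate                      # one routing score per expert
    top = np.argsort(scores)[-TOP_K:]          # indices of the k best experts
    weights = np.exp(scores[top])
    weights /= weights.sum()                   # softmax over selected experts
    # Only the chosen experts run; the rest stay idle, which is what makes
    # MoE cheaper per token than a dense layer of the same total size.
    return sum(w * (token @ experts[i]) for w, i in zip(weights, top))

output = moe_layer(rng.standard_normal(DIM))
print(output.shape)  # (16,)
```

Sparse activation of this kind is how an MoE model can carry a very large total parameter count while keeping per-token compute modest.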

Deployment and Compatibility

DeepSeek-V3.1 is now available through the company's main user-facing channels: the official website, the mobile apps, and the WeChat mini program. The company particularly emphasized that the API remains fully backward compatible, so existing users and developers can migrate to the new version without changing their integration code or calling conventions.
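Because the interface is unchanged, a request that worked against V3-0324 should run against V3.1 as-is. The minimal sketch below assumes the OpenAI-compatible endpoint and the "deepseek-chat" model name from DeepSeek's public API documentation; verify both against the current docs before use.

```python
# Sketch of a call against DeepSeek's OpenAI-compatible API. The base URL
# and model name follow DeepSeek's public API documentation at the time of
# writing; check the current docs before relying on them.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",    # placeholder; supply your own key
    base_url="https://api.deepseek.com",
)

response = client.chat.completions.create(
    model="deepseek-chat",              # same model name as before the upgrade
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize the attached report."},
    ],
)
print(response.choices[0].message.content)
```

The point of full backward compatibility is that snippets like this need no edits to benefit from the larger context window.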

Open Source Commitment and Future Plans

Although the model weights for V3.1 had not yet appeared on the Hugging Face platform at the time of publication, DeepSeek reiterated its long-term commitment to the open-source community, pledging to continue its open-source release strategy and to support the global AI research and developer community.

The release of DeepSeek-V3.1 further consolidates the company's leading position in China's AI innovation landscape and contributes meaningful progress to the development of open-source large language models worldwide. With its substantially improved context handling and multiple performance optimizations, V3.1 is expected to deliver greater value in enterprise applications, academic research, and personal use.