Today, the AI company Anthropic officially released an upgraded version of its flagship model, Claude Opus4 - Claude Opus4.1. This update aims to comprehensively enhance the model's agentic tasks, real-world programming, and reasoning capabilities, especially in programming and data analysis, which has attracted significant attention.
According to official information, the biggest highlight of Claude Opus4.1 is its remarkable improvement in programming performance. In the SWE-bench Verified programming evaluation, it achieved a score of 74.5%, demonstrating its strong capability in handling complex code problems. GitHub feedback also confirms this, as developers generally believe that Opus4.1 performs better than its predecessor in tasks such as multi-file code refactoring. Additionally, Japan's e-commerce giant Rakuten Group pointed out that the new model can more accurately locate errors in large codebases, effectively reducing unnecessary changes and potential bugs.
In addition to the leap in programming capabilities, Opus4.1 has made significant progress in deep research and data analysis, especially in terms of detail tracking and agentic search capabilities. The benchmark test results from Windsurf show that Opus4.1's performance has improved by one standard deviation compared to Opus4, a level of advancement comparable to the jump from Sonnet3.7 to Sonnet4.
Although this upgrade brings significant performance improvements, Anthropic emphasized that Opus4.1 is a progressive improvement, not a revolutionary update. It will continue to be deployed according to the **AI Safety Level 3 (ASL-3)** standard and shows robustness in multiple safety assessments. The new model has slightly improved in refusing illegal requests, with a harm-free response rate reaching 98.76%. Additionally, in terms of child safety, political bias, and agentic ability tests, Opus4.1's risk levels remain consistent with the previous version, and its cooperation in extreme abuse scenarios has decreased by about 25%, showing stronger security.
Claude Opus4.1 is now available to all paid users, Claude Code, API, Amazon Bedrock, and Google Cloud Vertex AI, with the same price as Opus4.