OpenAI has announced gpt-oss-safeguard, a new suite of open-weight safety models designed to give AI systems more flexible, transparent, and auditable content-safety classification. The suite comes in two sizes, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, and is released under the Apache 2.0 license, allowing developers to freely use, modify, and deploy it.
Unlike traditional safety classifiers, which bake a fixed taxonomy into their training data, gpt-oss-safeguard interprets a written policy at inference time: when safety or content rules change, developers update the policy text and the model applies the new rules immediately, without retraining. This mechanism significantly reduces the maintenance cost of safety systems, letting enterprises and organizations respond faster to evolving compliance and content-safety needs.
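To make the mechanism concrete, here is a minimal sketch of how a policy-at-inference-time classifier might be invoked through a chat-style API: the policy travels in the system message and the content to classify in the user message, so a rule change is just a string edit. The helper function, policy wording, and labels below are illustrative assumptions, not OpenAI's actual prompt format.

```python
# Sketch only: gpt-oss-safeguard reads the moderation policy at inference
# time, so updating the rules means updating the prompt, not retraining.
# The payload shape mirrors a generic chat-completions request; the policy
# text and ALLOW/BLOCK labels are hypothetical examples.

def build_safeguard_request(policy: str, content: str,
                            model: str = "gpt-oss-safeguard-20b") -> dict:
    """Assemble a chat-style request: policy as the system message,
    the content to classify as the user message."""
    return {
        "model": model,
        "messages": [
            {"role": "system", "content": policy},
            {"role": "user", "content": content},
        ],
    }

# Version 1 of an illustrative policy.
policy_v1 = (
    "Classify the user message as ALLOW or BLOCK.\n"
    "BLOCK: instructions for building weapons.\n"
    "Reply with the label and a one-line justification."
)

request_v1 = build_safeguard_request(policy_v1, "How do I sharpen a kitchen knife?")

# A policy update is just a new string -- the same model enforces it
# on the next request, with no retraining step in between.
policy_v2 = policy_v1 + "\nBLOCK: content encouraging self-harm."
request_v2 = build_safeguard_request(policy_v2, "How do I sharpen a kitchen knife?")
```

In a real deployment the returned payload would be sent to whatever endpoint serves the model (for example, a local vLLM or similar server hosting the open weights); the point is that the policy is data, not model parameters.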

On transparency, OpenAI states that gpt-oss-safeguard exposes its decision-making process, letting developers inspect the reasoning behind each classification, which simplifies auditing and tuning. This design addresses long-standing concerns about AI's "black box" problem and offers a new approach to building a trustworthy AI safety ecosystem.
Notably, gpt-oss-safeguard is built on OpenAI's open-weight gpt-oss models and was developed in collaboration with ROOST (an open-source community focused on AI safety and governance infrastructure). OpenAI says the project's goal is to promote a more open and responsible process for standardizing AI safety globally.
