Microsoft President Emphasizes No Superintelligent AGI in the Near Term


Google DeepMind launches Gemma Scope 2, an open explainability toolkit designed to analyze information processing at all levels of the Gemma 3 language model, ranging from 270 million to 2.7 billion parameters. The tool helps AI safety and alignment teams track internal features of the model to address issues such as jailbreaking, hallucinations, or inappropriate behavior.
OpenAI plans to introduce parental controls, including emergency contact and AI alerts, to prevent teen suicides after a 16-year-old's death linked to ChatGPT.....
Recently, Nvidia announced the addition of three safety features on its NeMo Guardrails platform, aimed at helping businesses better manage and control AI chatbots. These microservices target common challenges in AI safety and content moderation, offering a range of practical solutions. Among them, the Content Safety service can review content before the AI responds to users, detecting any potential harmful information. This service helps prevent...
Recently, OpenAI showcased its more proactive red team testing strategy in the field of AI safety, surpassing its competitors, especially in the critical areas of multi-step reinforcement learning and external red team testing. The two papers released by the company establish new industry standards for enhancing the quality, reliability, and safety of AI models. The first paper, 'OpenAI's AI Model and System External Red Team Testing Methodology,' highlights the effectiveness of specialized external teams in identifying security vulnerabilities that internal testing may overlook. These external teams consist of cyber...
In a recent interview, Microsoft's AI Chief Mustafa Suleyman and OpenAI CEO Sam Altman expressed a notable disparity regarding the timeline for achieving Artificial General Intelligence (AGI). Altman stated in a Reddit Q&A that AGI could be realized with the current hardware conditions, while Suleyman was skeptical, arguing that the current hardware technology is insufficient to support the development of AGI.