UK Government Increases Investment to Promote Safe Research on Advanced AI Models


Google DeepMind launches Gemma Scope 2, an open explainability toolkit designed to analyze information processing at all levels of the Gemma 3 language model, ranging from 270 million to 2.7 billion parameters. The tool helps AI safety and alignment teams track internal features of the model to address issues such as jailbreaking, hallucinations, or inappropriate behavior.
OpenAI plans to introduce parental controls, including emergency contact and AI alerts, to prevent teen suicides after a 16-year-old's death linked to ChatGPT.....
UK launches AI crime map challenge to develop a real-time crime prediction system for England and Wales by 2030, focusing on high-risk incidents like knife crimes. Part of a £500M R&D plan with £4M initial funding, prototype expected by 2026.....
According to Reuters, citing sources and a report by The Information, SoftBank CEO Masayoshi Son is reportedly planning to borrow $160 billion for investments in artificial intelligence (AI). The report states that company executives confirmed this intention during talks with banks last week. This move signals SoftBank's continued expansion in the AI sector, particularly amid intensifying global tech competition. Beyond the planned $160 billion, SoftBank is projected to have...
Recently, Nvidia announced the addition of three safety features on its NeMo Guardrails platform, aimed at helping businesses better manage and control AI chatbots. These microservices target common challenges in AI safety and content moderation, offering a range of practical solutions. Among them, the Content Safety service can review content before the AI responds to users, detecting any potential harmful information. This service helps prevent...