Microsoft's MD System Outperforms GPT-5.5 in Vulnerability Detection, Amazing Capabilities!

Today, as cybersecurity becomes increasingly important, Microsoft's own code security team launched a multi-model intelligent agent scanning framework called MDASH on May 13. This new system's design concept has revolutionized traditional single AI models, adopting a multi-agent collaboration strategy to enhance the accuracy and efficiency of code security detection.

The MDASH framework integrates over 100 specialized AI agents based on different cutting-edge large models or lightweight models. These agents each take on specific roles throughout the vulnerability detection process, including code preparation, vulnerability scanning, result verification, data deduplication, evidence generation, and patch validation. This clearly defined division of labor enables the system to fully leverage the strengths of each model when handling complex security testing tasks.

In authoritative CyberGym public benchmark tests, MDASH showed remarkable performance, surpassing Anthropic's Mythos model and OpenAI's GPT-5.5. After multiple rounds of testing, MDASH successfully uncovered 16 previously undiscovered vulnerabilities, including four high-risk remote code execution vulnerabilities, demonstrating its strong ability to identify vulnerabilities.

Even more impressive is that in private test driver verification with 21 manually implanted vulnerabilities, MDASH achieved a 100% identification rate with zero false positives. This achievement shows that MDASH not only accurately identifies vulnerabilities but also effectively reduces false positives, greatly improving the reliability of security testing.

Notably, retrospective test data shows that MDASH also performed well in recall rates for historical vulnerabilities, achieving a 96% recall rate for clfs.sys vulnerabilities over the past five years and a 100% recall rate for tcpip.sys. This data fully proves MDASH's strength in the field of vulnerability detection.

Currently, MDASH has begun assisting Microsoft's internal engineering teams in product security enhancements and has started internal preview testing for limited customers. It is expected that this new system will play an important role in future cybersecurity work, protecting users' digital assets.

Key Points:

🌟 MDASH uses a multi-agent collaboration strategy, integrating over 100 specialized AI agents to improve vulnerability detection efficiency.

🔍 In CyberGym tests, MDASH successfully found 16 new vulnerabilities, surpassing GPT-5.5 and Mythos models.

✅ In private tests, MDASH achieved a 100% vulnerability identification rate with no false positives, showing its high accuracy and reliability.

Kuaishou KwaiKAT Releases KAT-Coder-Pro V2.5: Say Goodbye to Code补 - The First Domestic Agentic Programming Model That Can Run the Entire Project End-to-End

Kuaishou's KwaiKAT team launches KAT-Coder-Pro V2.5, an agentic coding model tackling the gap between high benchmarks and real-world performance. Upgraded long-range engineering, general agentic abilities, and large-scale reinforcement learning push AI from code completion to autonomous software engineering. Key innovation: self-developed AutoBuilder pipeline converts runtime environments into training support.....

AI Startup Lyzr Completes $100 Million Series B Funding Using Its Own Self-Developed Agent

On July 9, three-year-old enterprise AI agent company Lyzr secured $100M in Series B funding at a ~$500M valuation. Its self-developed AI system SivaClaw independently led the entire negotiation and core process. Bloomberg called it a breakthrough for AI agents in complex commercial capital operations.....

Microsoft Began to Get Rid of OpenAI and Anthropic: In-House MAI Model Quietly Takes Over Excel and Outlook

Microsoft has begun replacing OpenAI and Anthropic models with its in-house MAI series models in core Office products such as Excel and Outlook, processing thousands of AI prompts per week. This move aims to develop a more cost-competitive self-built model and reduce high external costs. Microsoft's AI Chief Suliman stated that the company will reduce and eventually eliminate external dependence, saving a huge bill annually.

Microsoft's Major Shift in AI Strategy: Reducing Burden and Costs, Excel and Outlook Lead the Transition Away from External Dependencies

To reduce costs and build its own AI competitiveness, Microsoft is gradually replacing OpenAI and Anthropic models with its self-developed MAI model in Excel and Outlook. Currently, thousands of AI tasks are being handled independently by MAI each week, although they still represent a small portion of total usage. This marks a clear signal of the company's shift toward its own technology system.

Microsoft's MD System Outperforms GPT-5.5 in Vulnerability Detection, Amazing Capabilities!

Related Recommendations

Kuaishou KwaiKAT Releases KAT-Coder-Pro V2.5: Say Goodbye to Code补 - The First Domestic Agentic Programming Model That Can Run the Entire Project End-to-End

AI Startup Lyzr Completes $100 Million Series B Funding Using Its Own Self-Developed Agent

NVIDIA Vera CPU Is Here: Designed From Scratch for AI Agents, 1.5 Times Faster, OpenAI and Anthropic Will Use It

Microsoft Began to Get Rid of OpenAI and Anthropic: In-House MAI Model Quietly Takes Over Excel and Outlook

Microsoft's Major Shift in AI Strategy: Reducing Burden and Costs, Excel and Outlook Lead the Transition Away from External Dependencies