Today, Meta AI Lab turned Llama3.1 into an "X-ray machine" for reasoning: the new model, CoT-Verifier, is now available on Hugging Face. It dissects the "circuit paths" behind each step of a chain of thought (CoT), so errors no longer stay hidden inside the black box.
Traditional verification only checks whether the final output is correct. Meta takes a different angle: run a forward pass through the model, then extract an attribution graph for each reasoning step. The team found that the graph structures of correct and incorrect steps differ markedly, like two entirely different circuit boards. Training a lightweight classifier on these "graph features" pushes the accuracy of predicting erroneous steps to SOTA. Each task type (math, logic, common sense) also shows its own "fault signature," suggesting that reasoning failures are not random noise but quantifiable, classifiable computational patterns.
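
The post doesn't spell out which graph features Meta uses or which classifier it trains, so the following is only a minimal sketch of that pipeline: synthetic attribution graphs built with networkx, a handful of structural statistics as features, and a logistic-regression verifier. Every name and number in it is an illustrative placeholder, not Meta's implementation.

```python
# Sketch: turn each CoT step's attribution graph into a feature vector,
# then train a lightweight classifier to flag erroneous steps.
# The graphs and labels here are synthetic placeholders, not Meta's data.
import networkx as nx
import numpy as np
from sklearn.linear_model import LogisticRegression

def graph_features(g: nx.DiGraph) -> np.ndarray:
    """Simple structural statistics of one step's attribution graph."""
    weights = [d.get("weight", 0.0) for _, _, d in g.edges(data=True)]
    return np.array([
        g.number_of_nodes(),
        g.number_of_edges(),
        nx.density(g),
        float(np.mean(weights)) if weights else 0.0,
        float(np.max(weights)) if weights else 0.0,
    ])

# Synthetic stand-ins: (attribution_graph, step_is_wrong) pairs.
rng = np.random.default_rng(0)
dataset = []
for _ in range(200):
    is_wrong = rng.random() < 0.3
    g = nx.gnp_random_graph(20, 0.2 if is_wrong else 0.1,
                            directed=True, seed=int(rng.integers(1_000_000)))
    for u, v in g.edges():
        g[u][v]["weight"] = float(rng.random())
    dataset.append((g, int(is_wrong)))

X = np.stack([graph_features(g) for g, _ in dataset])
y = np.array([label for _, label in dataset])

verifier = LogisticRegression(max_iter=1000).fit(X, y)
print("train accuracy:", verifier.score(X, y))
```

In this toy setup the "wrong" steps are simply denser graphs, which is enough to show why cheap structural statistics can separate the two classes without ever looking at the text of the step.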

More importantly, the attribution graph can not only "diagnose" but also "operate." In experiments, Meta applied targeted ablation or weight shifting to the most suspect nodes, improving Llama3.1's accuracy on the MATH dataset by 4.2 percentage points without retraining the main model. In other words, CoT-Verifier turns reasoning error correction from "post-mortem analysis" into "intra-operative navigation."
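
The intervention itself is not detailed in the post; conceptually, ablating a suspect node amounts to editing activations during the forward pass. Here is a minimal PyTorch sketch using a toy MLP in place of a real transformer block, with invented unit indices standing in for the high-anomaly nodes an attribution graph might flag.

```python
# Sketch: ablate "suspect" hidden units by zeroing them during the forward pass.
# The toy MLP stands in for a transformer block; the hooked layer and the unit
# indices are illustrative, not the nodes Meta actually intervened on.
import torch
import torch.nn as nn

block = nn.Sequential(nn.Linear(16, 64), nn.GELU(), nn.Linear(64, 16))
suspect_units = [3, 17, 42]  # hypothetical high-anomaly nodes from the attribution graph

def ablate(module, inputs, output):
    output = output.clone()
    output[..., suspect_units] = 0.0  # knock out the suspect activations
    return output  # returning a tensor replaces the layer's output

handle = block[0].register_forward_hook(ablate)  # hook the first linear layer
x = torch.randn(2, 16)
with torch.no_grad():
    patched = block(x)
handle.remove()  # restore normal behavior after the intervention
```

Because the edit happens at inference time through a hook, nothing about the base weights changes, which matches the post's claim that the MATH gain comes without retraining the main model.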
The model is open source and ships with one-click reproduction scripts. Developers simply feed the CoT trace to be verified into the Verifier and get back a "structural anomaly score" for each step, along with a pointer to the most likely faulty upstream node. At the end of the paper, Meta says the next step is to apply the same graph-intervention approach to code generation and multimodal reasoning, making "white-box surgery" the new standard for LLMs.
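The released Verifier's API is not shown here, so rather than guess at its interface, the sketch below illustrates the described workflow with a self-contained stand-in: a crude density-based anomaly score per step and a naive "largest outgoing attribution" rule for picking the suspect upstream node. Both heuristics are assumptions for illustration only.

```python
# Sketch: score each CoT step's attribution graph and point at the most
# suspect upstream node. The density-deviation heuristic is a placeholder
# for whatever the released Verifier actually computes.
import networkx as nx
import numpy as np

def anomaly_score(g: nx.DiGraph, baseline_density: float = 0.1) -> float:
    """Distance of this step's graph density from a 'healthy' baseline."""
    return abs(nx.density(g) - baseline_density)

def suspect_node(g: nx.DiGraph):
    """Node with the largest outgoing attribution weight (a crude culprit guess)."""
    out_weight = {n: sum(d.get("weight", 0.0) for _, _, d in g.out_edges(n, data=True))
                  for n in g.nodes}
    return max(out_weight, key=out_weight.get) if out_weight else None

# Synthetic stand-ins for a 3-step CoT trace's attribution graphs.
rng = np.random.default_rng(1)
steps = []
for p in (0.08, 0.11, 0.25):  # the last step is deliberately "anomalous"
    g = nx.gnp_random_graph(15, p, directed=True, seed=7)
    for u, v in g.edges():
        g[u][v]["weight"] = float(rng.random())
    steps.append(g)

for i, g in enumerate(steps, 1):
    print(f"step {i}: anomaly={anomaly_score(g):.3f}, suspect node={suspect_node(g)}")
```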
