Microsoft GitHub announced an exciting update on April 6th, introducing an experimental feature called Rubber Duck for its Copilot CLI. This new feature introduces a "second opinion" review mechanism across models, aimed at helping developers improve the accuracy and efficiency of their code, ultimately boosting AI performance by nearly 75%.

During software development, early decision errors often accumulate into larger issues, and traditional self-review methods are prone to be influenced by the model's own training biases. The introduction of Rubber Duck aims to bring in different models as independent reviewers, offering diverse perspectives and identifying potential errors promptly. This feature allows users to select the Claude series model as the main controller, followed by GPT-5.4 for code review, ensuring the accuracy and completeness of the code.
According to research using the SWE-Bench Pro benchmark, there was a significant performance gap between the standalone operation of Claude Sonnet 4.6 and Opus 4.6. However, after combining Sonnet 4.6 with Rubber Duck, it successfully closed 74.7% of the performance gap. The feature performs even more notably when handling complex tasks, scoring 3.8% higher than the baseline. Specific cases show that it can effectively identify architectural logic vulnerabilities, loop coverage errors, and conflicts across files.
Rubber Duck offers flexible usage options, supporting three types of review modes: active, passive, and user-triggered. The system automatically seeks reviews at key points such as planning, complex implementation, and test writing, and can also be triggered passively when developers face problems. Additionally, users can request reviews at any time, and the system will display feedback content and modification justifications.
Currently, the Rubber Duck feature is available in experimental mode. Users can simply install GitHub Copilot CLI and run the /experimental command to activate this new feature and experience the collaborative work of the Claude model and GPT-5.4.
Key Points:
🌟 1. Microsoft GitHub introduced the Rubber Duck feature, implementing a cross-model AI review mechanism to enhance programming efficiency.
🔍 2. This feature successfully closed 74.7% of the performance gap by combining Claude Sonnet 4.6 and GPT-5.4.
⚙️ 3. Rubber Duck supports multiple review modes, allowing users to proactively request reviews to ensure code quality.
