GitHub, the world's largest code hosting platform, recently dropped a bombshell: it officially announced that, starting April 24, 2026, it will use user interaction data to train its AI models. Many developers have wryly dubbed the move a "CTRL-Z" operation, because GitHub had repeatedly emphasized in public statements its commitment to respecting users' private data, and the new policy clearly breaks that understanding.

Controversy over "On by Default": Private Repositories Are No Longer Absolutely Private
According to GitHub's updated privacy policy, users of Copilot's Free, Pro, and Pro+ tiers are all included in this data collection. The system will automatically collect detailed data including code snippets, input and output content, cursor context, and even file names and directory structures. What unsettles the community most is that even code stored in private repositories may be captured for model training if the user has Copilot enabled while editing.

GitHub's Chief Product Officer, Mario Rodriguez, explained that internal employee tests showed that adding real interaction data significantly improves the AI's accuracy in detecting bugs. However, the "on by default" rather than "manual opt-in" approach has triggered strong backlash in the developer community, with downvotes piling up under the announcement almost instantly.
How to Protect Yourself: Exemptions for Enterprise Users and How to Opt Out Manually
In this data-harvesting battle, not all users are left in a passive position. GitHub has explicitly stated that organizations on the paid Copilot Business and Enterprise plans, as well as verified students and teachers, are protected by contract terms, and their data will not be used for training.
