Automattic Reaches Agreement with AI Companies on Training Data, Tumblr Owner Involved in Data Sharing


WordPress founder Matt Mullenweg unveiled AI tool Telex at WordCamp US2025 to simplify website creation via AI. Currently experimental, users can generate content via telex.automattic.ai.....
Perplexity recently announced a partnership with several well-known media outlets to jointly create a customized Q&A engine powered by artificial intelligence. The company's business director, Dmitry Shevelenko, revealed that this initiative has been in the works since January of this year, aimed at promoting the company's own development.
Tumblr's parent company is negotiating with OpenAI to sell user posts for AI training. Automattic plans to release a setting that allows users to opt out of data sharing with third parties. Automattic has crawled all public posts on Tumblr published from 2014 to 2023. Reddit has signed a $60 million annual agreement with Google to use user data for AI training. Shutterstock has signed with OpenAI.
According to a report by 404 Media, the owner of Tumblr and WordPress.com is in talks with AI companies Midjourney and OpenAI. Automattic plans to share data with third parties, including training data obtained from user posts. The company has scraped all public post content from Tumblr from 2014 to 2023, including content that will not be publicly displayed.
Reddit has generated $203 million in revenue through data licensing agreements with several AI companies. Reddit emphasizes the positive impact of its relationships with AI providers on its IPO prospects. The data from Reddit is crucial for training AI models and has become the focus of the article. AI providers are actively pursuing licensing agreements to acquire high-quality data for model training.