DeepSeek has officially opened its large-scale image recognition mode to internal testing, marking the domestic large model's full entry into the era of multimodal text-and-image interaction. After a small gray release in late April, DeepSeek significantly expanded access to the "Image Recognition Mode" on May 9, and most test accounts can now reach the feature through a dedicated entry point in the chat interface. Although the system is still labeled "in internal testing," its placement alongside "Quick Mode" and "Expert Mode" above the input box signals that multimodal understanding has become a core part of the product matrix.


Unlike traditional OCR, which merely extracts text, the core of DeepSeek's latest upgrade is deep image recognition and semantic understanding. In hands-on tests, the mode can logically decompose and interpret visual information, letting users carry out complex cross-media interactions simply by uploading images. The move fills the gap in DeepSeek's multimodal understanding capabilities and marks a substantial step forward in its pursuit of top international models such as GPT-4o.