Baidu Open-sources 3B Model Unlimited OCR: Star Count Exceeds 10,000 in 5 Days, Setting a New Record for Long Document Parsing

Baidu has recently released and open-sourced a 3B parameter end-to-end OCR model called Unlimited OCR, specifically designed for long document parsing scenarios such as books and papers. After its release, the project quickly topped four trending lists on GitHub and HuggingFace, and within five days of being open-sourced, it surpassed 10,000 GitHub Stars.

Technically, Unlimited OCR activates approximately 570M parameters during inference, andintroduces the Reference Sliding Window Attention (R-SWA) mechanism for the first time. This mechanism breaks the traditional "page-by-page parsing + stitching" limitation, enabling the continuous parsing of dozens of pages in one go; at the same time, it keeps the KV Cache in the decoding phase at a constant scale, so that memory usage and computational costs no longer surge with the increase in output length.

In the OmniDocBench v1.6 benchmark test, the model set a new record with a score of 93.92%. In real-world scenarios, its inference speed is about 12.7% faster than DeepSeek OCR, and the speed advantage increases to 35% at a 6000Tokens output length, providing a new approach for massive document digitization and large model long-term memory management.

Half of Users Free Their Hands: Anthropic Survey Shows AI Can Now Handle More Than Half of Work

A recent Anthropic survey shows that nearly half of Claude users believe AI can now independently complete more than half of their daily work tasks. Among them, 33% of users estimate that AI handles 30%-60% of their workload, 14% believe it covers 60%-90%, and 4% of advanced users claim AI almost completely covers their work, reflecting the accelerating infiltration of AI into the workplace.

Report: Baidu Kunlun Chip to Go Public in Hong Kong, Target Valuation of $50 Billion

Baidu's AI chip company, Kunlun Chip, is advancing its plan to go public in Hong Kong, with a target valuation of $50 billion. As a core component of Baidu's AI hardware ecosystem, its listing not only marks a milestone in its capitalization but also injects new variables into the global AI computing power competition. Kunlun Chip has launched multiple generations of self-developed cloud AI chips, which have drawn significant attention from the industry.

AI Opens a New Way of Working: HP and OpenAI Upgrade Strategic Cooperation

HP and OpenAI expand their strategic cooperation to enhance global customer experience and accelerate operational transformation. The initial pilot projects have shown significant results, for example, software engineers have greatly improved efficiency with the AI model, verifying the potential of cutting-edge technology in business operations and laying the foundation for full-scale implementation.

Baidu Open-sources 3B Model Unlimited OCR: Star Count Exceeds 10,000 in 5 Days, Setting a New Record for Long Document Parsing

Related Recommendations

Half of Users Free Their Hands: Anthropic Survey Shows AI Can Now Handle More Than Half of Work

Key Breakthrough in Computing Power Enhancement: Peking University and DeepSeek Jointly Open-Source Large Model Inference Framework DSpark

Report: Baidu Kunlun Chip to Go Public in Hong Kong, Target Valuation of $50 Billion

AI Opens a New Way of Working: HP and OpenAI Upgrade Strategic Cooperation

Cutting Flesh Without Data? Google Forces New AI Training Rules