Tencent Releases Open-Source HunyuanOCR Model, Achieving Multiple SOTA Results with Only 1B Parameters

Tencent recently launched HunyuanOCR, a new open-source model with only 1B parameters. The model is based on Tencent's proprietary Hunyuan multimodal architecture and has achieved SOTA (state-of-the-art) performance on multiple industry-standard OCR tasks. Tencent stated that HunyuanOCR's "end-to-end" design lets the model reach its best result in a single forward inference pass.

HunyuanOCR consists of three core components: a native-resolution visual encoder, an adaptive visual adapter, and a lightweight Hunyuan language model. Unlike other OCR models on the market, HunyuanOCR is trained and run end to end, and it demonstrates strong reasoning capabilities thanks to large-scale application-oriented data and online reinforcement learning.

In complex document parsing tests, HunyuanOCR scored 94.1, surpassing multiple leading models, including Google's Gemini 3 Pro. Its text detection and recognition capabilities are also strong across a range of scenarios, including documents, artistic fonts, street scenes, handwriting, advertisements, and receipts, where it performs well against other open-source and commercial OCR models. On the OCRBench benchmark, the model reached a total score of 860 points, making it the top performer among models with fewer than 3B parameters.

HunyuanOCR also supports translation for 14 languages and performs well in that area. The model can digitize complex documents: it organizes the text in scanned images according to reading order, represents formulas in LaTeX, and represents complex tables in HTML.
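Because tables are returned as HTML, downstream code needs to turn that markup back into rows and cells. The following is a minimal sketch of that step using only the Python standard library. The sample string is a hypothetical model output invented for illustration, and `TableRows` and `table_to_rows` are illustrative names, not part of HunyuanOCR.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # drop any cell left unclosed


def table_to_rows(html):
    """Return the table as a list of rows, each a list of cell strings."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    return parser.rows


# Hypothetical model output, invented for illustration only.
sample = (
    "<table>"
    "<tr><th>Item</th><th>Amount</th></tr>"
    "<tr><td>Paper</td><td>12.50</td></tr>"
    "</table>"
)
rows = table_to_rows(sample)
```

This sketch ignores `colspan` and `rowspan`, so merged cells in genuinely complex tables would need extra handling before the rows line up as a grid.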

In practice, HunyuanOCR is suited to tasks such as multilingual document parsing, invoice field extraction, video subtitle recognition, and photo translation, which points to broad application potential.

GitHub: https://github.com/Tencent-Hunyuan/HunyuanOCR

Key Points:
🔍 With only 1B parameters, HunyuanOCR achieves multiple SOTA results through its end-to-end design.
📄 The model supports complex document parsing as well as text detection and recognition across a wide range of scenarios.
🌐 HunyuanOCR also offers translation for 14 languages, which makes it well suited to photo translation.
