At the Hong Kong FinTech Festival, Ant Data introduced a new technology, the "Multilingual Multimodal Large Model Training Framework," which aims to overcome the bottlenecks that large models face in multilingual environments. The framework shows particular promise for low-resource languages such as Egyptian Arabic, Javanese, Bahasa Indonesia, and Sundanese.
The core of the technology is a language-aware optimization framework. It adopts a "thinking in the target language" mechanism, combined with fine-grained, multi-dimensional reward strategies and automated data solutions, to strengthen the model's understanding and processing of low-resource languages. According to test results, the framework improved accuracy by approximately 9.5% over open-source models of similar scale on mainstream multilingual visual question answering (VQA) benchmarks. On some tasks, it even outperformed mainstream international closed-source models such as GPT-4o and Gemini-2.5-flash, achieving the top score in the evaluation.
In addition to breakthroughs in language models, Ant Data also launched an image security framework. This technology combines visual analysis with common sense reasoning, enabling efficient identification of forgeries and inconsistencies in images. The new framework not only accurately locates tampered areas but also provides explainable analysis, significantly improving risk control capabilities for digital content. The successful implementation of this technology will provide stronger support for digital content protection in various scenarios.
As core technologies of Ant Data's global business, these two capabilities have been widely applied in ZOLOZ's document authentication product (RealDoc), which supports 119 languages and efficiently processes a wide range of business documents and contracts in areas such as insurance claims, credit reviews, and cross-border trade. This not only demonstrates Ant Data's leading position in multilingual processing but also provides a better service experience for global users.
