Kuaishou Begins Developing 100-Billion-Scale Large Models and Multimodal Large Models


OpenAI has integrated real-time speech and vision into ChatGPT, letting users speak while viewing visual content such as maps and charts, with synchronized text transcription. Key features include multimodal on-screen interaction and seamless continuous conversation with latency under 300 ms.
Tsinghua University and partners have released UltraRAG2.1, the first open-source RAG framework built on the MCP architecture. It supports multi-stage reasoning and the evaluation of multimodal retrieval systems through YAML configuration, with no coding required, which lowers the barrier to entry and advances RAG technology.
The 2025 Hong Kong Fintech Week focuses on the integration of fintech and AI, bringing together guests such as Carrie Lam and Geoffrey Hinton. Zhu Guang, CEO of Du Xiaoman, emphasized innovative applications of large models in financial services, which are moving customer service from monthly surveys to real-time responses and driving a transformation centered on the customer.
The "Baidu E-commerce Selection" brand uses large model technology to optimize risk-control review, achieving fully automated review, instant feedback, and high interpretability. This addresses the low efficiency and slow response of traditional manual review and improves e-commerce security and user experience.
A study by the University of Chicago found significant differences in the performance of AI text detectors: some tools are highly accurate, while others frequently misclassify, especially on short texts. The Pangram detector performed best in both accuracy and cost-effectiveness. The study, based on 1,992 human-written texts and output from four mainstream large models, covered six types of text and revealed shortcomings in the reliability and robustness of detectors.