Beyond Question-and-Answer: JD.com Open-Sources Real-Time Video Interaction Model JoyAI-VL-Interaction

As artificial intelligence moves toward real-time interaction, JD.com has officially open-sourced JoyAI-VL-Interaction, a vision-language model for real-time video interaction. Described as the world's first fully open-source interactive vision model, it has received deep support from vLLM-Omni and marks a shift for AI assistants from traditional "passive response" to an autonomous "watch and speak" mode.

Earlier systems began processing video only after the user asked a question, which introduced lag. JoyAI-VL-Interaction instead observes the video stream continuously and decides on its own when to join the conversation and when to stay silent, making interaction feel more natural and fluid.
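As a rough illustration of the "watch and speak" idea, the sketch below runs a decision function over every incoming frame and lets it return either a remark or `None` for silence. It does not use the model's real interface: `Frame`, `watch_and_speak`, and `speak_on_change` are hypothetical names, and the "speak only when the scene changes" rule is an assumed stand-in for whatever the model actually learns.

```python
from typing import Callable, Iterable, List, Optional, Tuple

# Hypothetical stand-in for a video frame: (timestamp in seconds, scene description).
Frame = Tuple[float, str]


def watch_and_speak(
    frames: Iterable[Frame],
    decide: Callable[[Frame, List[Frame]], Optional[str]],
) -> List[Tuple[float, str]]:
    """Observe every frame; speak only when `decide` returns a remark."""
    history: List[Frame] = []
    utterances: List[Tuple[float, str]] = []
    for frame in frames:
        remark = decide(frame, history)
        if remark is not None:  # None means "stay silent"
            utterances.append((frame[0], remark))
        history.append(frame)
    return utterances


def speak_on_change(frame: Frame, history: List[Frame]) -> Optional[str]:
    """Assumed toy policy: stay silent while the scene is unchanged."""
    if history and history[-1][1] == frame[1]:
        return None
    return f"I now see: {frame[1]}"


stream = [(0.0, "empty desk"), (1.0, "empty desk"), (2.0, "person sits down")]
result = watch_and_speak(stream, speak_on_change)
```

Here the loop never waits for a user question: observation is continuous, and silence is simply one of the possible outputs.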

This real-time responsiveness is crucial for handling dynamic information. Traditional video understanding is often constrained by an "upload first, then analyze" workflow, which struggles in latency-sensitive scenarios such as security monitoring, live-broadcast commentary, and operation guidance. JoyAI-VL-Interaction processes the video stream as it arrives, keeping its responses in step with what is happening on screen.

A notable technical highlight is the "background delegation" mechanism. For heavyweight tasks such as code generation, complex reasoning, or tool calls, the model can hand the work off to a background Agent system while the front-end model keeps observing the scene in real time. Running observation and task execution in parallel lets the assistant keep communicating with the user while the complex logic runs.
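The delegation pattern described above can be sketched with Python's `asyncio`: a front-end loop keeps consuming frames while a slow task runs concurrently. This is only an illustration under stated assumptions, not the project's implementation. `background_agent`, `front_end`, the trigger string, and the sleep durations are all hypothetical.

```python
import asyncio
from typing import List, Optional


async def background_agent(task: str) -> str:
    """Hypothetical slow worker (code generation, tool calls, and so on)."""
    await asyncio.sleep(0.1)  # simulated heavy work
    return f"done: {task}"


async def front_end(frames: List[str]) -> List[str]:
    """Keep observing frames while a delegated task runs in the background."""
    log: List[str] = []
    pending: Optional[asyncio.Task] = None
    for frame in frames:
        if frame == "user asks for code" and pending is None:
            # Hand the heavy task off instead of blocking the observation loop.
            pending = asyncio.create_task(background_agent("generate code"))
            log.append("delegated")
        else:
            log.append(f"observed: {frame}")
        if pending is not None and pending.done():
            log.append(pending.result())
            pending = None
        await asyncio.sleep(0.01)  # simulated gap between frames
    if pending is not None:
        log.append(await pending)  # collect a result that arrives after the last frame
    return log


log = asyncio.run(
    front_end(["frame 1", "user asks for code", "frame 3", "frame 4", "frame 5"])
)
```

In this sketch the frames after the delegation point are still logged as observed before the background result comes back, which is the behavior the article attributes to the front-end model.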

For compatibility and extensibility, the model accepts video input from cameras, live streams, and surveillance feeds, and lets developers swap out the ASR, TTS, and long-term memory modules or external API interfaces to suit their business needs.
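One common way to make components replaceable like this is to code against small interfaces. The sketch below uses `typing.Protocol` for that purpose. The interface names (`ASR`, `TTS`), the `Pipeline` class, and the toy implementations are assumptions for illustration and are not taken from the JoyAI-VL-Interaction codebase.

```python
from dataclasses import dataclass
from typing import Protocol


class ASR(Protocol):
    """Hypothetical speech-recognition interface."""
    def transcribe(self, audio: bytes) -> str: ...


class TTS(Protocol):
    """Hypothetical speech-synthesis interface."""
    def synthesize(self, text: str) -> bytes: ...


@dataclass
class Pipeline:
    """Depends only on the interfaces, so either module can be swapped."""
    asr: ASR
    tts: TTS

    def respond(self, audio: bytes) -> bytes:
        text = self.asr.transcribe(audio)
        return self.tts.synthesize(f"You said: {text}")


class EchoASR:
    """Toy ASR: treats the audio bytes as UTF-8 text."""
    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class PlainTTS:
    """Toy TTS: returns the text as bytes."""
    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


class UpperTTS:
    """Alternative toy TTS, standing in for a different vendor."""
    def synthesize(self, text: str) -> bytes:
        return text.upper().encode("utf-8")


pipeline = Pipeline(asr=EchoASR(), tts=PlainTTS())
plain_reply = pipeline.respond(b"hello")

pipeline.tts = UpperTTS()  # swap the TTS module without touching the pipeline
upper_reply = pipeline.respond(b"hello")
```

Because `Pipeline` only relies on the method signatures, replacing a module is a one-line change for the caller.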
