Jan Team Releases Jan-v2-VL-Max! A 30B Multimodal Model Specializing in Long-Term Agent Tasks, Stable Execution of Long Sequences Outperforms Gemini 2.5 Pro

At a critical stage where AI agents are evolving toward complex, multi-step tasks, the open-source community welcomes a new rising star. The Jan team has officially released Jan-v2-VL-Max—a 30-billion parameter multimodal large model specifically designed for long-term, high-stability automated execution scenarios. It has surpassed Google's Gemini 2.5 Pro and DeepSeek R1 in key metrics, injecting strong momentum into the open-source agent ecosystem.

Focusing on the "error accumulation" problem, it effectively tackles "off-track" issues in multi-step execution

Currently, multimodal agents often face the "error accumulation" issue when executing long sequences of operations (such as automated UI operations or cross-application task flows), where small deviations in intermediate steps lead to significant task deviations later on. Jan-v2-VL-Max introduces a LoRA-based RLVR (Reinforced Long-horizon Vision-Language Reasoning) technology, significantly improving the consistency and interference resistance of the reasoning chain while maintaining the capabilities of the Qwen3-VL-30B base model, ensuring accurate execution even after dozens of steps.

Top in the "Hallucination-Decay Return" test, defining a new benchmark for agents

The model performs exceptionally well in the new evaluation benchmark "Hallucination-Decay Return (HDR)," which specifically measures how quickly the return rate decreases due to hallucinations or logical breakdowns as the task length increases. Jan-v2-VL-Max maintains high return stability in long-sequence tasks, surpassing Gemini 2.5 Pro and DeepSeek R1, verifying its reliability in real-world automation scenarios.

Ready to use, supporting efficient local deployment

To lower the entry barrier, the Jan team provides:

- A web-based interactive interface, allowing users to upload images, input instructions, and test multi-step automation processes;

- A vLLM-optimized local deployment solution that supports efficient operation on consumer-grade GPUs, making it easy for developers to integrate into their own agent systems.

A breakthrough in "long thinking" for the open-source community

Although Jan-v2-VL-Max achieves only a "minor improvement" in long-sequence execution compared to the base model, in the agent field, every 1% increase in stability represents a qualitative change in usability. This achievement marks that the open-source community is moving from "single-step response" to "long-term planning," providing a practical open-source foundation for high-value scenarios such as UI automation, robot control, and multi-tool collaboration.

AIbase believes that when the competition among large models shifts from "who is smarter" to "who is more reliable," the Jan team's focus on execution stability is timely. As agents are about to become the main interaction paradigm of AI, Jan-v2-VL-Max may become a key piece for developers to build "never-failing" intelligent agents.

10 Billion Dollars! Blackstone Leads the Largest Financing for Australian AI Infrastructure Company Firmus

Australian AI infrastructure startup Firmus Technologies has secured a 10 billion dollar debt financing led by Blackstone Group, setting a record for private credit in the country. The funds will be used for the South Gate Project, aiming to build ultra-large-scale AI centers across Australia by 2028, with a target computing capacity of 1.6 gigawatts, marking an intensification of global competition in AI computing infrastructure.

State Administration for Market Regulation Releases 5 Typical Cases of Unfair Competition in the AI Field: Targeting 'Impostor' Models and Algorithm Theft

The State Administration for Market Regulation has released five typical cases of unfair competition in the field of artificial intelligence, involving illegal acts such as counterfeiting, false advertising, and infringement of trade secrets. Among them, Beijing Aolande and Hangzhou Boheng were fined for counterfeiting DeepSeek, aiming to curb 'free-riding' behavior and maintain a fair market competition order.

Zuckerberg Fights Back Against OpenAI! Meta Testing Independent App for Vibes: To Become an AI Version of TikTok and Directly Compete with Sora?

Meta confirmed this Thursday that it is testing an independent app for the AI video feature Vibes, targeting OpenAI's Sora. 2024 is the year of text-to-video, and 2026 may become the year of major confrontations. Vibes aims to create a short video platform with 'digital avatars for everyone,' becoming a key move for Meta in the AI video arena.

OpenAI Releases GPT-5.3-Codex: A Leap in Programming Efficiency, Marking the Beginning of the AI Peer Practice Era

Sam Altman, CEO of OpenAI, announced the release of the programming large model GPT-5.3-Codex, which has made breakthroughs in technical indicators and application, pushing AI-assisted programming into a new stage. It achieved 57% on the SWE-Bench Pro evaluation and performed well on TerminalBench2.0 and OSWorld evaluations.

Jan Team Releases Jan-v2-VL-Max! A 30B Multimodal Model Specializing in Long-Term Agent Tasks, Stable Execution of Long Sequences Outperforms Gemini 2.5 Pro

Related Recommendations

10 Billion Dollars! Blackstone Leads the Largest Financing for Australian AI Infrastructure Company Firmus

Mysterious AI Model Pony Alpha Exposed: Free High-Performance, a Possible Spoof of GLM-5?

State Administration for Market Regulation Releases 5 Typical Cases of Unfair Competition in the AI Field: Targeting 'Impostor' Models and Algorithm Theft

Zuckerberg Fights Back Against OpenAI! Meta Testing Independent App for Vibes: To Become an AI Version of TikTok and Directly Compete with Sora?

OpenAI Releases GPT-5.3-Codex: A Leap in Programming Efficiency, Marking the Beginning of the AI Peer Practice Era