NVIDIA Launches PersonaPlex-7B-v1: A Full-Duplex Model That Redefines Real-Time Voice Interaction

NVIDIA's research team has officially released PersonaPlex-7B-v1, a full-duplex speech-to-speech dialogue model. It breaks from the rigid "listen once, respond once" pattern of traditional AI voice assistants and aims for a conversation experience closer to natural human interaction.

Unlike previous architectures that chain multiple stages, namely ASR (speech-to-text), an LLM (large language model), and TTS (text-to-speech), PersonaPlex uses a single Transformer to handle both speech understanding and speech generation. AIbase learned that this end-to-end design significantly reduces response latency and lets the AI handle natural interruptions, overlapping speech, and immediate feedback. Put simply, it behaves more like a human conversation partner: the AI keeps listening while it speaks, and if the user suddenly interrupts, it can respond quickly.
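To make the "listens while speaking" idea concrete, here is a minimal toy sketch of a full-duplex loop. It is not NVIDIA's implementation or API: the class ToyDuplexAgent, the function run_dialogue, and the use of None as a silence marker are all hypothetical names chosen for illustration. The only point it shows is that the agent consumes one frame of user input on every step, even while it is emitting its own output, so an interruption takes effect immediately instead of waiting for the end of a turn.

```python
from dataclasses import dataclass, field
from typing import List, Optional

SILENCE = None  # hypothetical marker: the user said nothing in this frame


@dataclass
class ToyDuplexAgent:
    """Toy stand-in for a full-duplex model (illustrative, not PersonaPlex)."""
    reply: List[str]                                 # tokens the agent plans to say
    position: int = 0                                # index of the next reply token
    heard: List[str] = field(default_factory=list)   # user frames received so far

    def step(self, user_frame: Optional[str]) -> Optional[str]:
        """Consume one user frame and emit at most one agent token."""
        if user_frame is not SILENCE:
            # The user is speaking: record the frame and stop talking at once.
            self.heard.append(user_frame)
            self.position = len(self.reply)  # abandon the rest of the reply
            return None
        if self.position < len(self.reply):
            token = self.reply[self.position]
            self.position += 1
            return token
        return None  # nothing left to say


def run_dialogue(agent: ToyDuplexAgent,
                 user_frames: List[Optional[str]]) -> List[Optional[str]]:
    """Feed user frames one at a time; input and output share each time step."""
    return [agent.step(frame) for frame in user_frames]


if __name__ == "__main__":
    agent = ToyDuplexAgent(reply=["Your", "order", "ships", "on", "Monday"])
    # The user stays silent for two frames, then interrupts.
    spoken = run_dialogue(agent, [SILENCE, SILENCE, "wait", "stop", SILENCE])
    print(spoken)       # ['Your', 'order', None, None, None]
    print(agent.heard)  # ['wait', 'stop']
```

In a half-duplex pipeline, by contrast, the agent would finish all five reply tokens before reading any user input. A real model would of course generate a new response after the interruption rather than simply going quiet; the sketch leaves that out to keep the timing behavior visible.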

Additionally, the model performs well in personalization control. Through dual "speech + text" guidance, users can not only define the AI's role and background but also precisely control its tone and intonation. AIbase learned that NVIDIA combined large volumes of real call data with synthetic scenarios during training, so the model picks up natural speaking habits while strictly adhering to industry-specific business rules. Current evaluation results show that PersonaPlex-7B-v1 outperforms most open-source and closed-source systems in dialogue fluency and task completion rate.

Research: https://research.nvidia.com/labs/adlr/personaplex/

Key Points:

🎙️ Full-duplex Interaction: PersonaPlex-7B-v1 processes speech streams in real time, so users can interject or talk over the AI while it is speaking and still get a rapid response.
🧠 Single Model Architecture: It abandons the complicated multi-stage pipeline and uses a single Transformer to predict text and speech tokens simultaneously, improving the naturalness of dialogue from the ground up.
🎭 Deep Personalization: It supports system prompts of up to 200 tokens and specific speech embeddings, enabling flexible customization of the AI's personality, business knowledge, and emotional tone.
