According to AIbase, xAI has officially launched the Grok Voice Agent API, creating a dual storm of performance and cost-effectiveness in the real-time voice AI field at an extremely competitive price of just $0.05 per minute. This model leads in audio inference benchmark tests, achieving a response speed nearly five times faster than competitors with less than 1 second of initial audio delay.
Technically, Grok Voice Agent not only supports automatic detection and free switching of dozens of languages including Chinese, but also deeply integrates real-time web search and reasoning capabilities, enabling its responses to keep up with the latest online information. By supporting external tool calls, emotion control, and various voice options, developers can build AI agents that are expressive and capable of performing practical tasks.
Notably, this API is fully compatible with the OpenAI real-time API specifications, offering developers the possibility of seamless migration for high-performance, low-cost solutions. It marks a key step for Musk in challenging the industry landscape in the real-time conversation AI arena.
