NVIDIA's Leadership and Near-Monopoly in Artificial Intelligence Technology Are 'Staggering'


At the 2025 GTC conference, NVIDIA introduced the 'Omniverse DSX Blueprint,' a design tailored for gigawatt-scale AI data centers, which the company calls 'AI factories.' Built on the Omniverse framework, it supports facilities ranging from 100 megawatts to 1 gigawatt. It is intended for efficiently training and running large AI models to meet growing demand for AI computing, and marks a significant step forward in AI infrastructure.
OpenAI CEO Sam Altman has said explicitly for the first time that an IPO is the company's most likely path to going public. As AI competition enters an asset-heavy era, OpenAI is making unprecedented investments in capital and computing power to build the next generation of AI infrastructure. Altman said the exponential growth of the business makes an IPO all but inevitable, and that it would give investors worldwide a chance to participate in the AI revolution.
NVIDIA released OmniVinci, an omni-modal understanding model that leads top models by 19.05 points in benchmark tests. The model was trained on only 0.2 trillion tokens, making it six times as data-efficient as competitors. It aims for unified understanding of vision, audio, and text, advancing machines' multimodal cognitive capabilities.
Anthropic launched Claude for Excel, designed for financial professionals and currently in research preview. Users interact with the AI assistant directly in the Excel sidebar, where it can read, analyze, and modify workbooks. All changes are tracked and explained, which is meant to improve efficiency in financial services work.
NVIDIA released the multimodal understanding model OmniVinci, which outperformed top models by 19.05 points in benchmark tests. The model achieves this performance with only one-sixth of the training data. It aims to let AI systems understand vision, audio, and text simultaneously, much as humans perceive the world through multiple senses.