Inflection AI Ditches Nvidia for Intel Gaudi 3 Accelerators!

Inflection AI has made a striking decision for its latest enterprise platform: it is abandoning Nvidia's GPUs in favor of Intel's Gaudi 3 accelerators. The move marks a strategic shift for the company, whose earlier consumer-facing Pi application ran entirely on Nvidia GPUs. Inflection 3.0 will instead rely on Gaudi 3, and customers can choose to run it on-premises or in Intel's Tiber AI Cloud.

Inflection AI was founded in 2022 and initially focused on developing a conversational personal assistant named Pi. After co-founders Mustafa Suleyman and Karén Simonyan left for Microsoft in spring 2024, the company shifted its focus to building custom fine-tuned models for enterprises, using customer data to improve the quality of its service.

Inflection 3.0 is the latest version of the platform. It aims to tailor AI applications to individual enterprises by fine-tuning models on their proprietary datasets. Notably, Intel itself will be one of the first customers of the service, which has prompted speculation about whether Inflection is paying full price for the accelerators.

Although Inflection plans to run its services on Gaudi 3 accelerators, it does not appear that the company will be deploying its own hardware anytime soon. Like Inflection 2.5, the latest version will be hosted in the cloud, in this case on Intel's Tiber AI Cloud service. Inflection recognizes, however, that some customers want to keep their data on-premises, so it plans to offer physical systems based on Intel's AI accelerators starting in the first quarter of 2025.

One benefit of Gaudi 3 for Inflection is a significant improvement in price-performance. Sean White, CEO of Inflection AI, wrote in a blog post that by using Intel's technology the company has seen up to 2x better price-performance than current competing products. Intel also claims that Gaudi 3 is faster than Nvidia's H100 in both training and inference, at a lower cost.

Gaudi 3's specifications are strong as well: 128 GB of HBM2e memory, up to 3.7 TB/s of memory bandwidth, and 1,835 teraFLOPS of dense FP8 or BF16 performance. At 16-bit precision, its peak floating-point performance is almost twice that of the H100, which matters for Inflection's focus on training and fine-tuning workloads.
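As a rough check on the "almost twice" figure, the short sketch below divides the article's 1,835 teraFLOPS by a baseline for the H100. The H100 number is an assumption that does not appear in this article: it is the roughly 989 teraFLOPS of dense BF16 that Nvidia publishes for the H100 SXM part. The helper name `peak_ratio` is purely illustrative, and peak figures say nothing about real-world throughput.

```python
# Peak dense BF16 throughput, in teraFLOPS.
GAUDI3_BF16_TFLOPS = 1835.0  # figure quoted in the article
H100_BF16_TFLOPS = 989.0     # ASSUMPTION: Nvidia's published dense BF16 peak for H100 SXM


def peak_ratio(candidate_tflops: float, baseline_tflops: float) -> float:
    """Return how many times larger the candidate's peak throughput is than the baseline's."""
    if baseline_tflops <= 0:
        raise ValueError("baseline throughput must be positive")
    return candidate_tflops / baseline_tflops


ratio = peak_ratio(GAUDI3_BF16_TFLOPS, H100_BF16_TFLOPS)
print(f"Gaudi 3 vs. H100, peak dense BF16: {ratio:.2f}x")  # prints 1.86x
```

Under that assumption the ratio comes out at about 1.86, consistent with "almost twice". The comparison holds only for 16-bit precision and only for vendor-quoted peak numbers.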

Intel also recently announced that IBM will deploy Gaudi 3 accelerators on its cloud platform, with a launch planned for early 2025. This suggests that Gaudi 3 is gradually gaining market recognition.

Key Points:
🌟 Inflection AI has decided to abandon Nvidia GPUs in favor of Intel's Gaudi 3 accelerators.
🚀 Inflection 3.0 will run on Gaudi 3, providing customized AI applications for enterprises.
💰 With Gaudi 3, Inflection AI reports up to 2x better price-performance.
