After academia孵化多个现象级 AI 引擎, the core members of the open-source inference framework vLLM officially announced the establishment of a startup company Inferact. The company successfully completed a seed round funding of $150 million with a pre-money valuation of $800 million.

Robot, AI

Top Capital Support, Accelerating "Inference" Commercialization

This round of financing was led by Andreessen Horowitz (a16z) and Lightspeed Venture Partners. This move confirmed the market's previous speculation about vLLM's commercialization path and marks the shift of the AI industry's focus from "model training" to "application inference."

  • Technical Background: Inferact was incubated in the laboratory of Professor Ion Stoica from the University of California, Berkeley (co-founder of Databricks). Its core technology, vLLM, significantly improves the speed of large model operations and reduces energy consumption through innovative memory management technology.

  • Market Position: CEO Simon Mo said that the open-source version of vLLM has been widely adopted by major companies such as Amazon Web Services (AWS) and Amazon Shopping.

The "Berkeley Twins" in the Inference Sector

Inferact's launch followed closely behind RadixArk (originating from the commercialization of another well-known framework, SGLang). The latter recently secured funding led by Accel with a valuation of $400 million. Both companies come from the same Berkeley lab, and their consecutive investments reflect investors' willingness to invest heavily to secure faster and more cost-effective AI deployment technologies.