The Wafer-Scale Engine company has been working for 8 years to break into the top tier of AI computing. This deal might just assure its success.
We have been watching Cerebras for a long time, from its launch of the first wafer-scale engine for AI to its recent contributions to open large language model training and even building its own super computer, Andromeda. But at a rumored $2.5M a piece, success in the AI server business has been elusive. Sure, a small win here, a couple nodes there, but success in terms of a major data center win has been lacking. All that just changed.
Cerebras has announced it has won a deployment of 9 interconnected 4-Exaflop supercomputers with G42 (Group42), an artificial intelligence and cloud computing company that was founded in Abu Dhabi, United Arab Emirates in 2018. Cerebras Cloud will host and manage the G42-owned infrastructure forG42 internal applications. Both companies will resell excess capacity to other AI developers and users.
The first phase is already installed and providing 2 exaflops of AI, powered by 32 CS-2 systems in Sunnyvale, CA. Cerebras claims it only took 10 days to set up the hardware and software to enable its first training run, a testament to the push-button ease of deployment that has always been a strength of the Cerebras platform.
Big AI, Done Fast, Done Easy
To that point of easy deployment, Cerebras shared that it only requires 1200 lines of code to distribute an AI model across any number of CS-2 systems. This stands in sharp contrasts to Cerebras claim that it takes over 28,000 lines of code to distribute the same model across 1000’s of GPUs. While we have been unable to verify this claim, the point is that the work of distributed computing is done automatically by the company’s software and hardware, bit hand coded to a specific configuration.
The initial system will double in size in 10 weeks to 4-Exaflops and 54 million compute cores. Two additional US-Based 4-Exaflop systems have already been contracted, and will be linked together with the first (CG-1) instance for a total of 12-Exaflops. Then six additional CGs will be added next year to create what Cerebras believes will form the larges AI supercomputer in the world.
Why did G42 select Cerebras? In large part, the win is based on Cerebras fast hardware, support for hundreds of open-source foundation models, and the ease of deployment of additional distributed models, scaling out to a massive footprint. But we also suspect that hardware availability might have played into this as well, since the NVIDIA H100 is supposedly sold out for a year.
Conclusions
Cerebras has finally won a large cloud deal. A very large cloud deal. We suspect this deal amounts to hundreds of millions of dollars, at 256 CS-2 systems perhaps close to $.5B would be a decent guess. It is probably significantly larger than the sum of all the revenue generated by all AI startups to date.
Now the question this begs is clear: will other, more mainstream cloud providers follow suit, preferring to procure a CS-2 based cluster over waiting for NVIDIA’s backlog to clear? Waiting costs money, and missed opportunities, in the LLM land grab currently underway.
We had never heard of G42, and we suspect neither have you. But even if the Super 7 hyper-scalers decide to stick with what they have, there are probably a few more impatient CSP-wanna-be’s like G42 in the wings.
Disclosures: This article expresses the opinions of the author(s), and is not to be taken as advice to purchase from nor invest in the companies mentioned. Cambrian AI Research is fortunate to have many, if not most, semiconductor firms as our clients, including Blaize, Cadence Design, Cerebras, D-Matrix, Eliyan, Esperanto, FuriosaAI, Graphcore, GML, IBM, Intel, Mythic, NVIDIA, Qualcomm Technologies, Si-Five, SiMa.ai, Synopsys, and Tenstorrent. We have no investment positions in any of the companies mentioned in this article and do not plan to initiate any in the near future. For more information, please visit our website at https://cambrian-AI.com.
Read the full article here