AI software platform enables giant model training – eeNews Europe

Version 1.2 of the company’s Cerebras Software Platform, CSoft, features expanded support for PyTorch and TensorFlow. In addition, says the company, customers can now quickly and easily train models with billions of parameters via Cerebras’ weight streaming technology.
PyTorch, the leading machine learning framework, is used by developers to accelerate the path from research prototyping to production deployment. As model size increases and as transformer models become more popular, says the company, it is essential that machine learning practitioners have access to fast, easy to set up and use compute solutions like the Cerebras CS-2 AI system.
With the CS-2 running CSoft, says the company, the developer community has a powerful tool to enable new breakthroughs in AI.
“From the start, our goal was to seamlessly support whichever machine learning framework our customers wanted to write in,” says Emad Barsoum, Senior Director, AI Framework, at Cerebras Systems. “Our customers write in TensorFlow and in PyTorch, and our software stack, CSoft, makes it quick and easy to express your models in the framework of your choice. By doing so, our customers gain access to the 850,000 AI optimized cores and 40 Gigabytes of on-chip memory in the Cerebras CS-2.”
Claimed as the world’s fastest AI system, the Cerebras CS-2 is powered by the largest processor ever built – the Cerebras Wafer-Scale Engine 2 (WSE-2). The Cerebras WSE-2 delivers more AI optimized compute cores, more fast memory, and more fabric bandwidth than any other deep learning processor in existence, says the company.
Purpose built for AI work, the CS-2 runs CSoft, which enables machine learning practitioners to write their models in the opensource frameworks of TensorFlow or PyTorch and, without modification, run the model on the Cerebras CS-2. In fact, says the company, a model that was written for a graphics processing unit or a central processing unit can run under CSoft on the Cerebras CS-2 without any changes. With the CS-2 and CSoft, practitioners can seamlessly scale up from small models like BERT to the largest models in existence like GPT-3.
All material on this site Copyright © 2022 European Business Press SA. All rights reserved.

source
Connect with Chris Hood, a digital strategist that can help you with AI.

Leave a Reply

Your email address will not be published.

© 2022 AI Caosuo - Proudly powered by theme Octo