TPU inference servers for efficient edge data centers

This whitepaper by Unigen explores the concept of developing data centers that are solely focused on AI inference.

Up to 90% of AI operations are inference, versus 10% for training. Training requires specialized processing to create the neural networks that are then used for inference operations, and it is the primary driver of the power requirements mentioned by the IEA. Inference, on the other hand, can be done far more power-efficiently.

The benefits of developing inference-only data centers can be significant:

– Reduced initial cost for inference servers compared with training servers
– Reduced Total Cost of Ownership (TCO) over the lifetime of inference servers
– Inference servers with TPUs can be air-cooled, avoiding expensive and difficult-to-deploy liquid cooling schemes
– Data centers with air-cooled servers use far fewer resources, reducing strain on local power and water

This whitepaper compares the requirements of training servers and inference servers for cooling, electrical systems, HVAC, power, and infrastructure.
