AI inference

Groq

Groq is the inference infrastructure that powers AI with the speed and cost it requires. Founded in 2016, the company…

 

Why the future of AI inference lies at the edge

By Stephane Henry, Group VP of Edge AI Solutions at STMicroelectronics, AI is becoming a transformative force shaping our everyday…

 

Gcore adds NVIDIA Dynamo to boost GPU efficiency and cut AI inference latency

Edge AI solutions provider Gcore has integrated NVIDIA Dynamo into its AI inference solutions, offering up to 6x higher GPU…

 

AI inference moves closer to the grid as smaller data centers take shape

EPRI, NVIDIA, Prologis and InfraPartners have revealed they are working together to create smaller scale (5-20MW) distributed data centers closer…

Nscale

Nscale is the Hyperscaler engineered for AI, offering high-performance compute optimised for training, fine-tuning, and intensive workloads. From our data…

 

Where AI inference will land: The enterprise IT equation

By Amir Khan, President, CEO & Founder of Alkira For technology leaders in the enterprise, the question of where compute…

 

SoftBank’s $4B DigitalBridge deal signals power play for distributed AI and edge infrastructure

SoftBank announced the acquisition of DigitalBridge Group for $4 billion to enhance AI infrastructure capabilities. DigitalBridge focuses on digital infrastructure,…

 

Cisco launches Unified Edge Platform to tackle AI’s on-site compute bottleneck

Cisco has unveiled the Unified Edge Platform, combining compute, networking, storage, and security to power real-time AI inferencing and agentic…

 

Submer founder launches InferX to tackle AI’s power and latency problem

InferX, a new company driving the speed and power to achieve “Age of Intelligence”, the next phase in the life…

 

Zenlayer expands edge infrastructure with distributed inference for global AI scaling

Hyperconnected cloud company Zenlayer recently released “Distributed Inference,” a global AI inference platform for high-performance processing at Tech Week in…

Sponsored Links

Avassa: Empowers companies to bridge the gap between modern containerized applications development and operations and distributed edge infrastructure. https://avassa.io/

DataBank: We believe there is a different edge to be served - the “middle edge" - that will become the first step for many in their journey to the edge. https://www.databank.com/

Latitude.sh: Where the power of bare metal meets the flexibility of the cloud. Deploy physical servers across 23 global locations in as little as 5 seconds. https://www.latitude.sh/

Zenlayer: A massively distributed edge cloud service provider operating over 270 PoPs around the world, with expertise in fast-growing emerging markets. https://www.zenlayer.com/

OnLogic: A global industrial PC manufacturer and solution provider focused on hardware for IoT and edge AI, OnLogic designs highly-configurable computers engineered for reliability. https://www.onlogic.com/

Featured Company

Latest News