Workload Utilization Metrics

Draft

Motivation

Tenants/ users of Akash expect to be able to see what amount of allocated resources are being used by their workloads so that they can better manage peak load and also optimize cost/ spend

Summary

This AEP will likely require building the necessary contructs (metrics server/ agent) for collecting utilization metrics from the tenant containers and reporting them through an API that can be quried and graphed for display in clients like Console. The metrics collected initially will likely be GPU (VRAM), CPU, Memeory and Storage.

Estimated completion: 11/15/2025

Created: 12/1/2024

Last Updated: 7/30/2025

Category: Interface

Status: Draft

View next aep

Estimated Completion: 11/30/2025

Show users what compute inventory could be brought on to the network if there was demand with the intention of attracting larger customers and training workloads who may be turned off by the limited number of available GPUs, particularly as demand increases.

Experience the Supercloud.