Hardware Verification using Trusted Execution

Motivation

Currently, hardware offered by providers on Akash is verified by a decentralized network of Auditors. While this approach is practical for a limited set of providers, manual verification is proving challenging at scale, a problem that becomes more critical when incentives go onchain and are distributed without a human in the loop. Hardware Verification using Trusted Execution minimizes the trust required to verify the accuracy of the hardware that providers offer on the Akash network, and it serves as a fundamental building block for enabling Confidential Computing capabilities, as detailed in AEP-65.

Summary & Background

Hardware Verification is the process of verifying that a specific CPU or GPU is what the provider claims it to be. In the context of Confidential Computing, this is achieved through an attestation process using a Trusted Authority.

Attestation Process

The attestation process with a Trusted Authority is described in the IETF’s Remote ATtestation procedureS (RATS) Architecture, RFC 9334, and can be outlined in the following block diagram. In this diagram, the “Attester” is the software running on the device (typically the CPU/GPU), the “Relying Party” is the client (typically the application developer), and the “Reference Value Provider” is the vendor (NVIDIA, Intel, AMD, etc.).

Attestation High-Level Flow

At a high level, the attestation process involves three main steps:

1. Measurement Collection

The system gathers cryptographic measurements from the hardware platform — including CPU, GPU, firmware, bootloader, and drivers. These measurements serve as a unique fingerprint of the environment, rooted in hardware (e.g., via Intel TDX, AMD SEV-SNP, or NVIDIA NVTrust). These may include:

Platform identity (vendor, model, firmware version)
Enclave or VM launch measurements
Device-specific attestation evidence (e.g., GPU certificate chain)
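The shape of the collected evidence can be sketched as a small data structure. This is an illustrative sketch only: the `Evidence` class, its field names, and `collect_evidence` are hypothetical and do not correspond to any vendor SDK. In a real system the values are produced and signed by the TEE hardware rather than set in software. The nonce is an assumption added here to show how evidence is commonly tied to a single verification request.

```python
import hashlib
import json
import secrets
from dataclasses import asdict, dataclass, field


@dataclass
class Evidence:
    """Hypothetical evidence bundle; field names are illustrative only."""
    nonce: str                # freshness value supplied by the verifier
    platform: dict            # vendor, model, firmware version
    launch_measurement: str   # hex digest of the enclave or VM launch state
    device_cert_chain: list = field(default_factory=list)  # e.g. GPU certificates

    def digest(self) -> str:
        """Return SHA-384 over a canonical JSON encoding of the bundle."""
        canonical = json.dumps(asdict(self), sort_keys=True, separators=(",", ":"))
        return hashlib.sha384(canonical.encode("utf-8")).hexdigest()


def collect_evidence(nonce: str) -> Evidence:
    # Placeholder values: real measurements come from the hardware root of trust.
    return Evidence(
        nonce=nonce,
        platform={"vendor": "ExampleVendor", "model": "ExampleModel", "firmware": "1.0.0"},
        launch_measurement=hashlib.sha384(b"example-vm-image").hexdigest(),
        device_cert_chain=["<certificate placeholder>"],
    )


nonce = secrets.token_hex(16)
evidence = collect_evidence(nonce)
print(evidence.digest())
```

Because the nonce is part of the hashed bundle, evidence collected for one request yields a different digest from evidence collected for another, which is what lets a verifier reject replayed evidence.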

2. Verification

The collected evidence is sent to a remote verifier, either a vendor-provided service (e.g., Intel Trust Authority, AMD Attestation Service, NVIDIA NVTrust CA) or a custom verifier (sometimes called a “local verifier”).

The verifier performs the following functions:

Authenticates the hardware’s cryptographic identity
Compares measurements against a set of trusted baseline values (aka “golden measurements”)
Validates integrity and authenticity of the platform state
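The comparison against golden measurements can be sketched as follows. This is a minimal sketch under stated assumptions: the component names and digest values in `GOLDEN` are made up, and `verify_measurements` is a hypothetical helper. It covers only the comparison step. Authenticating the hardware identity (signature and certificate chain validation) is left out.

```python
import hmac

# Hypothetical reference values ("golden measurements"), keyed by component name.
GOLDEN = {
    "firmware": "a1" * 48,
    "bootloader": "b2" * 48,
    "gpu_driver": "c3" * 48,
}


def verify_measurements(reported: dict, golden: dict) -> list:
    """Return the names of components whose measurement is missing or differs."""
    mismatches = []
    for name, expected in golden.items():
        actual = reported.get(name, "")
        # Constant-time comparison, so timing does not reveal where the digests differ.
        if not hmac.compare_digest(actual.encode("ascii"), expected.encode("ascii")):
            mismatches.append(name)
    return mismatches


reported = dict(GOLDEN, bootloader="00" * 48)  # simulate a modified bootloader
print(verify_measurements(reported, GOLDEN))
```

An empty list means every component matched its reference value. Any entry in the list identifies a component that should cause verification to fail.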

3. Policy Enforcement

Based on the result of verification, an attestation policy is evaluated to determine if the workload should proceed. The policy might check for the following things:

Is the platform from an approved vendor/model?
Are all firmware and drivers up-to-date?
Was the workload launched in a verified TEE?

The outcome is a binary verdict (e.g., Attestation OK or Rejected) which can be used to:

Gate access to secrets or encrypted data
Approve running a sensitive workload
Trigger alerts or block execution in untrusted environments
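A policy of this kind can be sketched as a function that reduces the verification result to a binary verdict and then gates a secret on it. The `VerificationResult` fields, the `APPROVED_PLATFORMS` set, and both function names are assumptions chosen for illustration. An actual policy would be defined by the relying party.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VerificationResult:
    """Hypothetical summary of what the verifier established."""
    vendor: str
    model: str
    firmware_up_to_date: bool
    launched_in_tee: bool


# Assumed allow-list of (vendor, model) pairs.
APPROVED_PLATFORMS = {("NVIDIA", "H100"), ("NVIDIA", "H200")}


def evaluate_policy(result: VerificationResult) -> bool:
    """Return True (Attestation OK) only if every policy check passes."""
    return bool(
        (result.vendor, result.model) in APPROVED_PLATFORMS
        and result.firmware_up_to_date
        and result.launched_in_tee
    )


def release_secret(result: VerificationResult, secret: bytes) -> bytes:
    """Gate access to a secret on the attestation verdict."""
    if not evaluate_policy(result):
        raise PermissionError("Attestation rejected")
    return secret


good = VerificationResult("NVIDIA", "H100", True, True)
print("Attestation OK" if evaluate_policy(good) else "Rejected")
```

Raising an exception on rejection, rather than returning an empty value, makes it harder for a caller to proceed without noticing that attestation failed.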

Vendor SDKs

NVTrust SDK

NVIDIA provides the NVTrust SDK, which abstracts much of the complexity involved in attesting NVIDIA GPUs (primarily H100s and NVSwitches) for trusted execution. The SDK provides abstractions for gathering evidence (aka measurements) as well as a verifier, the NVIDIA Remote Attestation Service (NRAS), that plugs into NVIDIA’s internal build pipeline (to obtain “golden measurements” through the RIM service). For reference, see the NRAS documentation and API.

This is what attestation with the NVIDIA SDK looks like at a high level:

NVTrust Attestation

Intel Trust Authority SDK

GPUs do not operate standalone. They are typically part of a server that includes a CPU (along with memory, storage, and other components), and the application typically executes on that CPU, with the AI model then loaded into GPU memory for inference, training, or fine-tuning. The attestation must therefore encompass the CPU, the GPU, and the interface between them. To make this easy for customers, Intel has an SDK of its own that plugs into the NVTrust SDK and enables attestation of the whole system, with SDKs available in Python and Go.

Intel Attestation
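The idea of whole-node attestation can be sketched independently of any SDK: the node is trusted only if every required device was verified and all results belong to the same attestation request. The sketch below does not use the Intel or NVIDIA SDKs. `DeviceResult`, `attest_node`, and the use of a shared nonce to bind results together are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceResult:
    """Hypothetical per-device attestation result."""
    device: str     # "cpu" or "gpu"
    nonce: str      # nonce echoed back in the attestation result
    verified: bool  # verdict from that device's verifier


def attest_node(results, expected_nonce, required=("cpu", "gpu")) -> bool:
    """Trust the node only if all required devices passed for the same nonce."""
    seen = {r.device for r in results}
    if not set(required) <= seen:
        return False  # a required device produced no result at all
    return all(r.verified and r.nonce == expected_nonce for r in results)


results = [DeviceResult("cpu", "n1", True), DeviceResult("gpu", "n1", True)]
print(attest_node(results, "n1"))
```

Checking the nonce on every result prevents a valid GPU result from an earlier session being combined with a fresh CPU result.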

Scope of Work

The scope of work of this AEP is to test and document the hardware and BIOS configuration necessary to perform attestation, so that it can guide Akash Providers and support the larger Confidential Computing goal.

To that end, the following will need to be done:

Obtain or set up a provider with a GPU node or cluster that has the TEE capable hardware as noted in the following section
Apply BIOS configuration to allow access to the device nodes
Verify (manually) that attestation can be performed for the whole node

TEE Capable CPUs

Vendor	Feature	Required Models
Intel	TDX (Trust Domain Extensions)	Intel Xeon Scalable CPUs such as 4th Gen “Sapphire Rapids” or 5th Gen “Emerald Rapids” (with TDX BIOS support)
Intel	SGX (Software Guard Extensions)	Intel Xeon E3, Xeon D, and select 10th–11th Gen Core CPUs (now deprecated by Intel)
AMD	SEV	AMD EPYC “Rome” (7002 series)
AMD	SEV-ES / SNP	AMD EPYC “Milan” (7003) and “Genoa” (9004) series

TEE Capable GPUs

Vendor	Feature	Required Models
NVIDIA	NVTrust	NVIDIA H100 or H200 (Hopper architecture) with CC-on mode
AMD/Intel	None yet	No current support for GPU-based TEEs (CPU-side only)

In summary, a Provider node needs a CPU from one of the families below (Intel or AMD) together with a supported NVIDIA GPU:

Intel CPUs with TDX (e.g., Xeon Sapphire Rapids)
AMD CPUs with SEV-SNP (e.g., EPYC Milan/Genoa)
NVIDIA H100 or H200 GPUs (for NVTrust support)
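The summary above can be expressed as a simple eligibility check. This is a sketch under assumptions: the feature and model strings are illustrative labels rather than identifiers reported by any tool, and the rule that a node needs at least one GPU, with every GPU being a supported model, is one possible reading of the requirement.

```python
# Assumed labels for the CPU TEE features and GPU models listed above.
SUPPORTED_CPU_TEES = {"TDX", "SEV-SNP"}
SUPPORTED_GPU_MODELS = {"H100", "H200"}


def node_is_eligible(cpu_tee: str, gpu_models: list) -> bool:
    """Return True if the CPU TEE and every GPU on the node are supported."""
    cpu_ok = cpu_tee.upper() in SUPPORTED_CPU_TEES
    gpus_ok = bool(gpu_models) and all(
        model.upper() in SUPPORTED_GPU_MODELS for model in gpu_models
    )
    return cpu_ok and gpus_ok


print(node_is_eligible("TDX", ["H100", "H100"]))
print(node_is_eligible("SEV", ["H100"]))
```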

TEE Enabled Host Kernel & BIOS configuration

BIOS configuration changes are needed to enable TDX/SGX (for Intel) and SEV (for AMD). These features typically also require a minimum version of the Linux kernel.
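Once the BIOS and kernel are configured, one way to check the result on a host is to read the KVM module parameters exposed under sysfs. The sketch below assumes the parameter paths listed in `TEE_PARAMS`. Their presence depends on the kernel version and on vendor patches, so they should be confirmed against the vendor references below before relying on them. `detect_tee_support` is a hypothetical helper name.

```python
from pathlib import Path

# Assumed sysfs locations; they vary with kernel version and vendor patches.
TEE_PARAMS = {
    "AMD SEV": "module/kvm_amd/parameters/sev",
    "AMD SEV-ES": "module/kvm_amd/parameters/sev_es",
    "AMD SEV-SNP": "module/kvm_amd/parameters/sev_snp",
    "Intel TDX": "module/kvm_intel/parameters/tdx",
}


def detect_tee_support(sysfs_root: str = "/sys") -> dict:
    """Map each TEE feature to True if its KVM parameter reports it enabled."""
    status = {}
    for feature, relative_path in TEE_PARAMS.items():
        try:
            value = (Path(sysfs_root) / relative_path).read_text().strip()
        except OSError:
            value = ""  # parameter absent: module not loaded or feature unsupported
        status[feature] = value in ("Y", "1")
    return status


for feature, enabled in detect_tee_support().items():
    print(f"{feature}: {'enabled' if enabled else 'not detected'}")
```

A result of "not detected" only means the parameter was absent or disabled on this host. It does not distinguish between missing hardware support, a BIOS setting that is off, and a kernel that is too old.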

References

Intel: Enable memory encryption, TDX and SGX for Intel
AMD: Enable AMD SEV