ai infrastructure

The stack we build to train on.

Self-hosting math-native compiler, derivation-first OS, open-source agent runtimes, validated GPU training stack — built bottom-up, exercised in parallel across Blackwell, Hopper, and ARM-DGX.

QKVsoftmaxQ · Kᵀ / √dout

Workshop — hardware actually exercised

B300
Blackwell SXM6 — heaviest tier
H200
Hopper 141 GB — single-GPU primary
H100
Hopper 80 GB — multi-GPU DDP
GB10
ARM-DGX local #1 — gx10
GB10
ARM-DGX local #2

Cloud and local in parallel. The same kernels run across Blackwell, Hopper, and ARM-DGX — no vendor lock. Every launch goes through a pre-flight memory + cost gate before the meter starts. No fire-and-forget pods.