GPU
- NVIDIA GPU with at least 16 GB of VRAM (Ampere or newer recommended)
- Driver: NVIDIA 555 or later, CUDA 12.4+
- AMD MI300X / MI325 supported via ROCm 6.2+ (beta)
- Apple Silicon supported for selected models on M3 Max / M4 Pro and above (preview)
The host runtime needs only a single GPU, but it requires exclusive use of that GPU and will not share it with other workloads. Exclusive access keeps both you and customers safe, and it avoids the long-tail latency that comes with shared VRAM.
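For NVIDIA hosts, the VRAM and driver requirements above can be checked by hand before installing anything. The following is a minimal sketch, not part of the perchy CLI: it parses the output of `nvidia-smi --query-gpu=name,memory.total,driver_version --format=csv,noheader,nounits` and compares each GPU against the 16 GB and driver 555 thresholds. The function names and the slightly relaxed VRAM threshold are assumptions made for illustration, and the CUDA version is not checked.

```python
import subprocess

# Assumption: nominal 16 GB cards often report a little under 16384 MiB,
# so this sketch uses a slightly relaxed threshold.
MIN_VRAM_MIB = 15 * 1024
MIN_DRIVER_MAJOR = 555  # "NVIDIA 555 or later" from the requirements list


def parse_gpus(csv_text):
    """Parse nvidia-smi CSV output (noheader, nounits) into a list of dicts."""
    gpus = []
    for line in csv_text.splitlines():
        if not line.strip():
            continue
        # Split from the right so a comma inside the GPU name cannot break parsing.
        name, mem_mib, driver = (field.strip() for field in line.rsplit(",", 2))
        gpus.append({"name": name, "vram_mib": int(mem_mib), "driver": driver})
    return gpus


def meets_requirements(gpu):
    """Return True if the GPU passes the VRAM and driver-version thresholds."""
    driver_major = int(gpu["driver"].split(".")[0])
    return gpu["vram_mib"] >= MIN_VRAM_MIB and driver_major >= MIN_DRIVER_MAJOR


def query_gpus():
    """Run nvidia-smi and return the parsed GPU list."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=name,memory.total,driver_version",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpus(result.stdout)


if __name__ == "__main__":
    try:
        for gpu in query_gpus():
            status = "ok" if meets_requirements(gpu) else "below requirements"
            print(f"{gpu['name']}: {gpu['vram_mib']} MiB, driver {gpu['driver']} -> {status}")
    except (FileNotFoundError, subprocess.CalledProcessError):
        print("nvidia-smi is not available or failed; cannot check NVIDIA GPUs")
```

This only covers the NVIDIA path. AMD and Apple Silicon hosts would need a different query tool, and the self-test remains the authoritative check.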
Once you've installed the CLI, run the self-test, which takes about five minutes. It checks outbound connectivity and GPU detection, then runs a small test inference. No traffic is routed to you until you opt in.
npx perchy host check --verbose