Skip to content
Skip to documentation content
Browse documentation
In progress Page being written

Air-gapped install

Deploy LLMKube in environments with no internet access: pre-staged GGUF files, private registries, and locked-down cluster networking.

What this page will cover

  • Pre-downloading GGUF models on a workstation and copying them to your cluster.
  • Using file:// and pvc:// source URIs in the Model CR instead of HTTPS.
  • Mirroring the operator and runtime images into a private registry.
  • Verifying the deployment when DNS, egress, and Hugging Face are all blocked.
Read the source on GitHub
LLMKube LLMKube

Kubernetes for Local LLMs. Deploy, manage, and scale AI inference workloads with production-grade orchestration.

© 2026 Defilan Technologies LLC

Community

Built for the Kubernetes and AI communities

LLMKube is not affiliated with or endorsed by the Cloud Native Computing Foundation or the Kubernetes project. Kubernetes® is a registered trademark of The Linux Foundation.