Quick answer

AI DevOps engineers run the infrastructure behind frontier AI — GPU clusters, inference servers, eval pipelines, deployment automation. It's a hybrid of traditional DevOps, ML platform engineering, and infrastructure-as-code. Demand is brutal: every AI lab and most AI-heavy startups are hiring. Median 2026 US comp: $250-400k.

The role barely existed in 2023. By 2026 every serious AI team has at least one AI DevOps engineer (some call them ML Platform engineers). They handle the messy infra that lets ML researchers and product engineers ship.

What they actually do

  • Provision GPU clusters (H100, MI400, TPU v7) — Kubernetes, Slurm, Ray
  • Build inference serving stacks (vLLM, SGLang, TGI, Triton)
  • Operate eval pipelines (continuous quality regression tests against models)
  • Handle model deployment, A/B routing, gradual rollout
  • Optimise cost (right-sizing GPUs, caching, batch sizing, quantisation)
  • Set up observability (latency, cost, quality drift, safety eval drift)

Skills employers actually want

  • Kubernetes for ML workloads (KServe, KubeRay, GPU operators)
  • At least one of: vLLM, SGLang, TGI, Triton — production inference serving
  • Python (not just shell scripts — real Python)
  • Cost optimisation: GPU utilisation, kv-cache management, prompt caching
  • Eval engineering — building automated quality tests against LLM outputs
  • Observability tooling — Prometheus, Grafana, OpenTelemetry, custom AI observability

How to break in if you're currently DevOps

  • Run a frontier-class model locally (Llama 4 Behemoth on a workstation) — learn the pain points
  • Build an inference server from scratch — vLLM tutorial, deploy on a single H100
  • Write one eval pipeline — pick a model, define a benchmark, run it nightly
  • Read the SRE books for AI — Anthropic's blog, OpenAI's engineering posts
  • Contribute to an open-source ML infra project (vLLM, SkyPilot, Modal)

Salary by location

  • US Bay Area: $300-500k entry, $500-900k senior
  • US other: $220-380k entry, $380-700k senior
  • London: £110-200k entry, £180-320k senior
  • EU other: €90-160k entry, €150-280k senior
  • India (remote-friendly): ₹35-80L entry, ₹80-180L senior

The fastest path in if you're an existing DevOps engineer: pick one inference framework (vLLM is a safe bet), get it running well, and write publicly about what you learned. Demonstrate competence, then apply.

Bottom line

AI DevOps is a real career track in 2026, well-paid, in-demand, and accessible if you're already a competent infra engineer. Pick an inference framework, ship something, write about it. The bar is high but the rewards are higher.