llm-inference — Guatu Labs Dev

May 25, 2026 · 8 min read · homelab

Tesla P40 in a Homelab: 24GB of Inference on a Budget

Running a Tesla P40 for LLM inference. Why I ditched GPU passthrough for host-level drivers to stop the constant Proxmox crashes.

tesla-p40nvidiaproxmoxollamallm-inferencegpu-monitoring