#llm-inference

1 post

Tesla P40 in a Homelab: 24GB of Inference on a Budget

Tesla P40 in a Homelab: 24GB of Inference on a Budget

Running a Tesla P40 for LLM inference. Why I ditched GPU passthrough for host-level drivers to stop the constant Proxmox crashes.

← All tags