FitMyGPU

Blog

Inference notes and model updates

Notes on model releases, inference changes, and how the calculator works.

2026-05-10

How inference VRAM is calculated

A practical breakdown of the terms that usually matter for inference memory: weights, KV cache, state, and runtime reserve.