ben on Nostr: what kind of gpu and how much vram in it? afaik running inference is constrained by ...what kind of gpu and how much vram in it? afaik running inference is constrained by gpu vram. 8B models run very fast on my 8GBvram gpu