The original post: /r/localllama by /u/TechNerd10191 on 2025-01-06 15:20:42.

As per this article the 5090 is rumored to have 1.8 TB/s memory bandwidth and 512 bit memory bus - which makes it better than any professional card except A100/H100 which have HBM2/3 memory, 2 TB/s memory bandwidth and 5120 bit memory bus.

Even though the VRAM is limited to 32GB (GDDR7), it could be the fastest for running any LLM <30B at Q6.