NVIDIA's NVFP4 Gemma-4-26B shrinks to 18.8 GB for consumer GPUs with less than 0.7% accuracy loss. 4-bit is now optimal, but it also brings ecosystem lock-in.