Back to home
quantization
2 articles tagged with this topic
Qwen3.5GGUF
Qwen3.5-9B GGUF Quant Rankings: Q8_0 Dominates KLD Scores
KLD benchmarks across community GGUF quants show Q8_0 variants cluster near 0.001 KLD, with quality degrading shar ply below Q5.
Apr 143 min read
llama.cppGemma 4
Gemma 4 26B: Q8 mmproj Unlocks 60K+ Context With Vision
Switching from F16 to Q8_0 mmproj on Gemma 4 26B adds ~30K context tokens with no vision quality loss.
Apr 62 min read