QwenGemma
Qwen 3.6 Wins Benchmarks, Fails Reality: Benchmaxing Distorts AI Perception
Qwen 3.6 won benchmarks but lost to Gemma 4 in practice, burning 8000+ tokens in a loop. Benchmaxing distorts AI perception; firms must shift to real-
4h ago·2 min read