NVIDIA
2 articles tagged with this topic
NVIDIA · Gemma
NVIDIA NVFP4 Puts 26B Model on Consumer GPU With Under 1% Accuracy Loss
NVIDIA's NVFP4 version of Gemma-4-26B shrinks to 18.8GB, small enough for consumer GPUs, with <0.7% accuracy loss. 4-bit is now optimal, but it also means ecosystem lock-in.
3h ago · 2 min read
NVIDIA · AITune
NVIDIA Open-Sources AITune: Auto-Selects Fastest PyTorch Inference Backend
NVIDIA releases AITune, an open-source toolkit that benchmarks and auto-picks the optimal inference backend for PyTorch models.
Apr 12 · 3 min read