diffusion language modelsAI inference costs
Diffusion Language Models 270% Faster: New Variables in AI Inference Cost War
NUS DMax model boosts diffusion language model parallel decoding efficiency by 3x. If this tech matures, AI inference costs plummet, forcing API-billi
Apr 10·2 min read