MLX
4 articles tagged with this topic
MLX, Qwen3.5
DFlash speculative decoding on Apple Silicon: 4.1x on Qwen3.5-9B, now open source (MLX, M5 Max)
Open-source DFlash achieves a 4.13x speedup on Qwen3.5-9B using MLX on an M5 Max, with an 89.4% token acceptance rate.
Apr 13 · 4 min read
CoreML, Apple-Intelligence
Apple's On-Device AI Moat: What It Means for Edge Builders
Apple's privacy-first, on-device AI stack may become the default for builders who need inference without cloud costs.
Apr 13 · 4 min read
Gemma 4, MLX
Gemma 4 audio with MLX
Google's Gemma 4 E2B model can transcribe audio locally on macOS using MLX and a single uv run command.
Apr 13 · 3 min read
Hitoku, Gemma-4
Hitoku, open-source local macOS context aware assistant with Qwen3.5/Gemma4
An open-source macOS assistant that runs Gemma 4 and Qwen 3.5 fully on-device, with screen and document context.
Apr 13 · 3 min read