New Open-Source Engine Brings DeepSeek V4 Flash Local Inference to Apple Silicon Macs
A new open-source engine called ds4.c launches Metal-only local inference for DeepSeek V4 Flash on Apple Silicon Macs, featuring 2-bit quantization, disk KV cache persistence, and a 1-million-token context window — making powerful AI inference possible on MacBooks and Mac Studios with 128GB or more of RAM.