DeepSeek Unveils Compact AI Model That Rivals Larger Rivals on Math Tasks

May 29, 2025

TechCrunch

Article image for DeepSeek Unveils Compact AI Model That Rivals Larger Rivals on Math Tasks

Summary

DeepSeek's distilled new R1 AI model, DeepSeek-R1-0528-Qwen3-8B, can run on a single GPU, outperforming comparable models on math benchmarks while being less computationally demanding, making it accessible for research and industrial development focused on small-scale models.

Key Points

DeepSeek released a smaller, distilled version of its new R1 AI model called DeepSeek-R1-0528-Qwen3-8B that can run on a single GPU.
The distilled model outperforms comparably sized models like Google's Gemini 2.5 Flash on certain math benchmarks like AIME 2025.
DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license and can be used commercially without restriction.

DeepSeek Unveils Compact AI Model That Rivals Larger Rivals on Math Tasks

Summary

Key Points

Tags