Deep Learning

485 articles found

397-Billion Parameter AI Model Runs on MacBook Pro With 48GB RAM at 4.4 Tokens Per Second Using Custom C/Metal Engine

397-Billion Parameter AI Model Runs on MacBook Pro With 48GB RAM at 4.4 Tokens Per Second Using Custom C/Metal Engine

Mar 23, 2026
GitHub

A custom C/Metal inference engine called Flash-MoE is now running a massive 397-billion parameter AI model on a standard MacBook Pro with 48GB RAM, streaming 209GB directly from SSD at 4.4 tokens per second — with 58 documented experiments revealing that Apple Silicon's unified memory architecture defies conventional optimization wisdom.

Microsoft Launches MAI-Image-2, Debuting Third on AI Image Leaderboard Behind Google and OpenAI

Microsoft Launches MAI-Image-2, Debuting Third on AI Image Leaderboard Behind Google and OpenAI

Mar 22, 2026
The Deep View

Microsoft launches MAI-Image-2, a powerful next-generation text-to-image AI model debuting at third place on the Arena.ai leaderboard, offering hyper-detailed photorealism, legible text generation, and accurate skin tones, now available in preview on the MAI Playground with Copilot and Bing Image Creator rollout underway.

Cloudflare Launches Kimi K2.5 on Workers AI, Slashing Inference Costs by 77% While Processing 7 Billion Tokens Daily

Cloudflare Launches Kimi K2.5 on Workers AI, Slashing Inference Costs by 77% While Processing 7 Billion Tokens Daily

Mar 22, 2026
The Cloudflare Blog

Cloudflare launches Kimi K2.5 on Workers AI, achieving a massive 77% cut in inference costs while processing over 7 billion tokens daily, bringing powerful frontier open-source AI capabilities including a 256k context window and vision inputs to its platform alongside new features like prefix caching and a redesigned async API.

Billion-Dollar Race to Build World Models Accelerates as AI Hits Physical Reality Limits

Billion-Dollar Race to Build World Models Accelerates as AI Hits Physical Reality Limits

Mar 22, 2026
Venturebeat

A billion-dollar race to build world models is accelerating as AI giants and startups pour massive funding into technology that can simulate physical reality, with AMI Labs raising $1.03 billion and World Labs securing $1 billion to overcome the critical limits large language models face in robotics and autonomous driving.

Previous
Page 3 of 49
Next
Showing 21 - 30 of 485 articles