Cloudflare Launches Kimi K2.5 on Workers AI, Slashing Inference Costs by 77% While Processing 7 Billion Tokens Daily
Cloudflare launches Kimi K2.5 on Workers AI, achieving a massive 77% cut in inference costs while processing over 7 billion tokens daily, bringing powerful frontier open-source AI capabilities including a 256k context window and vision inputs to its platform alongside new features like prefix caching and a redesigned async API.