Cloud Computing

849 articles found

Xiaomi Hits 1,000 Tokens Per Second on Trillion-Parameter AI, Claiming 15x Speed Advantage Over ChatGPT and Claude

Xiaomi Hits 1,000 Tokens Per Second on Trillion-Parameter AI, Claiming 15x Speed Advantage Over ChatGPT and Claude

Jun 09, 2026
Decrypt

Xiaomi and TileRT claim a major AI speed breakthrough, hitting over 1,000 tokens per second on a trillion-parameter model using just 8 commodity GPUs — roughly 15 times faster than ChatGPT and Claude — powered by FP4 quantization and speculative decoding, with an open-source model checkpoint already live on Hugging …

Apple Launches Third-Generation AI Models in Google Cloud Partnership, Powering Redesigned Siri with Major Performance Gains

Apple Launches Third-Generation AI Models in Google Cloud Partnership, Powering Redesigned Siri with Major Performance Gains

Jun 09, 2026
Apple Machine Learning Research

Apple launches its third-generation Apple Foundation Models in partnership with Google Cloud, introducing five new AI models — including on-device and server-based variants — that power a redesigned Siri with major performance gains, with human evaluations showing the new cloud model preferred over its predecessor on nearly 65% of prompts.

Previous
Page 8 of 85
Next
Showing 71 - 80 of 849 articles