Generative AI

782 articles found

NVIDIA Unveils Dynamo: Powering Large Language Models on Cloud

NVIDIA Unveils Dynamo: Powering Large Language Models on Cloud

Jul 16, 2025
Amazon Web Services

NVIDIA introduces Dynamo, an open-source framework optimizing performance and scalability for large language models and generative AI applications on the cloud, featuring innovations like disaggregated prefill and decode phases, dynamic GPU management, efficient caching, and accelerated data transfer, showcasing deployment with DeepSeek-R1-Distill-8b on Amazon EKS.

Previous
Page 54 of 79
Next
Showing 531 - 540 of 782 articles