Training a frontier model is highly compute-intensive, requiring a distributed system of hundreds or thousands of accelerated instances running for several weeks or months to complete a single job. For example, pre-training the Llama 3 70B model on 15 trillion training tokens took 6.5 million GPU hours.
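To put that number in context, here is a quick back-of-the-envelope conversion from GPU hours to wall-clock time; the cluster size is a hypothetical assumption for illustration, not a figure from the article:

```python
# Rough wall-clock estimate for a 6.5M GPU-hour pre-training job.
# cluster_gpus is a hypothetical assumption, not a number from the article.
total_gpu_hours = 6_500_000
cluster_gpus = 2_048  # assumed accelerators running in parallel

wall_clock_days = total_gpu_hours / cluster_gpus / 24
print(f"~{wall_clock_days:.0f} days on {cluster_gpus} GPUs")  # ~132 days, i.e. months
```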
IT leaders are placing faith in AI. Consider that 76 percent of IT leaders believe generative AI (GenAI) will significantly impact their organizations, and 76 percent are increasing their budgets to pursue AI. But when it comes to cybersecurity, AI has become a double-edged sword.
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. In our tests, we’ve seen substantial improvements in scaling times for generative AI model endpoints across various frameworks.
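Container Caching is applied automatically for supported inference containers, so there is no caching API to call; what it accelerates is the scale-out you configure on an endpoint. As a minimal sketch, assuming a hypothetical endpoint and variant name, endpoint auto scaling is registered through the standard Application Auto Scaling API:

```python
import boto3

# Hypothetical endpoint/variant names; the API calls are standard
# Application Auto Scaling operations for SageMaker endpoints.
aas = boto3.client("application-autoscaling")
resource_id = "endpoint/my-genai-endpoint/variant/AllTraffic"

aas.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=8,
)

aas.put_scaling_policy(
    PolicyName="invocations-per-instance-tracking",
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 10.0,  # target invocations per instance; illustrative
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
        },
    },
)
```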
For generative AI, a stubborn fact is that it consumes very large quantities of compute cycles, data storage, network bandwidth, electrical power, and air conditioning. Infrastructure-intensive or not, generative AI is on the march: by one projection, it will grow from a small share of the overall AI server market in 2022 to 36% in 2027.
Generative AI — AI that can write essays, create artwork and music, and more — continues to attract outsize investor attention. According to one source, generative AI startups raised $1.7 billion in Q1 2023, with an additional $10.68 billion worth of deals announced in the quarter but not yet completed.
If there’s any doubt that mainframes will have a place in the AI future, many organizations running the hardware are already planning for it. Many Kyndryl customers seem to be thinking about how to merge the mission-critical data on their mainframes with AI tools, she says. “I believe you’re going to see both.”
Across diverse industries—including healthcare, finance, and marketing—organizations are now engaged in pre-training and fine-tuning these increasingly larger LLMs, which often boast billions of parameters and longer input sequence lengths. This approach reduces memory pressure and enables efficient training of large models.
Data center spending is driving growth this year, increasing by nearly 35% in 2024 in anticipation of generative AI infrastructure needs. “We have companies trying to build out the data centers that will run gen AI and trying to train AI,” he says. Gartner’s new 2025 IT spending projection is $5.75 trillion.
As I work with financial services and banking organizations around the world, one thing is clear: AI and generative AI are hot topics of conversation. Financial organizations want to capture generative AI’s tremendous potential while mitigating its risks. In short, yes. But it’s an evolution.
Combined with an April IDC survey that found organizations launching an average of 37 AI POCs, the September survey suggests many CIOs have been throwing the proverbial spaghetti at the wall to see what sticks, says Daniel Saroff, global vice president for consulting and research services at IDC. “We could hire five people.”
Yet as organizations figure out how generative AI fits into their plans, IT leaders would do well to pay close attention to one emerging category: multiagent systems. All aboard the multiagent train: it might help to think of multiagent systems as conductors operating a train.
In some ways, the rise of generative AI has echoed the emergence of cloud — only at a far more accelerated pace. And chief among them is that the time is now for IT to get into the driver’s seat with generative AI. If IT organizations are not afraid of shadow AI yet, they should be. The upsides are palpable.
Amazon Bedrock Model Distillation is generally available, and it addresses the fundamental challenge many organizations face when deploying generative AI: how to maintain high performance while reducing costs and latency. This provides optimal performance by maintaining the same structure the model was trained on.
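As a rough sketch of what starting a distillation job can look like, the snippet below calls the Bedrock control-plane API via boto3. All names, ARNs, S3 URIs, and model identifiers are placeholders, and the exact shape of the distillation configuration is an assumption to verify against current Bedrock documentation:

```python
import boto3

# Hypothetical sketch of kicking off a Bedrock Model Distillation job.
# All names, ARNs, model IDs, and S3 URIs below are placeholders.
bedrock = boto3.client("bedrock")

bedrock.create_model_customization_job(
    jobName="my-distillation-job",
    customModelName="my-distilled-model",
    roleArn="arn:aws:iam::111122223333:role/BedrockCustomizationRole",
    baseModelIdentifier="amazon.nova-lite-v1:0",  # student model (placeholder)
    customizationType="DISTILLATION",
    customizationConfig={  # config shape assumed; verify in the Bedrock docs
        "distillationConfig": {
            "teacherModelConfig": {
                "teacherModelIdentifier": "amazon.nova-pro-v1:0",  # teacher (placeholder)
                "maxResponseLengthForInference": 1000,
            }
        }
    },
    trainingDataConfig={"s3Uri": "s3://my-bucket/prompts/train.jsonl"},
    outputDataConfig={"s3Uri": "s3://my-bucket/distillation-output/"},
)
```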
Governments and public services agencies are keen to push forwards with generative AI. Yet making this shift isn’t simply a matter of adopting generative AI tools and hoping this alone will drive success. Data also needs to be sorted, annotated and labelled in order to meet the requirements of generative AI.
Bedrock, meet the Bedrock, it’s part of the modern generative AI family. From the town of Seattle comes Amazon’s entrance into the generative AI race with an offering called Bedrock, writes Kyle. But also because Kyle’s story about Amazon entering the generative AI race was the most-read story on TechCrunch today.
Generative AI will soon be everywhere — including in Salesforce’s Net Zero Cloud environmental, social, and governance (ESG) reporting tool. Salesforce expects to add the new generative AI capabilities in spring 2024, it said. In other words, using generative AI can increase greenhouse gas emissions.
Generative AI (GenAI), the basis for tools like OpenAI ChatGPT, Google Bard and Meta LLaMa, is a new AI technology that has quickly moved front and center into the global limelight. Training a general-purpose LLM can take months. Five days after its launch, ChatGPT exceeded 1 million users.
Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. The biggest concern we hear from customers as they explore the advantages of generative AI is how to protect their highly sensitive data and investments.
2023 has been a break-out year for generative AI technology, as tools such as ChatGPT graduated from lab curiosity to household name. But CIOs are cautiously evaluating how to safely deploy generative AI in the enterprise, and what guard-rails to put around it.
It’s an appropriate takeaway for another prominent and high-stakes topic, generative AI. Generative AI “fuel” and the right “fuel tank”: enterprises are in their own race, hastening to embrace generative AI (another CIO.com article talks more about this). What does this have to do with technology?
They’re split into two main categories — Nvidia NIM, which covers microservices related to deploying production AI models, and CUDA-X, for microservices like cuOpt, the company’s optimization engine. A host of further integrations is also coming to Nvidia’s AI Enterprise 5.0, the company said.
In the era of large language models (LLMs), where generative AI can write, summarize, translate, and even reason across complex documents, the function of data annotation has shifted dramatically. What was once a preparatory task for training AI is now a core part of a continuous feedback and improvement cycle.
Generative AI has been the biggest technology story of 2023. And everyone has opinions about how these language models and art generation programs are going to change the nature of work, usher in the singularity, or perhaps even doom the human race. Many AI adopters are still in the early stages. What’s the reality?
GPU powerhouse Nvidia has bet its future on AI, and a handful of recent announcements focus on pushing the technology’s capabilities forward while making it available to more organizations. As LLM AIs trained in 2023 are deployed, “CIOs will learn what works and what doesn’t, and so a retrain and redeployment cycle will begin,” Rau says.
Modern AI is now multimodal, handling text, images, audio, and video. As AI models continue to scale and evolve, they require massive parallel computing, specialized hardware (GPUs, TPUs), and crucially, optimized networking to ensure efficient training and inference.
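Where that parallelism shows up in practice is in the training loop. Below is a minimal data-parallel sketch using PyTorch’s DistributedDataParallel, with a toy stand-in model; it assumes launch via torchrun, which supplies the environment variables it reads:

```python
# Minimal data-parallel training sketch (toy model as a stand-in).
# Assumes launch via: torchrun --nproc_per_node=<num_gpus> train.py
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

dist.init_process_group("nccl")  # NCCL rides the GPU interconnect/network
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

model = torch.nn.Linear(1024, 1024).cuda(local_rank)  # stand-in for a real model
model = DDP(model, device_ids=[local_rank])  # gradients sync over the network

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
batch = torch.randn(32, 1024, device=local_rank)
loss = model(batch).pow(2).mean()
loss.backward()   # all-reduce of gradients happens here
optimizer.step()
dist.destroy_process_group()
```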
The early bills for generative AI experimentation are coming in, and many CIOs are finding them more hefty than they’d like — some with only themselves to blame. CIOs are also turning to OEMs such as Dell Project Helix or HPE GreenLake for AI, IDC points out. The heart of generative AI lies in GPUs.
Although FMs offer impressive out-of-the-box capabilities, achieving a true competitive edge often requires deep model customization through pre-training or fine-tuning. However, these approaches demand advanced AI expertise, high-performance compute, and fast storage access, and they can be prohibitively expensive for many organizations.
Industry-specific expertise, combined with tailored AI solutions: this is where our team of more than 50,000 AWS-trained consultants comes in. The hardware aspect often gets overlooked in these discussions, but it’s crucial. We’re creating immersive experiences that show real-world transformations across industries.
Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. You can access your imported custom models on-demand and without the need to manage underlying infrastructure.
ChatGPT, Stable Diffusion, and DreamStudio: generative AI is grabbing all the headlines, and rightly so. Gen AI will become a fundamental part of how enterprises manage and deliver IT services and how business users get their work done. The results are impressive and improving at a geometric rate. Not at all.
Amazon Bedrock is the best place to build and scale generative AI applications with large language models (LLMs) and other foundation models (FMs). It enables customers to leverage a variety of high-performing FMs, such as the Claude family of models by Anthropic, to build custom generative AI applications.
Generative AI applications driven by foundation models (FMs) are delivering significant business value to organizations in customer experience, productivity, process optimization, and innovation. In this post, we explore different approaches you can take when building applications that use generative AI.
The increased usage of generative AI models has made tailored experiences possible with minimal technical expertise, and organizations are increasingly using these powerful models to drive innovation and enhance their services across various domains, from natural language processing (NLP) to content generation.
Amid this AI arms race, OpenAI’s latest trademark application with the United States Patent and Trademark Office (USPTO) shows that the organization has goals beyond LLMs. The application lists various hardware, such as AI-powered smart devices, augmented and virtual reality headsets, and even humanoid robots.
We believe generative AI has the potential over time to transform virtually every customer experience we know. Innovative startups like Perplexity AI are going all in on AWS for generative AI. And at the top layer, we’ve been investing in game-changing applications in key areas like generative AI-based coding.
AI-ready data is not something CIOs need to produce for just one application; they’ll need it for all applications that require enterprise-specific intelligence. Unfortunately, many IT leaders are discovering that this goal can’t be reached using standard data practices and traditional IT hardware and software.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies, such as AI21 Labs, Anthropic, Cohere, Meta, Mistral, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.
By integrating generative AI, they can now analyze call transcripts to better understand customer pain points and improve agent productivity. Additionally, they are using generative AI to extract key call drivers, optimize agent workflows, and gain deeper insights into customer sentiment.
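As a concrete illustration of that pattern, here is a minimal sketch using the Amazon Bedrock Converse API to extract a call driver and sentiment from a transcript; the transcript and prompt wording are invented for the example, and the model ID is one of Bedrock’s Claude models:

```python
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

# Invented transcript for illustration.
transcript = (
    "Agent: Thanks for calling, how can I help?\n"
    "Customer: I was billed twice for last month and I'm pretty frustrated."
)

response = bedrock.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",
    messages=[{
        "role": "user",
        "content": [{
            "text": "Identify the primary call driver and the customer "
                    f"sentiment in this transcript:\n\n{transcript}"
        }],
    }],
    inferenceConfig={"maxTokens": 300, "temperature": 0.2},
)
print(response["output"]["message"]["content"][0]["text"])
```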
Private cloud providers may be among the key beneficiaries of today’s generative AI gold rush as, once seemingly passé in favor of public cloud, CIOs are giving private clouds — either on-premises or hosted by a partner — a second look. A Milford, Conn.-based research firm expects the market to reach billions of dollars in 2024 and more than double by 2027.
ChatGPT has turned everything we know about AI on its head. AI encompasses many things. Generative AI and large language models (LLMs) like ChatGPT are only one aspect of AI. But it’s the well-known part of AI. The price-performance value of consuming AI via the tools you already use is hard to beat.
Llama 2 comes in a range of parameter sizes—7 billion, 13 billion, and 70 billion—as well as pre-trained and fine-tuned variations. Many practitioners fine-tune or pre-train these Llama 2 models with their own text data to improve accuracy for their specific use case.
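A common low-cost route for that fine-tuning is parameter-efficient LoRA with Hugging Face’s transformers and peft libraries. The sketch below is illustrative rather than a tuned recipe; the checkpoint is gated behind Meta’s license, and the hyperparameters are typical defaults, not recommendations from the article:

```python
# Minimal LoRA fine-tuning setup for Llama 2 (illustrative hyperparameters).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "meta-llama/Llama-2-7b-hf"  # gated checkpoint; requires accepting Meta's license
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

lora = LoraConfig(
    r=8, lora_alpha=16, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections only
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # only the small adapter matrices will train
```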
In a few short months, generative AI has become a very hot topic. Looking beyond the hype, generative AI is a groundbreaking technology, enabling novel capabilities as it moves rapidly into the enterprise world. Here are ways to proactively preserve trust in generative AI implementations.
DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.
Some CIOs, especially from large enterprises that still rely on the mainframe’s batch-processing prowess, are taking a hard look at IBM’s next-gen mainframe to run — but not train — generative AI models. IBM continues to demonstrate that it has an advanced approach to AI, which includes embedding AI into the z16.