AWS, Machine Learning and Training

Tecton.ai nabs $35M Series B as it releases machine learning feature store

TechCrunch

DECEMBER 7, 2020

Tecton.ai , the startup founded by three former Uber engineers who wanted to bring the machine learning feature store idea to the masses, announced a $35 million Series B today, just seven months after announcing their $20 million Series A. “We help organizations put machine learning into production.

Artificial Inteligence

Artificial Inteligence Machine Learning Recruiting AWS

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

Training a frontier model is highly compute-intensive, requiring a distributed system of hundreds, or thousands, of accelerated instances running for several weeks or months to complete a single job. For example, pre-training the Llama 3 70B model with 15 trillion training tokens took 6.5 During the training of Llama 3.1

Training

Training Artificial Inteligence Hardware Systems Review

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

Training large language models (LLMs) models has become a significant expense for businesses. PEFT is a set of techniques designed to adapt pre-trained LLMs to specific tasks while minimizing the number of parameters that need to be updated.

AWS

AWS Artificial Inteligence Generative AI Training

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. AWS HealthScribe combines speech recognition and generative AI trained specifically for healthcare documentation to accelerate clinical documentation and enhance the consultation experience.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Across diverse industries—including healthcare, finance, and marketing—organizations are now engaged in pre-training and fine-tuning these increasingly larger LLMs, which often boast billions of parameters and larger input sequence length. This approach reduces memory pressure and enables efficient training of large models.

Training

Training Artificial Inteligence AWS Machine Learning

Stability AI backs effort to bring machine learning to biomed

TechCrunch

NOVEMBER 4, 2022

Called OpenBioML , the endeavor’s first projects will focus on machine learning-based approaches to DNA sequencing, protein folding and computational biochemistry. Stability AI’s ethically questionable decisions to date aside, machine learning in medicine is a minefield. Predicting protein structures.

Artificial Inteligence

Artificial Inteligence Machine Learning Biotech Training

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. API Gateway also provides a WebSocket API. These components are illustrated in the following diagram.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

The Pro tier, however, would require a highly customized LLM that has been trained on specific data and terminology, enabling it to assist with intricate tasks like drafting complex legal documents. Before migrating any of the provided solutions to production, we recommend following the AWS Well-Architected Framework.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

TechCrunch

NOVEMBER 29, 2023

At its re:Invent conference today, Amazon’s AWS cloud arm announced the launch of SageMaker HyperPod, a new purpose-built service for training and fine-tuning large language models (LLMs). SageMaker HyperPod is now generally available.

Artificial Inteligence

Artificial Inteligence Training Machine Learning AWS

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Exclusive to Amazon Bedrock, the Amazon Titan family of models incorporates 25 years of experience innovating with AI and machine learning at Amazon. The AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3. If enabled, its status will display as Access granted.

AWS

AWS Engineering Serverless eCommerce

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Discover, Protect and Respond with AWS and Prisma Cloud

Prisma Clud

NOVEMBER 22, 2024

Organizations are increasingly turning to cloud providers, like Amazon Web Services (AWS), to address these challenges and power their digital transformation initiatives. However, the vastness of AWS environments and the ease of spinning up new resources and services can lead to cloud sprawl and ongoing security risks.

AWS

AWS Cloud Network Compliance

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

These powerful models, trained on vast amounts of data, can generate human-like text, answer questions, and even engage in creative writing tasks. However, training and deploying such models from scratch is a complex and resource-intensive process, often requiring specialized expertise and significant computational resources.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. Nemotron-4 15B, with its impressive 15-billion-parameter architecture trained on 8 trillion text tokens, brings powerful multilingual and coding capabilities to the Amazon Bedrock. About the authors James Park is a Solutions Architect at Amazon Web Services.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

With the release of powerful publicly available foundation models, tools for training, fine tuning and hosting your own LLM have also become democratized. Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. xlarge instances are only available in these AWS Regions.

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

AWS offers new AI certifications

CIO

JUNE 11, 2024

With a shortage of IT workers with AI skills looming, Amazon Web Services (AWS) is offering two new certifications to help enterprises building AI applications on its platform to find the necessary talent. Candidates for this certification can sign up for an AWS Skill Builder subscription to check three new courses exploring various concepts.

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Machine Learning

Marsh McLennan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Demystifying RAG and model customization RAG is a technique to enhance the capability of pre-trained models by allowing the model access to external domain-specific data sources. Unlike fine-tuning, in RAG, the model doesnt undergo any training and the model weights arent updated to learn the domain knowledge.

Case Study

Case Study Artificial Inteligence Study Generative AI

IT leaders: What’s the gameplan as tech badly outpaces talent?

CIO

MARCH 13, 2025

To help address the problem, he says, companies are doing a lot of outsourcing, depending on vendors and their client engagement engineers, or sending their own people to training programs. In the Randstad survey, for example, 35% of people have been offered AI training up from just 13% in last years survey.

Part-Time VPE

Part-Time VPE Weak Development Team Fractional VPE Fractional CTO

Introducing Cloudera Fine Tuning Studio for Training, Evaluating, and Deploying LLMs with Cloudera AI

Cloudera

NOVEMBER 13, 2024

Several LLMs are publicly available through APIs from OpenAI , Anthropic , AWS , and others, which give developers instant access to industry-leading models that are capable of performing most generalized tasks. Given some example data, LLMs can quickly learn new content that wasn’t available during the initial training of the base model.

Artificial Inteligence

Artificial Inteligence Training Machine Learning Performance

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning - AI

APRIL 30, 2025

We recommend referring to the Submit a model distillation job in Amazon Bedrock in the official AWS documentation for the most up-to-date and comprehensive information. Amazon Bedrock provides two primary methods for preparing your training data: uploading JSONL files to Amazon S3 or using historical invocation logs.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. Its a familiar NeMo-style launcher with which you can choose a recipe and run it on your infrastructure of choice (SageMaker HyperPod or training). recipes=recipe-name.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Marsh McLellan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. These potential vulnerabilities could be exploited by adversaries through various threat vectors.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Mistral-NeMo-Instruct-2407 and Mistral-NeMo-Base-2407 are now available on SageMaker JumpStart

AWS Machine Learning - AI

DECEMBER 6, 2024

You can try these models with SageMaker JumpStart, a machine learning (ML) hub that provides access to algorithms and models that can be deployed with one click for running inference. Both pre-trained base and instruction-tuned checkpoints are available under the Apache 2.0

Artificial Inteligence

Artificial Inteligence AWS Generative AI Training

Are you ready for MLOps? 🫵

Xebia

FEBRUARY 28, 2025

… that is not an awful lot. Both the tech and the skills are there: Machine Learning technology is by now easy to use and widely available. So then let me re-iterate: why, still, are teams having troubles launching Machine Learning models into production? First let’s throw in a statistic. What a waste!

Technical Review

Technical Review Weak Development Team Artificial Inteligence Machine Learning

Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits

AWS Machine Learning - AI

APRIL 22, 2025

Our goal is to enable you to set up automated, optimal routing between large language models (LLMs) through Amazon Bedrock Intelligent Prompt Routing and its deep understanding of model behaviors within each model family, which incorporates state-of-the-art methods for training routers for different sets of models, tasks and prompts.

Artificial Inteligence

Artificial Inteligence AWS Generative AI Metrics

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Its improved architecture, based on the Multimodal Diffusion Transformer (MMDiT), combines multiple pre-trained text encoders for enhanced text understanding and uses QK-normalization to improve training stability. Use the us-west-2 AWS Region to run this demo. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The Education and Training Quality Authority (BQA) plays a critical role in improving the quality of education and training services in the Kingdom Bahrain. BQA oversees a comprehensive quality assurance process, which includes setting performance standards and conducting objective reviews of education and training institutions.

Education

Education Report Technical Review Generative AI

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

By doing this, clients and servers can scale independently, making it a great fit for serverless orchestration powered by Lambda, AWS Fargate for Amazon ECS, or Fargate for Amazon EKS. The dedicated MCP server in this example is running on a local Docker container, but it can be quickly deployed to the AWS Cloud using services like Fargate.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. With its growing feature set, TorchServe is a popular choice for deploying and scaling machine learning models among inference customers.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

LoRA is a technique for efficiently adapting large pre-trained language models to new tasks or domains by introducing small trainable weight matrices, called adapters, within each linear layer of the pre-trained model. Why LoRAX for LoRA deployment on AWS? Two prominent approaches among our customers are LoRAX and vLLM.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. For human-in-the-loop evaluation, which can be done by either AWS managed or customer managed teams, you must bring your own dataset.

Applications

Applications Generative AI AWS Artificial Inteligence

The road to Software 2.0

O'Reilly Media - Data

DECEMBER 10, 2019

Roughly a year ago, we wrote “ What machine learning means for software development.” Karpathy suggests something radically different: with machine learning, we can stop thinking of programming as writing a step of instructions in a programming language like C or Java or Python. Instead, we can program by example.

Software

Software Artificial Inteligence Machine Learning Conference

SAP AI pact with AWS offers customers more gen AI options

CIO

MAY 29, 2024

SAP is expanding its AI ecosystem with a partnership with AWS. The cloud hyperscalers AWS, Google and Microsoft are also important platform partners to operate SAP’s cloud applications. The cloud hyperscalers AWS, Google and Microsoft are also important platform partners to operate SAP’s cloud applications.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

AWS Machine Learning - AI

DECEMBER 12, 2023

A generative pre-trained transformer (GPT) uses causal autoregressive updates to make prediction. Training LLMs requires colossal amount of compute time, which costs millions of dollars. Training LLMs requires colossal amount of compute time, which costs millions of dollars. We’ll outline how we cost-effectively (3.2

AWS

AWS Artificial Inteligence Training Meeting

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning - AI

MAY 1, 2024

Llama2 by Meta is an example of an LLM offered by AWS. It comes in a range of parameter sizes—7 billion, 13 billion, and 70 billion—as well as pre-trained and fine-tuned variations. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart.

Artificial Inteligence

Artificial Inteligence AWS Training Generative AI

Best practices for Meta Llama 3.2 multimodal fine-tuning on Amazon Bedrock

AWS Machine Learning - AI

MAY 1, 2025

Prerequisites To use this feature, make sure that you have satisfied the following requirements: An active AWS account. model customization is available in the US West (Oregon) AWS Region. Refer to Supported models and Regions for fine-tuning and continued pre-training for updates on Regional availability and quotas.

Generative AI

Generative AI AWS Artificial Inteligence Training

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Fine-tuning is a powerful approach in natural language processing (NLP) and generative AI , allowing businesses to tailor pre-trained large language models (LLMs) for specific tasks. The TAT-QA dataset has been divided into train (28,832 rows), dev (3,632 rows), and test (3,572 rows).

Artificial Inteligence

Artificial Inteligence Generative AI Training Metrics

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

It often requires managing multiple machine learning (ML) models, designing complex workflows, and integrating diverse data sources into production-ready formats. Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions.

Data

Data Generative AI Artificial Inteligence Compliance

Tecton.ai nabs $35M Series B as it releases machine learning feature store

Reduce ML training costs with Amazon SageMaker HyperPod

Webinars

Trending Sources

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Webinars

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Stability AI backs effort to bring machine learning to biomed

Build a multi-tenant generative AI environment for your enterprise on AWS

Accelerate AWS Well-Architected reviews with Generative AI

Multi-LLM routing strategies for generative AI applications on AWS

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Introducing AWS MCP Servers for code assistants (Part 1)

Discover, Protect and Respond with AWS and Prisma Cloud

Integrate foundation models into your code with Amazon Bedrock

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS offers new AI certifications

Marsh McLennan IT reorg lays foundation for gen AI

Model customization, RAG, or both: A case study with Amazon Nova

IT leaders: What’s the gameplan as tech badly outpaces talent?

Introducing Cloudera Fine Tuning Studio for Training, Evaluating, and Deploying LLMs with Cloudera AI

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Marsh McLellan IT reorg lays foundation for gen AI

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Mistral-NeMo-Instruct-2407 and Mistral-NeMo-Base-2407 are now available on SageMaker JumpStart

Are you ready for MLOps? 🫵

Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

How BQA streamlines education quality reporting using Amazon Bedrock

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Host concurrent LLMs with LoRAX

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

The road to Software 2.0

SAP AI pact with AWS offers customers more gen AI options

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

Best practices for Meta Llama 3.2 multimodal fine-tuning on Amazon Bedrock

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Stay Connected