Artificial Inteligence and AWS

Artificial Inteligence

AWS

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server. You ask the agent to Book a 5-day trip to Europe in January and we like warm weather.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Join 49,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. Access to Amazon Bedrock foundation models is not granted by default.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Tecton.ai nabs $35M Series B as it releases machine learning feature store

TechCrunch

DECEMBER 7, 2020

Tecton.ai , the startup founded by three former Uber engineers who wanted to bring the machine learning feature store idea to the masses, announced a $35 million Series B today, just seven months after announcing their $20 million Series A. “We help organizations put machine learning into production.

Artificial Inteligence

Artificial Inteligence Machine Learning Recruiting AWS

The New Tech Experience: Innovation, Optimization, and Collaboration

Speaker: Paul Weald, Contact Center Innovator

Learn how to streamline productivity and efficiency across your organization with machine learning and artificial intelligence! How you can leverage innovations in technology and machine learning to improve your customer experience and bottom line.

Artificial Inteligence

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

The use of large language models (LLMs) and generative AI has exploded over the last year. With the release of powerful publicly available foundation models, tools for training, fine tuning and hosting your own LLM have also become democratized. xlarge instances are only available in these AWS Regions.

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Generative and agentic artificial intelligence (AI) are paving the way for this evolution. Built on top of EXLerate.AI, EXLs AI orchestration platform, and Amazon Web Services (AWS), Code Harbor eliminates redundant code and optimizes performance, reducing manual assessment, conversion and testing effort by 60% to 80%.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Streamlit nabs $35M Series B to expand machine learning platform

TechCrunch

APRIL 7, 2021

As a company founded by data scientists, Streamlit may be in a unique position to develop tooling to help companies build machine learning applications. Data scientists can download the open-source project and build a machine learning application, but it requires a certain level of technical aptitude to make all the parts work.

Artificial Inteligence

Artificial Inteligence Machine Learning Open Source Recruiting

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. AWS HealthScribe combines speech recognition and generative AI trained specifically for healthcare documentation to accelerate clinical documentation and enhance the consultation experience.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represent a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Inteligence Study Generative AI

AWS Extends Generative AI Reach to Third-Party IT Platforms

DevOps.com

NOVEMBER 26, 2024

Amazon Web Services (AWS) has extended the reach of its generative artificial intelligence (AI) platform for application development to include a set of plug-in extensions, that make it possible to launch natural language queries against data residing in platforms from Datadog and Wiz.

AWS

AWS Generative AI Artificial Intelligence Artificial Inteligence

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning - AI

FEBRUARY 12, 2025

Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate humanlike text. Researchers developed Medusa , a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.

Artificial Inteligence

Artificial Inteligence Training AWS Machine Learning

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. It consists of one or more components depending on the number of FM providers and number and types of custom models used.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

United Airlines sets its flight plan for gen AI success

CIO

DECEMBER 20, 2024

With the core architectural backbone of the airlines gen AI roadmap in place, including United Data Hub and an AI and ML platform dubbed Mars, Birnbaum has released a handful of models into production use for employees and customers alike.

Airlines

Airlines Generative AI Artificial Inteligence Weak Development Team

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. This a revolutionary new capability within Amazon Bedrock that serves as a centralized hub for discovering, testing, and implementing foundation models (FMs). Prior to joining AWS, Dr. Li held data science roles in the financial and retail industries.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Camelot Secure’s AI wizard eases path to cybersecurity compliance

CIO

NOVEMBER 4, 2024

Like many innovative companies, Camelot looked to artificial intelligence for a solution. Camelot has the flexibility to run on any selected GenAI LLM across cloud providers like AWS, Microsoft Azure, and GCP (Google Cloud Platform), ensuring that the company meets compliance regulations for data security.

Compliance

Compliance Artificial Inteligence Guidelines Artificial Intelligence

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. It supports a wide range of popular open source LLMs, making it a popular choice for diverse AI applications.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

The rise of large language models (LLMs) and foundation models (FMs) has revolutionized the field of natural language processing (NLP) and artificial intelligence (AI). Development environment – Set up an integrated development environment (IDE) with your preferred coding language and tools.

Software Review

Software Review Artificial Inteligence Generative AI AWS

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

Training large language models (LLMs) models has become a significant expense for businesses. For many use cases, companies are looking to use LLM foundation models (FM) with their domain-specific data. To learn more about Trainium chips and the Neuron SDK, see Welcome to AWS Neuron.

AWS

AWS Artificial Inteligence Generative AI Training

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. With the significant developments in the field of generative AI , intelligent applications powered by foundation models (FMs) can help users map out an itinerary through an intuitive natural conversation interface.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Better together? Why AWS is unifying data analytics and AI services in SageMaker

CIO

DECEMBER 6, 2024

This unification of analytics and AI services is perhaps best exemplified by a new offering inside Amazon SageMaker, Unified Studio , a preview of which AWS CEO Matt Garman unveiled at the companys annual re:Invent conference this week.

Analytics

Analytics AWS Data Generative AI

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

AWS Machine Learning - AI

APRIL 21, 2025

In this blog post, we discuss how Prompt Optimization improves the performance of large language models (LLMs) for intelligent text processing task in Yuewen Group. Evolution from Traditional NLP to LLM in Intelligent Text Processing Yuewen Group leverages AI for intelligent analysis of extensive web novel texts.

Artificial Inteligence

Artificial Inteligence Groups Applications Innovation

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy the AI/ML model efficiently in production environments. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Inteligence AWS Artificial Intelligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

The effectiveness of RAG heavily depends on the quality of context provided to the large language model (LLM), which is typically retrieved from vector stores based on user queries. The relevance of this context directly impacts the model’s ability to generate accurate and contextually appropriate responses.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The following were some initial challenges in automation: Language diversity – The services host both Dutch and English shows. Some local shows feature Flemish dialects, which can be difficult for some large language models (LLMs) to understand. The secondary LLM is used to evaluate the summaries on a large scale.

Media

Media Video Artificial Inteligence Generative AI

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

However, as the reach of live streams expands globally, language barriers and accessibility challenges have emerged, limiting the ability of viewers to fully comprehend and participate in these immersive experiences. The extension delivers a web application implemented using the AWS SDK for JavaScript and the AWS Amplify JavaScript library.

Generative AI

Generative AI AWS Lambda Authentication

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

Principal wanted to use existing internal FAQs, documentation, and unstructured data and build an intelligent chatbot that could provide quick access to the right information for different roles. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

National Laboratory has implemented an AI-driven document processing platform that integrates named entity recognition (NER) and large language models (LLMs) on Amazon SageMaker AI. In this post, we discuss how you can build an AI-powered document processing platform with open source NER and LLMs on SageMaker.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits

AWS Machine Learning - AI

APRIL 22, 2025

Over the past several months, we drove several improvements in intelligent prompt routing based on customer feedback and extensive internal testing. In GA, you can configure your own router by selecting any two models from the same model family and then configuring the response quality difference of your router.

Artificial Inteligence

Artificial Inteligence AWS Generative AI Metrics

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Out-of-the-box models often lack the specific knowledge required for certain domains or organizational terminologies. To address this, businesses are turning to custom fine-tuned models, also known as domain-specific large language models (LLMs). Why LoRAX for LoRA deployment on AWS?

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Revolutionizing clinical trials with the power of voice and AI

AWS Machine Learning - AI

MARCH 18, 2025

This is where the integration of cutting-edge technologies, such as audio-to-text translation and large language models (LLMs), holds the potential to revolutionize the way patients receive, process, and act on vital medical information. These insights can include: Potential adverse event detection and reporting.

Artificial Inteligence

Artificial Inteligence Technical Review Healthcare Systems Review

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. It will be marked for deletion and will be deleted when all executions are stopped.

Generative AI

Generative AI AWS Technical Review Backup

Generate financial industry-specific insights using generative AI and in-context fine-tuning

AWS Machine Learning - AI

NOVEMBER 12, 2024

In this blog post, we demonstrate prompt engineering techniques to generate accurate and relevant analysis of tabular data using industry-specific language. This is done by providing large language models (LLMs) in-context sample data with features and labels in the prompt.

Generative AI

Generative AI Artificial Inteligence Industry Analysis

EBSCOlearning scales assessment generation for their online learning content with generative AI

AWS Machine Learning - AI

DECEMBER 11, 2024

In this post, we illustrate how EBSCOlearning partnered with AWS Generative AI Innovation Center (GenAIIC) to use the power of generative AI in revolutionizing their learning assessment process. The evaluation process includes three phases: LLM-based guideline evaluation, rule-based checks, and a final evaluation.

Generative AI

Generative AI Artificial Inteligence Guidelines Education

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

With the rise of large language models (LLMs) like Meta Llama 3.1, there is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. Prerequisites Before you begin, make sure you have the following utilities installed on your local machine or development environment.

AWS

AWS Load Balancer Software Review Artificial Inteligence

AWS adds Guardrails for Amazon Bedrock to help safeguard LLMs

TechCrunch

NOVEMBER 28, 2023

We are all talking about the business gains from using large language models, but there are lot of known issues with these models and finding ways to constrain the answers that a model could give is one way to apply some control to these powerful technologies. All rights reserved.

AWS

AWS Artificial Inteligence Technology Generative AI

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning - AI

APRIL 30, 2025

Prerequisites For a successful implementation of Amazon Bedrock Model Distillation, youll need to meet several requirements. We recommend referring to the Submit a model distillation job in Amazon Bedrock in the official AWS documentation for the most up-to-date and comprehensive information. 70B and Llama 3.1

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Stability AI backs effort to bring machine learning to biomed

TechCrunch

NOVEMBER 4, 2022

Called OpenBioML , the endeavor’s first projects will focus on machine learning-based approaches to DNA sequencing, protein folding and computational biochemistry. Stability AI’s ethically questionable decisions to date aside, machine learning in medicine is a minefield. Predicting protein structures.

Artificial Inteligence

Artificial Inteligence Machine Learning Biotech Training

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

The failed instance also needs to be isolated and terminated manually, either through the AWS Management Console , AWS Command Line Interface (AWS CLI), or tools like kubectl or eksctl. About the Authors Anoop Saha is a Sr GTM Specialist at Amazon Web Services (AWS) focusing on generative AI model training and inference.

Training

Training Artificial Inteligence Hardware Systems Review

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

TechCrunch

NOVEMBER 29, 2023

At its re:Invent conference today, Amazon’s AWS cloud arm announced the launch of SageMaker HyperPod, a new purpose-built service for training and fine-tuning large language models (LLMs). SageMaker HyperPod is now generally available.

Artificial Inteligence

Artificial Inteligence Training Machine Learning AWS

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Tecton.ai nabs $35M Series B as it releases machine learning feature store

The New Tech Experience: Innovation, Optimization, and Collaboration

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AI in action: Stories of how enterprises are transforming and modernizing

Streamlit nabs $35M Series B to expand machine learning platform

Accelerate AWS Well-Architected reviews with Generative AI

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Model customization, RAG, or both: A case study with Amazon Nova

AWS Extends Generative AI Reach to Third-Party IT Platforms

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Build a multi-tenant generative AI environment for your enterprise on AWS

United Airlines sets its flight plan for gen AI success

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Camelot Secure’s AI wizard eases path to cybersecurity compliance

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Integrate foundation models into your code with Amazon Bedrock

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Introducing AWS MCP Servers for code assistants (Part 1)

Better together? Why AWS is unifying data analytics and AI services in SageMaker

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

Building a Scalable ML Pipeline and API in AWS

How AWS sales uses Amazon Q Business for customer engagement

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits

Host concurrent LLMs with LoRAX

Revolutionizing clinical trials with the power of voice and AI

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Generate financial industry-specific insights using generative AI and in-context fine-tuning

EBSCOlearning scales assessment generation for their online learning content with generative AI

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS adds Guardrails for Amazon Bedrock to help safeguard LLMs

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Stability AI backs effort to bring machine learning to biomed

Reduce ML training costs with Amazon SageMaker HyperPod

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

Stay Connected