Artificial Intelligence, AWS and Reference

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The user signs in by entering a user name and a password.

Generative AI

Generative AI AWS Lambda Authentication

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. You can use AWS services such as Application Load Balancer to implement this approach. API Gateway also provides a WebSocket API.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. We're more than happy to provide further references upon request.

Generative AI

Generative AI AWS Technical Review Backup

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

Digital transformation started creating a digital presence of everything we do in our lives, and artificial intelligence (AI) and machine learning (ML) advancements in the past decade dramatically altered the data landscape. To succeed in todays landscape, every company small, mid-sized or large must embrace a data-centric mindset.

Data

Data Technical Review Software Review Weak Development Team

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions.

Case Study

Case Study Artificial Inteligence Study Generative AI

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

The rise of large language models (LLMs) and foundation models (FMs) has revolutionized the field of natural language processing (NLP) and artificial intelligence (AI). You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more. If you don’t have one, you can create a new account.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. Prerequisites For this guide, you need access to the following prerequisites: An AWS account Proper permissions to deploy EC2 G6 instances.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Response latency refers to the time between the user finishing their speech and beginning to hear the AI assistants response. AWS Local Zones are a type of edge infrastructure deployment that places select AWS services close to large population and industry centers. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.

Generative AI

Generative AI Applications AWS Knowledge Base

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services. Event-driven operations management Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads.

Cloud

Cloud AWS Serverless Policies

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning - AI

APRIL 10, 2025

With this launch, you can now access Mistrals frontier-class multimodal model to build, experiment, and responsibly scale your generative AI ideas on AWS. AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. Additionally, Pixtral Large supports the Converse API and tool usage.

Generative AI

Generative AI AWS Technical Review Artificial Inteligence

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. Refer to the GitHub repository for deployment instructions.

Knowledge Base

Knowledge Base Technical Review Generative AI Lambda

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

AWS Machine Learning - AI

APRIL 11, 2024

AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019.

Generative AI

Generative AI AWS Artificial Inteligence Innovation

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Intelligent document processing , translation and summarization, flexible and insightful responses for customer support agents, personalized marketing content, and image and code generation are a few use cases using generative AI that organizations are rolling out in production.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Use the us-west-2 AWS Region to run this demo. Prerequisites This notebook is designed to run on AWS, using Amazon Bedrock for both Anthropics Claude 3 Sonnet and Stability AI model access. Make sure you have the following set up before moving forward: An AWS account. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. If you’re new to Amazon EC2, refer to the Amazon EC2 User Guide.

Software Review

Software Review Technical Review AWS Artificial Inteligence

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. At AWS, our top priority is safeguarding the security and confidentiality of our customers’ workloads. With the AWS Nitro System , we delivered a first-of-its-kind innovation on behalf of our customers.

Generative AI

Generative AI AWS Artificial Inteligence Infrastructure

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

The time taken to determine the root cause is referred to as mean time to detect (MTTD). The failed instance also needs to be isolated and terminated manually, either through the AWS Management Console , AWS Command Line Interface (AWS CLI), or tools like kubectl or eksctl.

Training

Training Artificial Inteligence Hardware Systems Review

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. For human-in-the-loop evaluation, which can be done by either AWS managed or customer managed teams, you must bring your own dataset.

Applications

Applications Generative AI AWS Artificial Inteligence

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

In these uses case, we have enough reference implementations to point to and say, Theres value to be had here.' Weve seen so many reference implementations, and weve done so many reference implementations, that were going to see massive adoption.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Launching a machine learning (ML) training cluster with Amazon SageMaker training jobs is a seamless process that begins with a straightforward API call, AWS Command Line Interface (AWS CLI) command, or AWS SDK interaction. About the Authors Kanwaljit Khurmi is a Principal Worldwide Generative AI Solutions Architect at AWS.

Training

Training Artificial Inteligence AWS Machine Learning

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning - AI

MAY 1, 2024

Large language models (LLMs) are making a significant impact in the realm of artificial intelligence (AI). Llama2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart.

AWS

AWS Artificial Inteligence Training Generative AI

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

AWS Machine Learning - AI

NOVEMBER 21, 2024

We guide you through deploying the necessary infrastructure using AWS CloudFormation , creating an internal labeling workforce, and setting up your first labeling job. This precision helps models learn the fine details that separate natural from artificial-sounding speech. We demonstrate how to use Wavesurfer.js

Video

Video Lambda AWS Generative AI

Build private and secure enterprise generative AI apps with Amazon Q Business and AWS IAM Identity Center

AWS Machine Learning - AI

APRIL 30, 2024

Amazon Q Business is a conversational assistant powered by generative artificial intelligence (AI) that enhances workforce productivity by answering questions and completing tasks based on information in your enterprise systems. This outcome is achieved with a combination of AWS IAM Identity Center and Amazon Q Business.

Generative AI

Generative AI AWS Enterprise Authentication

Amazon AWS: Dominating In Cloudcomputing, Data Analytics, Artificial Intelligence and IoT

CTOvision

AUGUST 13, 2016

We have been tracking Amazon for years, but as a reference point consider that in November 2006 BusinessWeek ran a cover story with the title "Jeff Bezos' Risky Bet" where the concept of cloud computing as a business model disruptor was catapulted into the mainstream. Research Team.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence IoT AWS

How AI is helping the NFL improve player safety

CIO

FEBRUARY 9, 2024

From the initial kickoff at Allegiant Stadium in Las Vegas for Super Bowl LVIII on Sunday, an artificial intelligence platform will be tracking every move on the field to help keep players safer. This season, the NFL has worked closely with Amazon Web Services (AWS) to debut a new joint effort: Digital Athlete.

Artificial Inteligence

Artificial Inteligence Sport Artificial Intelligence Games

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning - AI

MAY 30, 2024

CBRE is unlocking the potential of artificial intelligence (AI) to realize value across the entire commercial real estate lifecycle—from guiding investment decisions to managing buildings. AWS Prototyping developed an AWS Cloud Development Kit (AWS CDK) stack for deployment following AWS best practices.

AWS

AWS Lambda Performance Artificial Inteligence

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

AWS Machine Learning - AI

APRIL 18, 2024

Tools like Terraform and AWS CloudFormation are pivotal for such transitions, offering infrastructure as code (IaC) capabilities that define and manage complex cloud environments with precision. Generative artificial intelligence (AI) with Amazon Bedrock directly addresses these challenges.

AWS

AWS Applications Lambda Knowledge Base

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

It is designed to handle the demanding computational and latency requirements of state-of-the-art transformer models, including Llama, Falcon, Mistral, Mixtral, and GPT variants for a full list of TGI supported models refer to supported models. For a complete list of runtime configurations, please refer to text-generation-launcher arguments.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Confirm the AWS Regions where the model is available and quotas. Complete the knowledge base evaluation prerequisites related to AWS Identity and Access Management (IAM) creation and add permissions for an S3 bucket to access and write output data. Selected evaluator and generator models enabled in Amazon Bedrock.

Knowledge Base

Knowledge Base Applications Artificial Inteligence Generative AI

Dynamic video content moderation and policy evaluation using AWS generative AI services

AWS Machine Learning - AI

MAY 30, 2024

Generative artificial intelligence (AI) has unlocked fresh opportunities for these use cases. In this post, we introduce the Media Analysis and Policy Evaluation solution, which uses AWS AI and generative AI services to provide a framework to streamline video extraction and evaluation processes.

Generative AI

Generative AI Policies Video AWS

Boost team productivity with Amazon Q Business Insights

AWS Machine Learning - AI

APRIL 9, 2025

They are available at no additional charge in AWS Regions where the Amazon Q Business service is offered. Refer to Monitoring Amazon Q Business and Q Apps for more details. Log groups prefixed with /aws/vendedlogs/ will be created automatically. These logs are then queryable using Amazon Athena.

Weak Development Team

Weak Development Team Metrics AWS Systems Review

5 steps to move AI beyond buzzwords to deliver true transformative impact

CIO

SEPTEMBER 8, 2022

The headlines read “Artificial Intelligence (AI) will completely transform your business.” For several decades this has been the story behind Artificial Intelligence and Machine Learning. Until now, a comprehensive list of AI and ML use cases that serve as meaningful references for business leaders simply did not exist.

Artificial Inteligence

Artificial Inteligence Technical Advisors Artificial Intelligence Machine Learning

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning - AI

NOVEMBER 22, 2023

Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! are the sessions dedicated to AWS DeepRacer ! Generative AI is at the heart of the AWS Village this year. You marked your calendars, you booked your hotel, and you even purchased the airfare. And last but not least (and always fun!)

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Cost, security, and flexibility: the business case for open source gen AI

CIO

DECEMBER 11, 2024

In our case, we run it on AWS within our own private cloud, he says. An abundance of choice In the most general definition, open source here refers to the code thats available, and that the model can be modified and used for free in a variety of contexts. Meta itself refers to it as a community license or a bespoke commercial license.

Open Source

Open Source Artificial Inteligence Technical Review Software Review

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Kaseya

MAY 13, 2021

In this blog, we’ll compare the three leading public cloud providers, namely Amazon Web Services (AWS), Microsoft Azure and Google Cloud. Amazon Web Services (AWS) Overview. A subsidiary of Amazon, AWS was launched in 2006 and offers on-demand cloud computing services on a metered, pay-as-you-go basis. Greater Security.

Google Cloud

Google Cloud Azure AWS Cloud

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

AWS Machine Learning - AI

AUGUST 26, 2024

At AWS, we are transforming our seller and customer journeys by using generative artificial intelligence (AI) across the sales lifecycle. Product consumption – Summaries of how customers are using AWS services over time. The following screenshot shows a sample account summary. The impact goes beyond just efficiency.

Generative AI

Generative AI AWS Artificial Inteligence Technical Review

Multi-LLM routing strategies for generative AI applications on AWS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Webinars

Trending Sources

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Build a multi-tenant generative AI environment for your enterprise on AWS

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

How AWS sales uses Amazon Q Business for customer engagement

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

The future of data: A 5-pillar approach to modern data management

Model customization, RAG, or both: A case study with Amazon Nova

Integrate foundation models into your code with Amazon Bedrock

Host concurrent LLMs with LoRAX

Enable Amazon Bedrock cross-Region inference in multi-account environments

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Empower your generative AI application with a comprehensive custom observability solution

Boost productivity by using AI in cloud operational health management

Pixtral Large is now available in Amazon Bedrock

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

Generative AI operating models in enterprise organizations with Amazon Bedrock

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Automate invoice processing with Streamlit and Amazon Bedrock

A secure approach to generative AI with AWS

Reduce ML training costs with Amazon SageMaker HyperPod

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

12 AI predictions for 2025

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Build private and secure enterprise generative AI apps with Amazon Q Business and AWS IAM Identity Center

Amazon AWS: Dominating In Cloudcomputing, Data Analytics, Artificial Intelligence and IoT

How AI is helping the NFL improve player safety

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

Dynamic video content moderation and policy evaluation using AWS generative AI services

Boost team productivity with Amazon Q Business Insights

5 steps to move AI beyond buzzwords to deliver true transformative impact

Your guide to generative AI and ML at AWS re:Invent 2023

Cost, security, and flexibility: the business case for open source gen AI

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

Stay Connected