AWS, Examples and Scalability

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

For example, a marketing content creation application might need to perform task types such as text generation, text summarization, sentiment analysis, and information extraction as part of producing high-quality, personalized content. An example is a virtual assistant for enterprise business operations.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

there is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment.

AWS

AWS Load Balancer Software Review Artificial Inteligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. It contains services used to onboard, manage, and operate the environment, for example, to onboard and off-board tenants, users, and models, assign quotas to different tenants, and authentication and authorization microservices.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions.

Generative AI

Generative AI AWS Technical Review Backup

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI. And its modular architecture distributes tasks across multiple agents in parallel, increasing the speed and scalability of migrations.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. The following screenshot shows an example of an interaction with Field Advisor.

AWS

AWS Generative AI Technical Review Artificial Inteligence

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Without a scalable approach to controlling costs, organizations risk unbudgeted usage and cost overruns. Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Dzone - DevOps

APRIL 9, 2025

Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. By abstracting the complexities of infrastructure, AWS enables teams to focus on innovation. Why Combine AI, ML, and Serverless Computing?

Serverless

Serverless Artificial Inteligence Scalability AWS

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

For example, “A corgi dog sitting on the front porch.” Examples include “oil paint,” “digital art,” “voxel art,” or “watercolor.” For example: “A winding river through a snowy forest in 4K, illuminated by soft winter sunlight, with tree shadows across the snow and icy reflections.”

Engineering

Engineering AWS 3D Generative AI

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. For this example, we will use the 1B version, but other sizes can be deployed using these steps, along with other popular LLMs. xlarge instances are only available in these AWS Regions.

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

IaC enables developers to define infrastructure configurations using code, ensuring consistency, automation, and scalability. AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. Why Use AWS CloudFormation?

AWS

AWS Software Review Systems Review Policies

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. For example, your agent could take screenshots, create and edit text files, and run built-in Linux commands.

AWS

AWS Generative AI Linux Groups

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Users can access these AI capabilities through their organizations single sign-on (SSO), collaborate with team members, and refine AI applications without needing AWS Management Console access. The workflow is as follows: The user logs into SageMaker Unified Studio using their organizations SSO from AWS IAM Identity Center.

Generative AI

Generative AI Systems Review System Lambda

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services.

Generative AI

Generative AI Applications AWS Knowledge Base

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning - AI

NOVEMBER 20, 2024

In this post, we explore how you can use Amazon Q Business , the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity. For example, q-aurora-mysql-source.

Data

Data AWS Groups Knowledge Base

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. An S3 bucket prepared to store the custom model.

Generative AI

Generative AI Artificial Inteligence AWS Serverless

Enabling AWS IAM DB Authentication

Perficient

DECEMBER 23, 2024

Objective: IAM DB Authentication improves security, enables centralized user management, supports auditing, and ensures scalability for database access.

Authentication

Authentication AWS Policies Scalability

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock , Amazon SageMaker , Amazon Redshift , Amazon Glue , Amazon Athena , and Amazon Managed Workflows for Apache Airflow (MWAA) , into a comprehensive data and AI development platform. Navigate to the AWS Secrets Manager console and find the secret -api-keys.

Generative AI

Generative AI Applications Technical Review Software Review

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

With Amazon Bedrock Data Automation, enterprises can accelerate AI adoption and develop solutions that are secure, scalable, and responsible. Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. For example, a request made in the US stays within Regions in the US.

Data

Data Generative AI Artificial Inteligence Compliance

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? HPC services on AWS Compute Technically you could design and build your own HPC cluster on AWS, it will work but you will spend time on plumbing and undifferentiated heavy lifting. AWS has two services to support your HPC workload.

AWS

AWS Performance Storage Linux

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

Although the principles discussed are applicable across various industries, we use an automotive parts retailer as our primary example throughout this post. x or later The AWS CDK CLI installed Deploy the solution The following steps outline the process to deploying the solution using the AWS CDK. Python 3.9 or later Node.js

Lambda

Lambda Enterprise Automotive Knowledge Base

Building Resilient Public Networking on AWS: Part 2

Xebia

JANUARY 18, 2024

Deploy Secure Public Web Endpoints Welcome to Building Resilient Public Networking on AWS—our comprehensive blog series on advanced networking strategies tailored for regional evacuation, failover, and robust disaster recovery. We laid the groundwork for understanding the essentials that underpin the forthcoming discussions.

AWS

AWS Network Load Balancer Software Review

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. For example, Can I speak to your manager? and I would like to speak to someone higher up dont share the same keywords, but are both asking for an escalation.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. Install Python 3.7 or later on your local machine.

Software Review

Software Review Technical Review AWS Artificial Inteligence

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. Refer to Guidelines for preparing your data for Amazon Nova on best practices and example formats when preparing datasets for fine-tuning Amazon Nova models.

Case Study

Case Study Artificial Inteligence Study Generative AI

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

In the current digital environment, migration to the cloud has emerged as an essential tactic for companies aiming to boost scalability, enhance operational efficiency, and reinforce resilience. Get AWS developers A step-by-step AWS migration checklist Mobilunity helps hiring dedicated development teams to businesses worldwide for 14+ years.

AWS

AWS Cloud Weak Development Team DevOps

Streamlining Workflows with Feature Branches and Logical Stacks

Xebia

JANUARY 26, 2025

This blog explores how to optimize feature branch workflows, maintain encapsulated logical stacks, and apply best practices like resource naming to improve clarity, scalability, and cost-effectiveness. This example applies to the more traditional lift and shift approaches. Simple: In the example, we needed an RDS instance.

Weak Development Team

Weak Development Team Serverless Lambda Resources

Mastering AWS Infrastructure as Code with Pulumi and Python

Perficient

MARCH 27, 2025

What Youll Learn How Pulumi works with AWS Setting up Pulumi with Python Deploying various AWS services with real-world examples Best practices and advanced tips Why Pulumi for AWS? Multi-Cloud and Multi-Language Support Deploy across AWS, Azure, and Google Cloud with Python, TypeScript, Go, or.NET.

AWS

AWS Infrastructure Lambda Load Balancer

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Use the us-west-2 AWS Region to run this demo. Prerequisites This notebook is designed to run on AWS, using Amazon Bedrock for both Anthropics Claude 3 Sonnet and Stability AI model access. Make sure you have the following set up before moving forward: An AWS account. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

APRIL 23, 2024

As successful proof-of-concepts transition into production, organizations are increasingly in need of enterprise scalable solutions. The AWS Well-Architected Framework provides best practices and guidelines for designing and operating reliable, secure, efficient, and cost-effective systems in the cloud.

Knowledge Base

Knowledge Base Scalability Applications Generative AI

Creating asynchronous AI agents with Amazon Bedrock

AWS Machine Learning - AI

MARCH 13, 2025

Conversely, asynchronous event-driven systems offer greater flexibility and scalability through their distributed nature. While this approach may introduce more complexity in tracking and debugging workflows, it excels in scenarios requiring high scalability, fault tolerance, and adaptive behavior.

Artificial Inteligence

Artificial Inteligence Lambda Travel Generative AI

What does the new era of location intelligence hold for businesses?

TechCrunch

FEBRUARY 7, 2022

Scalable and data-rich location services are helping consumer-facing business drive transformation and growth along three strategic fronts: Creating richer consumer experiences. Better in-app experiences lead to improved consumer engagement and lasting loyalty.

Business Intelligence

Business Intelligence AWS Data Engineering Sustainability

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Cloudera

JANUARY 7, 2025

A prominent public health organization integrated data from multiple regional health entities within a hybrid multi-cloud environment (AWS, Azure, and on-premise). A leading meal kit provider migrated its data architecture to Cloudera on AWS, utilizing Cloudera’s Open Data Lakehouse capabilities.

Cloud

Cloud Data Scalability Compliance

Build your multilingual personal calendar assistant with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

JULY 3, 2024

To solve this problem, this post shows you how to apply AWS services such as Amazon Bedrock , AWS Step Functions , and Amazon Simple Email Service (Amazon SES) to build a fully-automated multilingual calendar artificial intelligence (AI) assistant. Here’s the generated prompt from the example message).

AWS

AWS Artificial Inteligence Generative AI Lambda

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen , and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The extracted text data is placed into another SQS queue for the next processing step.

Education

Education Report Technical Review Generative AI

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

For example, the previous best model, GPT-4o, could only solve 13% of the problems on the International Mathematics Olympiad, while the new reasoning model solved 83%. Take for example the use of AI in deciding whether to approve a loan, a medical procedure, pay an insurance claim or make employment recommendations.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Multi-LLM routing strategies for generative AI applications on AWS

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Trending Sources

Introducing AWS MCP Servers for code assistants (Part 1)

Webinars

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Build a multi-tenant generative AI environment for your enterprise on AWS

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AI in action: Stories of how enterprises are transforming and modernizing

How AWS sales uses Amazon Q Business for customer engagement

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Best Practices for IaC using AWS CloudFormation

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Getting started with computer use in Amazon Bedrock Agents

Enable Amazon Bedrock cross-Region inference in multi-account environments

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Empower your generative AI application with a comprehensive custom observability solution

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Enabling AWS IAM DB Authentication

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

High-performance computing on AWS

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Building Resilient Public Networking on AWS: Part 2

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Automate invoice processing with Streamlit and Amazon Bedrock

Model customization, RAG, or both: A case study with Amazon Nova

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Streamlining Workflows with Feature Branches and Logical Stacks

Mastering AWS Infrastructure as Code with Pulumi and Python

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

Creating asynchronous AI agents with Amazon Bedrock

What does the new era of location intelligence hold for businesses?

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Build your multilingual personal calendar assistant with Amazon Bedrock and AWS Step Functions

How BQA streamlines education quality reporting using Amazon Bedrock

12 AI predictions for 2025

Stay Connected