AWS, Reference and Serverless

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering: SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Applications
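
The routing idea in the excerpt above (match the user prompt to the closest reference prompt, then use that reference prompt's task category to choose an LLM) can be sketched roughly as follows. This is a minimal illustration, not the article's implementation: the reference prompts, category names, and model names are hypothetical placeholders, and the word-count `embed` function is a toy stand-in for whatever embedding model a real system would presumably use.

```python
import math
from collections import Counter

# Hypothetical reference prompts, each labelled with a task category.
REFERENCE_PROMPTS = [
    ("Summarize this article in three sentences", "summarization"),
    ("Write a Python function that sorts a list", "code_generation"),
    ("Translate this sentence into French", "translation"),
]

# Hypothetical category-to-model mapping (placeholder names, not real model IDs).
CATEGORY_TO_MODEL = {
    "summarization": "small-fast-model",
    "code_generation": "code-tuned-model",
    "translation": "multilingual-model",
}


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def route(prompt):
    """Return the model mapped to the category of the closest reference prompt."""
    query = embed(prompt)
    _, category = max(
        REFERENCE_PROMPTS,
        key=lambda ref: cosine(query, embed(ref[0])),
    )
    return CATEGORY_TO_MODEL[category]


if __name__ == "__main__":
    print(route("Please translate this sentence into German"))
```

Keeping the category-to-model mapping separate from the reference prompts means a model can be swapped for a whole category without touching the matching logic.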

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. API Gateway is serverless and hence automatically scales with traffic. It’s serverless so you don’t have to manage the infrastructure.

Generative AI

Generative AI AWS Enterprise Artificial Intelligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. We're more than happy to provide further references upon request.

Generative AI

Generative AI AWS Technical Review Backup

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases. AWS-specific knowledge search: With Amazon Q Business, we’ve made internal data sources as well as public AWS content available in Field Advisor’s index.

AWS

AWS Generative AI Technical Review Artificial Intelligence

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock, AWS Health, AWS Step Functions, and other AWS services. Event-driven operations management: Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads.

Cloud

Cloud AWS Serverless Policies

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. The challenge: Enabling self-service cloud governance at scale. Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure.

Generative AI

Generative AI Government Technical Review Innovation

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Amazon Bedrock Custom Model Import enables the import and use of your customized models alongside existing FMs through a single serverless, unified API. This serverless approach eliminates the need for infrastructure management while providing enterprise-grade security and scalability. Take note of the S3 path you’re using.

Generative AI

Generative AI Artificial Intelligence AWS Technical Review

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning - AI

APRIL 10, 2025

With this launch, you can now access Mistral’s frontier-class multimodal model to build, experiment, and responsibly scale your generative AI ideas on AWS. AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. Additionally, Pixtral Large supports the Converse API and tool usage.

Generative AI

Generative AI AWS Technical Review Artificial Intelligence

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

The solution presented in this post takes approximately 15–30 minutes to deploy and consists of the following key components: Amazon OpenSearch Service Serverless maintains three indexes: the inventory index, the compatible parts index, and the owner manuals index. Python 3.9 or later; Node.js

Lambda

Lambda Enterprise Automotive Knowledge Base

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.

Generative AI

Generative AI Applications AWS Knowledge Base

Serverless is more than AWS Lambda

Stackery

FEBRUARY 19, 2020

Too often serverless is equated with just AWS Lambda. Yes, it’s true: Amazon Web Services (AWS) helped to pioneer what is commonly referred to as serverless today with AWS Lambda, which was first announced back in 2014.

Lambda

Lambda Serverless AWS Architecture

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Cloud modernization has become a prominent topic for organizations, and AWS plays a crucial role in helping them modernize their IT infrastructure, applications, and services. Overall, discussions on AWS modernization are focused on security, faster releases, efficiency, and steps towards GenAI and improved innovation.

AWS

AWS Strategy Serverless Microservices

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment: Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. Prerequisites: AWS Command Line Interface (CLI). Requires Python 3.11

AWS

AWS Generative AI Linux Groups

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

Solution overview: In this section, we walk through a reference architecture for scalable deployment of MCP servers and MCP clients, using SageMaker AI as the hosting environment for the foundation models (FMs) and LLMs.

Artificial Intelligence

Artificial Intelligence AWS Architecture Generative AI

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning - AI

SEPTEMBER 3, 2024

That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help. In this post, we demonstrate how to leverage the new EMR Serverless integration with SageMaker Studio to streamline your data processing and machine learning workflows.

Serverless

Serverless AWS Artificial Intelligence Big Data

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. Refer to the GitHub repository for deployment instructions.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Video security analysis for privileged access management using generative AI and Amazon Bedrock

AWS Machine Learning - AI

JANUARY 22, 2025

With the Amazon Bedrock serverless experience, you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using the AWS tools without having to manage any infrastructure. The transcript is provided in tags.

Generative AI

Generative AI Video Analysis Technical Review

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Users can access these AI capabilities through their organization’s single sign-on (SSO), collaborate with team members, and refine AI applications without needing AWS Management Console access. The workflow is as follows: The user logs into SageMaker Unified Studio using their organization’s SSO from AWS IAM Identity Center.

Generative AI

Generative AI Systems Review System Lambda

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Large organizations often have many business units with multiple lines of business (LOBs), with a central governing entity, and typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. LOBs have autonomy over their AI workflows, models, and data within their respective AWS accounts.

Generative AI

Generative AI Organization Enterprise Artificial Intelligence

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency: The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

AWS Machine Learning - AI

APRIL 11, 2024

AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place in March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019.

Generative AI

Generative AI AWS Artificial Intelligence Innovation

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? HPC services on AWS: Compute. Technically, you could design and build your own HPC cluster on AWS; it will work, but you will spend time on plumbing and undifferentiated heavy lifting. AWS has two services to support your HPC workload.

AWS

AWS Performance Storage Linux

The Future of Serverless is … Functionless?

Stackery

APRIL 11, 2019

I first heard about this pattern a few years ago at a ServerlessConf from a consultant who was helping a “big bank” convert to serverless. 6.10, which is approaching EOL for AWS Lambda? What if, instead, we could do the following: This may seem magical, but it’s possible using advanced mechanisms built into AWS API Gateway.

Serverless

Serverless Lambda AWS Banking

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

AWS Machine Learning - AI

APRIL 30, 2025

Similarly, when an incident occurs in IT, the responding team must provide a precise, documented history for future reference and troubleshooting. The following diagram illustrates the architecture using AWS services. A data sanitization workflow kicks off using AWS Step Functions, consisting of AWS Lambda functions.

Artificial Intelligence

Artificial Intelligence Technical Review Software Review Generative AI

Boost team productivity with Amazon Q Business Insights

AWS Machine Learning - AI

APRIL 9, 2025

They are available at no additional charge in AWS Regions where the Amazon Q Business service is offered. Refer to Monitoring Amazon Q Business and Q Apps for more details. Log groups prefixed with /aws/vendedlogs/ will be created automatically. These logs are then queryable using Amazon Athena.

Weak Development Team

Weak Development Team Metrics AWS Systems Review

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

AWS Machine Learning - AI

MARCH 3, 2025

You can review the Mistral published benchmarks. Prerequisites: To try out Pixtral 12B in Amazon Bedrock Marketplace, you will need the following prerequisites: An AWS account that will contain all your AWS resources. An AWS Identity and Access Management (IAM) role to access Amazon Bedrock Marketplace and Amazon SageMaker endpoints.

Insurance

Insurance AWS eCommerce Software Review

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

AWS Machine Learning - AI

APRIL 3, 2024

In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model, available in Amazon Bedrock, with Amazon OpenSearch Serverless. Store embeddings in Amazon OpenSearch Serverless as the search engine.

Serverless

Serverless Artificial Intelligence Engineering Generative AI

Using design patterns in AWS Lambda

Xebia

JANUARY 15, 2024

I have noticed the same behavior with serverless. In this blog post I will go over some reasons why you should be using design patterns in your Lambda functions. Getting started: To get started with AWS Lambda is quite easy, and this is also the reason why some crucial steps are skipped. Thanks Tensor Programming for the inspiration.

Lambda

Lambda AWS Software Review Serverless

How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services

AWS Machine Learning - AI

APRIL 22, 2025

To address these challenges, Infosys partnered with Amazon Web Services (AWS) to develop the Infosys Event AI to unlock the insights generated during events. The services used in the solution are granted least-privilege permissions through AWS Identity and Access Management (IAM) policies for security purposes.

Media

Media Knowledge Base AWS Systems Review

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

Designed with a serverless, cost-optimized architecture, the platform provisions SageMaker endpoints dynamically, providing efficient resource utilization while maintaining scalability. Click here to open the AWS console and follow along. The following diagram illustrates the solution architecture.

Artificial Intelligence

Artificial Intelligence Open Source AWS Serverless

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine

AWS Machine Learning - AI

NOVEMBER 22, 2023

We also use Vector Engine for Amazon OpenSearch Serverless (currently in preview) as the vector data store to store embeddings. Asynchronous updates – To ensure the reference documents remain current, they can be updated asynchronously along with their embedding representations. An OpenSearch Serverless collection.

Artificial Intelligence

Artificial Intelligence Serverless Engineering Machine Learning

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isn’t just about moving data; it requires careful planning and execution. Need to hire skilled engineers?

AWS

AWS Cloud Weak Development Team DevOps

Spend Smarter, Not More: A Guide to AWS Storage Cost Optimization

Xebia

JANUARY 8, 2024

The cloud, particularly Amazon Web Services (AWS), has made storing vast amounts of data simpler than ever before. S3 Storage: Undoubtedly, anyone who uses AWS will inevitably encounter S3, one of the platform’s most popular storage services. The following table gives you an overview of AWS storage costs.

Storage

Storage AWS Backup Policies

Can VPC Lattice replace AWS Transit Gateway?

Xebia

AUGUST 29, 2023

VPC Lattice offers a new mechanism to connect microservices across AWS accounts and across VPCs in a developer-friendly way. Or if you have an existing landing zone with AWS Transit Gateway, do you already plan to replace it with VPC Lattice? You can also use AWS PrivateLink to inter-connect your VPCs across accounts.

AWS

AWS Load Balancer Microservices Lambda

Building Resilient Public Networking on AWS: Part 2

Xebia

JANUARY 18, 2024

Deploy Secure Public Web Endpoints: Welcome to Building Resilient Public Networking on AWS, our comprehensive blog series on advanced networking strategies tailored for regional evacuation, failover, and robust disaster recovery. We laid the groundwork for understanding the essentials that underpin the forthcoming discussions.

AWS

AWS Network Load Balancer Software Review

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 1, 2024

At Amazon and AWS, we are always finding innovative ways to build inclusive technology. We explore how to build a fully serverless, voice-based contextual chatbot tailored for individuals who need it. All the services that we use are serverless and fully managed by AWS. We also provide a sample chatbot application.

Serverless

Serverless Artificial Intelligence AWS Software Review

Serverless and Edge Runtime Part 2

Apiumhub

SEPTEMBER 27, 2023

This is the second post in a two-part series exploring the world of Serverless and Edge Runtime. In the previous post, we got familiar with serverless; the main focus of this post will be the Edge Runtime, where it can be useful, and what its caveats are. Edge, the Location: the concept of running servers closer to our users.

Serverless

Serverless Software Review Lambda AWS

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning - AI

MAY 30, 2024

Because Amazon Bedrock is serverless, you don’t have to manage infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. AWS Identity and Access Management (IAM) enforces the necessary permissions for the frontend application.

AWS

AWS Lambda Performance Artificial Intelligence

AWS Lambda Benchmarking

Xebia

FEBRUARY 19, 2024

In this blog post, we examine the relative costs of different language runtimes on AWS Lambda. Many languages can be used with AWS Lambda today, so we focus on four interesting ones. Rust just came to AWS Lambda in November 2023, so probably a lot of folks are wondering whether to try it out.

Lambda

Lambda AWS Software Review Systems Review

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in QuickSight

AWS Machine Learning - AI

MARCH 20, 2025

Our partnership with AWS and our commitment to be early adopters of innovative technologies like Amazon Bedrock underscore our dedication to making advanced HCM technology accessible for businesses of any size. We are thrilled to partner with AWS on this groundbreaking generative AI project. John Canada, VP of Engineering at Asure.

Generative AI

Generative AI Artificial Intelligence Metrics AWS

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

Ground truth data in AI refers to data that is known to be factual, representing the expected use case outcome for the system being modeled. By segment, North America revenue increased 12% YoY from $316B to $353B, International revenue grew 11% YoY from $118B to $131B, and AWS revenue increased 13% YoY from $80B to $91B.

Generative AI

Generative AI Systems Review Software Review Artificial Intelligence

Automate the process to change image backgrounds using Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

MARCH 7, 2024

However, Amazon Bedrock and AWS Step Functions make it straightforward to automate this process at scale. Step Functions allows you to create an automated workflow that seamlessly connects with Amazon Bedrock and other AWS services. The DynamoDB update triggers an AWS Lambda function, which starts a Step Functions workflow.

AWS

AWS Lambda Generative AI Report
