Reference, Scalability and Serverless

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

We demonstrate how to harness the power of LLMs to build an intelligent, scalable system that analyzes architecture documents and generates insightful recommendations based on AWS Well-Architected best practices. This scalability allows for more frequent and comprehensive reviews.

Generative AI

Generative AI Technical Review Software Review Systems Review

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

APRIL 23, 2024

As successful proof-of-concepts transition into production, organizations are increasingly in need of enterprise scalable solutions. For details on all the fields and providing configuration of various vector stores supported by Knowledge Bases for Amazon Bedrock, refer to AWS::Bedrock::KnowledgeBase.

Knowledge Base

Knowledge Base Scalability Applications Generative AI

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Integrating Key Vault Secrets with Azure Synapse Analytics

Apiumhub

DECEMBER 9, 2024

Give each secret a clear name, as youll use these names to reference them in Synapse. Add a Linked Service to the pipeline that references the Key Vault. When setting up a linked service for these sources, reference the names of the secrets stored in Key Vault instead of hard-coding the credentials.

Azure

Azure Analytics Storage Artificial Inteligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

Since Amazon Bedrock is serverless, you don’t have to manage any infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. We're more than happy to provide further references upon request.

Generative AI

Generative AI AWS Technical Review Backup

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Amazon Bedrock Custom Model Import enables the import and use of your customized models alongside existing FMs through a single serverless, unified API. This serverless approach eliminates the need for infrastructure management while providing enterprise-grade security and scalability.

Generative AI

Generative AI Artificial Inteligence AWS Serverless

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning - AI

SEPTEMBER 3, 2024

That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help. In this post, we demonstrate how to leverage the new EMR Serverless integration with SageMaker Studio to streamline your data processing and machine learning workflows.

Serverless

Serverless AWS Artificial Inteligence Big Data

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. Limited scalability – As the volume of requests increased, the CCoE team couldn’t disseminate updated directives quickly enough. About the Authors Steven Craig is a Sr.

Generative AI

Generative AI Government Technical Review Innovation

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning - AI

APRIL 10, 2025

AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. For more information on generating JSON using the Converse API, refer to Generating JSON with the Amazon Bedrock Converse API. In this post, we discuss the features of Pixtral Large and its possible use cases.

Generative AI

Generative AI AWS Technical Review Artificial Inteligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

Shared components refer to the functionality and features shared by all tenants. API Gateway is serverless and hence automatically scales with traffic. The advantage of using Application Load Balancer is that it can seamlessly route the request to virtually any managed, serverless or self-hosted component and can also scale well.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

If you don’t have an AWS account, refer to How do I create and activate a new Amazon Web Services account? If you don’t have an existing knowledge base, refer to Create an Amazon Bedrock knowledge base. Performance optimization The serverless architecture used in this post provides a scalable solution out of the box.

Generative AI

Generative AI Lambda Applications AWS

DeltaStream secures cash to build real-time streaming databases

TechCrunch

JULY 13, 2022

DeltaStream provides a serverless streaming database to manage, secure and process data streams. “Serverless” refers to the way DeltaStream abstracts away infrastructure, allowing developers to interact with databases without having to think about servers.

Serverless

Serverless Systems Review Storage Technical Review

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Many companies across various industries prioritize modernization in the cloud for several reasons, such as greater agility, scalability, reliability, and cost efficiency, enabling them to innovate faster and stay competitive in today’s rapidly evolving digital landscape.

AWS

AWS Strategy Serverless Microservices

Build scalable Low-Code backends with Booster

The Agile Monkey

DECEMBER 22, 2022

However, these tools may not be suitable for more complex data or situations requiring scalability and robust business logic. On the other hand, using serverless solutions from scratch can be time-consuming and require a lot of effort to set up and manage. You just want to move fast and only care about your business logic , right?

Scalability

Scalability AWS Authentication Open Source

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

AWS Machine Learning - AI

APRIL 3, 2024

In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model , available in Amazon Bedrock , with Amazon OpenSearch Serverless. Store embeddings into the Amazon OpenSearch Serverless as the search engine.

Serverless

Serverless Artificial Inteligence Engineering Generative AI

5 Questions to Ask Before Going Serverless

Modus Create

MAY 15, 2023

When serverless architecture became all the rage a few years ago, we wondered whether it was just marketing hype. Was serverless really cloud 2.0 Serverless architecture’s popularity has risen over the past 5 years. While serverless brings immense benefits to businesses, it’s important not to rush into it.

Serverless

Serverless Lambda Architecture AWS

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Governance in the context of generative AI refers to the frameworks, policies, and processes that streamline the responsible development, deployment, and use of these technologies. For a comprehensive read about vector store and embeddings, you can refer to The role of vector databases in generative AI applications.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

The Anatomy of a Secure Serverless Platform, pt. I — Design

Stackery

APRIL 1, 2020

While a serverless focus might be justified by improving the overall speed and efficiency of your development workflow, security needs to remain a core element at every step. But serverless design also involves a shift in thinking and the daunting challenge of leveraging the massive suite of AWS tools and services.

Serverless

Serverless AWS Architecture Infrastructure

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Use the following AWS CloudFormation template , and refer to Create a stack from the CloudFormation console to launch the stack in your preferred AWS Region. The solutions scalability and flexibility allow organizations to seamlessly integrate advanced AI capabilities into existing applications, databases, and third-party systems.

Generative AI

Generative AI Systems Review System Lambda

Future of Software Development

Dzone - DevOps

FEBRUARY 14, 2024

Among the most notable trends gaining traction is serverless architecture , offering developers a paradigm shift in how they approach application development. In this article, we delve into the world of serverless architecture, exploring its key concepts, benefits, and implications for the future of software development.

Software Development

Software Development Software Development Serverless

Going Serverless: Comparing Cloud Providers

Gorilla Logic

SEPTEMBER 16, 2021

In this article, we are going to compare the leading cloud providers of serverless computing frameworks so that you have enough intel to make a sound decision when choosing one over the others. Scalability, Limits, and Restrictions. Scalability: Lambda creates a new instance to process each new concurrent event. Azure Functions.

Serverless

Serverless Lambda Cloud Azure

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

The solution presented in this post takes approximately 15–30 minutes to deploy and consists of the following key components: Amazon OpenSearch Service Serverless maintains three indexes : the inventory index, the compatible parts index, and the owner manuals index.

Lambda

Lambda Enterprise Automotive Knowledge Base

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 1, 2024

We explore how to build a fully serverless, voice-based contextual chatbot tailored for individuals who need it. The aim of this post is to provide a comprehensive understanding of how to build a voice-based, contextual chatbot that uses the latest advancements in AI and serverless computing. We discuss this later in the post.

Serverless

Serverless Artificial Inteligence AWS Software Review

Putting the stack in JAMstack

Stackery

AUGUST 11, 2020

Serverless + JAMstack is where web app architectures are going. These are often referred to as static site generators, but I’m a fan of PayPal’s Jamund Ferguson rephrasing the term as static apps in the recent talk Bringing JAMstack to the Enterprise. Stackery is focused on helping developers leverage the power of AWS managed services.

Serverless

Serverless Lambda AWS Architecture

How Cato Networks uses Amazon Bedrock to transform free text search into structured GraphQL queries

AWS Machine Learning - AI

JANUARY 22, 2025

With the Amazon Bedrock serverless experience, you can get started quickly, privately customize FMs with your own data, and quickly integrate and deploy them into your applications using AWS tools without having to manage the infrastructure.

Network

Network Artificial Inteligence Machine Learning Serverless

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

The Asure team was manually analyzing thousands of call transcripts to uncover themes and trends, a process that lacked scalability. Staying ahead in this competitive landscape demands agile, scalable, and intelligent solutions that can adapt to changing demands. Architecture The following diagram illustrates the solution architecture.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Cost optimization – This solution uses serverless technologies, making it cost-effective for the observability infrastructure. However, some components may incur additional usage-based costs.

Generative AI

Generative AI Applications AWS Knowledge Base

Improve public speaking skills using a generative AI-based virtual assistant with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 15, 2024

In the following sections, we walk you through constructing a scalable, serverless, end-to-end Public Speaking Mentor AI Assistant with Amazon Bedrock, Amazon Transcribe , and AWS Step Functions using provided sample code. Refer to Configure Amazon SNS to send messages for alerts to other destinations for more information.

Generative AI

Generative AI Virtualization Technical Advisors AWS

Are Cloud Serverless Functions Exposing Your Data?

Prisma Clud

JUNE 6, 2024

More than 25% of all publicly accessible serverless functions have access to sensitive data , as seen in internal research. The question then becomes, Are cloud serverless functions exposing your data? Just need a quick reference? Security Considerations for AWS Lambda Functions AWS’ main serverless offering is Lambda functions.

Serverless

Serverless Cloud Data Azure

Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock

AWS Machine Learning - AI

FEBRUARY 9, 2024

Because Amazon Bedrock is serverless, you don’t have to manage any infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. For more details and specific model prices, refer to Amazon Bedrock Pricing.

Lambda

Lambda Generative AI AWS Microservices

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The architecture is complemented by essential supporting services, including AWS Key Management Service (AWS KMS) for security and Amazon CloudWatch for monitoring, creating a resilient, serverless container environment that alleviates the need to manage underlying infrastructure while maintaining robust security and high availability.

AWS

AWS Generative AI Linux Groups

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

Ground truth data in AI refers to data that is known to be factual, representing the expected use case outcome for the system being modeled. Document Section Targeting - Reference specific sections when the information location is relevant - Example: "In Section [X] of [Document Name], what are the steps for [specific process]?"

Generative AI

Generative AI Systems Review Software Review Artificial Inteligence

Azure Container Apps – Simplifying Container Deployment Without the Kubernetes Complexity

Xebia

MAY 14, 2024

In August 2021, I was accepted to test and provide feedback on what was referred to as ‘Azure Worker Apps’, another Azure service Microsoft was developing to run containers. One of the benefits of being part of the Microsoft MVP program is the access to private previews of services and features. Kubernetes Cluster).

Azure

Azure Microservices Serverless Software Review

Serverless vs containers: Which is best for your application?

CircleCI

MAY 27, 2022

Two of the most widely-used technologies to host these deployments are serverless functions and containers. In this comparison, we will look at some important differentiators between serverless computing and containers and outline some criteria you can use to decide which to use for your next project. What is serverless?

Serverless

Serverless Applications Technical Review Microservices

Building Generative AI prompt chaining workflows with human in the loop

AWS Machine Learning - AI

MAY 17, 2024

The evaluation test suite consists of hundreds of test product reviews, a reference response to the review, and a set of rules to evaluate the LLM response against the reference response. The second task then asks the LLM to compare the generated response to the reference response using the rules and generate an evaluation score.

Generative AI

Generative AI Artificial Inteligence Systems Review Software Review

Serverless Security: Building Robust and Resilient Applications in a Cloud-Native Environment

Altexsoft

SEPTEMBER 6, 2023

Serverless security has become a significant player in the B2B tech landscape. billion in 2021, the serverless security market is projected to surge to USD 5.1 Furthermore, as per recent data , 21% of enterprises have already integrated serverless technology and an additional 39% are exploring its potential. Let’s get started.

Serverless

Serverless Applications Cloud AWS

GenAI for Aerospace: Empowering the workforce with expert knowledge on Amazon Q and Amazon Bedrock

AWS Machine Learning - AI

SEPTEMBER 26, 2024

This domain knowledge is traditionally captured in reference manuals, service bulletins, quality ticketing systems, engineering drawings, and more, but the quantity and complexity of documents is growing and takes time to learn. In RAG, these knowledge sources are often referred to as a knowledge base. Try it out!

Artificial Inteligence

Artificial Inteligence Generative AI Knowledge Base AWS

A Firewall Admin’s Introduction to Serverless Security

Palo Alto Networks

NOVEMBER 5, 2019

Ron Harnik, Senior Product Marketing Manager, Serverless Security. Serverless computing is the latest in a long line of cloud technologies, and many organizations are still wrapping their heads around it. I want to share my view from the front line to help security teams who are taking their first steps in the serverless world. .

Serverless

Serverless Firewall Lambda Weak Development Team

Altexsoft - Untitled Article

Altexsoft

JANUARY 14, 2021

The top tier is referred to as the front-end or client layer. By the level of back-end management involved: Serverless data warehouses get their functional building blocks with the help of serverless services, meaning they are fully-managed by third-party vendors. Scalability opportunities. Scalability.

Backup

Backup Azure Software Review Architecture

Catalog, query, and search audio programs with Amazon Transcribe and Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

AUGUST 5, 2024

As we continue to add new episodes, we will want to use AI services to make the task of querying and searching for specific content more scalable without the need to manually add metadata for each episode. For instructions on transcribing with the AWS Management Console or AWS CLI, refer to the Amazon Transcribe Developer guide.

Knowledge Base

Knowledge Base Artificial Inteligence Programming Generative AI

Improving air quality with generative AI

AWS Machine Learning - AI

JUNE 18, 2024

The objective is to automate data integration from various sensor manufacturers for Accra, Ghana, paving the way for scalability across West Africa. The solution had the following requirements: Cloud hosting – The solution must reside on the cloud, ensuring scalability and accessibility.

Generative AI

Generative AI Artificial Inteligence Technical Review AWS

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

This enables sales teams to interact with our internal sales enablement collateral, including sales plays and first-call decks, as well as customer references, customer- and field-facing incentive programs, and content on the AWS website, including blog posts and service documentation.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

AWS Machine Learning - AI

SEPTEMBER 17, 2024

Our solution uses an FSx for ONTAP file system as the source of unstructured data and continuously populates an Amazon OpenSearch Serverless vector database with the user’s existing files and folders and associated metadata. We use this data and ACLs to test permissions-based access to the embeddings in a RAG scenario with Amazon Bedrock.

Generative AI

Generative AI AWS Applications Serverless

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS CDK

AWS Machine Learning - AI

AUGUST 28, 2024

By using the AWS CDK, the solution sets up the necessary resources, including an AWS Identity and Access Management (IAM) role, Amazon OpenSearch Serverless collection and index, and knowledge base with its associated data source. For installation instructions, refer to the AWS CDK workshop. The AWS CDK already set up.

Knowledge Base

Knowledge Base AWS Generative AI Artificial Inteligence

Multi-LLM routing strategies for generative AI applications on AWS

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Trending Sources

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

Webinars

Integrating Key Vault Secrets with Azure Synapse Analytics

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Pixtral Large is now available in Amazon Bedrock

Build a multi-tenant generative AI environment for your enterprise on AWS

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

DeltaStream secures cash to build real-time streaming databases

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Build scalable Low-Code backends with Booster

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

5 Questions to Ask Before Going Serverless

Generative AI operating models in enterprise organizations with Amazon Bedrock

The Anatomy of a Secure Serverless Platform, pt. I — Design

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Future of Software Development

Going Serverless: Comparing Cloud Providers

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

Putting the stack in JAMstack

How Cato Networks uses Amazon Bedrock to transform free text search into structured GraphQL queries

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Empower your generative AI application with a comprehensive custom observability solution

Improve public speaking skills using a generative AI-based virtual assistant with Amazon Bedrock

Are Cloud Serverless Functions Exposing Your Data?

Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock

Getting started with computer use in Amazon Bedrock Agents

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Azure Container Apps – Simplifying Container Deployment Without the Kubernetes Complexity

Serverless vs containers: Which is best for your application?

Building Generative AI prompt chaining workflows with human in the loop

Serverless Security: Building Robust and Resilient Applications in a Cloud-Native Environment

GenAI for Aerospace: Empowering the workforce with expert knowledge on Amazon Q and Amazon Bedrock

A Firewall Admin’s Introduction to Serverless Security

Altexsoft - Untitled Article

Catalog, query, and search audio programs with Amazon Transcribe and Knowledge Bases for Amazon Bedrock

Improving air quality with generative AI

How AWS sales uses Amazon Q Business for customer engagement

Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS CDK

Stay Connected