Architecture, Reference and Serverless

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. The solution incorporates the following key features: Using a Retrieval Augmented Generation (RAG) architecture, the system generates a context-aware detailed assessment.

Generative AI

Generative AI Technical Review Software Review Systems Review

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Amazon Bedrock Custom Model Import enables the import and use of your customized models alongside existing FMs through a single serverless, unified API. This serverless approach eliminates the need for infrastructure management while providing enterprise-grade security and scalability. 8B 128K model to 8 Units for a Llama 3.1

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

By implementing this architectural pattern, organizations that use Google Workspace can empower their workforce to access groundbreaking AI solutions powered by Amazon Web Services (AWS) and make informed decisions without leaving their collaboration tool. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. Using a client-server architecture (as illustrated in the following screenshot), MCP helps developers expose their data through lightweight MCP servers while building AI applications as MCP clients that connect to these servers.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

Shared components refer to the functionality and features shared by all tenants. API Gateway is serverless and hence automatically scales with traffic. The advantage of using Application Load Balancer is that it can seamlessly route the request to virtually any managed, serverless or self-hosted component and can also scale well.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

Event-driven operations management Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads. The following diagram illustrates the solution architecture. The full code repository is available in the accompanying GitHub repo.

Cloud

Cloud AWS Serverless Policies

Serverless is more than AWS Lambda

Stackery

FEBRUARY 19, 2020

Too often serverless is equated with just AWS Lambda. Yes, it’s true: Amazon Web Services (AWS) helped to pioneer what is commonly referred to as serverless today with AWS Lambda, which was first announced back in 2015. Lambda is just one component of a modern serverless stack.

Lambda

Lambda Serverless AWS Architecture

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning - AI

SEPTEMBER 3, 2024

That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help. In this post, we demonstrate how to leverage the new EMR Serverless integration with SageMaker Studio to streamline your data processing and machine learning workflows.

Serverless

Serverless AWS Artificial Inteligence Big Data

The Future of Serverless is … Functionless?

Stackery

APRIL 11, 2019

Lately, I’ve seen some talk about an architectural pattern that I believe will become prevalent in the near future. I first heard about this pattern a few years ago at a ServerlessConf from a consultant who was helping a “big bank” convert to serverless. DynamoDB Tables and Aurora Serverless Databases).

Serverless

Serverless Lambda AWS Banking

Video security analysis for privileged access management using generative AI and Amazon Bedrock

AWS Machine Learning - AI

JANUARY 22, 2025

We explain the end-to-end solution workflow, the prompts needed to produce the transcript and perform security analysis, and provide a deployable solution architecture. For a comprehensive guide to prompt engineering, refer to Prompt engineering techniques and best practices: Learn by doing with Anthropics Claude 3 on Amazon Bedrock.

Generative AI

Generative AI Video Analysis Technical Review

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

Designed with a serverless, cost-optimized architecture, the platform provisions SageMaker endpoints dynamically, providing efficient resource utilization while maintaining scalability. The following diagram illustrates the solution architecture. Key architectural decisions drive both performance and cost optimization.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. Oleg Chugaev is a Principal Solutions Architect and Serverless evangelist with 20+ years in IT, holding multiple AWS certifications. About the Authors Steven Craig is a Sr.

Generative AI

Generative AI Government Technical Review Innovation

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. The following diagram provides a detailed view of the architecture to enhance email support using generative AI.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine

AWS Machine Learning - AI

NOVEMBER 22, 2023

In this post, we explore building a contextual chatbot for financial services organizations using a RAG architecture with the Llama 2 foundation model and the Hugging Face GPTJ-6B-FP16 embeddings model, both available in SageMaker JumpStart. For an in-depth understanding, refer to the LangChain documentation.

Artificial Inteligence

Artificial Inteligence Serverless Engineering Machine Learning

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

AWS Machine Learning - AI

APRIL 3, 2024

In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model , available in Amazon Bedrock , with Amazon OpenSearch Serverless. The following diagram illustrates the solution architecture. You then display the top similar results.

Serverless

Serverless Artificial Inteligence Engineering Generative AI

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we evaluate different generative AI operating model architectures that could be adopted. Governance in the context of generative AI refers to the frameworks, policies, and processes that streamline the responsible development, deployment, and use of these technologies.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

5 Questions to Ask Before Going Serverless

Modus Create

MAY 15, 2023

When serverless architecture became all the rage a few years ago, we wondered whether it was just marketing hype. Was serverless really cloud 2.0 Serverless architecture’s popularity has risen over the past 5 years. While serverless brings immense benefits to businesses, it’s important not to rush into it.

Serverless

Serverless Lambda Architecture AWS

A serverless glossary

Stackery

MAY 22, 2019

With Serverless, it’s not the technology that’s hard, it’s understanding the language of a new culture and operational model. Serverless architecture has coined some new terms and, more confusingly, re-used a few older terms with new meanings. This glossary will clarify some of them. For now, we’re sticking with ‘App’.

Serverless

Serverless Lambda AWS Resources

Capsule gets $1.5M to build ‘super simple’ decentralized social media

TechCrunch

MARCH 9, 2021

Kobeissi’s original concept for Capsule, meanwhile, was to create self-hosting microservices.

Media

Media Social Blockchain USP

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 1, 2024

We explore how to build a fully serverless, voice-based contextual chatbot tailored for individuals who need it. The aim of this post is to provide a comprehensive understanding of how to build a voice-based, contextual chatbot that uses the latest advancements in AI and serverless computing. We discuss this later in the post.

Serverless

Serverless Artificial Inteligence AWS Software Review

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Cost optimization – This solution uses serverless technologies, making it cost-effective for the observability infrastructure. However, some components may incur additional usage-based costs.

Generative AI

Generative AI Applications AWS Knowledge Base

How do we set up a proper serverless development workflow?

Stackery

MAY 29, 2019

If you’ve built a serverless application or two, you’re probably familiar with the benefits of serverless architecture. There’s another side to the serverless story: developer workflow. Understanding the benefits of serverless is easy, but building serverless apps well requires effective development workflows.

Serverless

Serverless Lambda Development AWS

The Anatomy of a Secure Serverless Platform, pt. I — Design

Stackery

APRIL 1, 2020

A good software design tool enables rapid visualization of application architectures, much like a virtual whiteboard. A great design tool validates service architectures, their communication flows and the infrastructure required to execute them—and builds a scaffold that can be seamlessly taken forward into development.

Serverless

Serverless AWS Architecture Infrastructure

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Modernizing on AWS refers to migrating and transforming traditional applications, workloads, and infrastructure to leverage the benefits of cloud computing and AWS services. Adoption of Cloud-Native Technologies: Companies embrace cloud-native technologies such as containers, serverless computing, and microservices architecture.

AWS

AWS Strategy Serverless Microservices

How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services

AWS Machine Learning - AI

APRIL 22, 2025

Seamless live stream acquisition The solution begins with an IP-enabled camera capturing the live event feed, as shown in the following section of the architecture diagram. A serverless, event-driven workflow using Amazon EventBridge and AWS Lambda automates the post-event processing.

Media

Media Knowledge Base AWS Systems Review

Future of Software Development

Dzone - DevOps

FEBRUARY 14, 2024

Among the most notable trends gaining traction is serverless architecture , offering developers a paradigm shift in how they approach application development. In this article, we delve into the world of serverless architecture, exploring its key concepts, benefits, and implications for the future of software development.

Software Development

Software Development Software Development Serverless

AoAD2 Practice: Evolutionary System Architecture

James Shore

MAY 31, 2021

Evolutionary System Architecture. What about your system architecture? By system architecture, I mean all the components that make up your deployed system. When you do, you get evolutionary system architecture. This is a decidedly unfashionable approach to system architecture. Programmers, Operations. They serve 1.3

System Architecture

System Architecture Architecture Systems Review System

Implementing Knowledge Bases for Amazon Bedrock in support of GDPR (right to be forgotten) requests

AWS Machine Learning - AI

MAY 31, 2024

With the Amazon Bedrock serverless experience, you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using the Amazon Web Services (AWS) tools without having to manage infrastructure. The following diagram depicts a high-level RAG architecture.

Knowledge Base

Knowledge Base Artificial Inteligence AWS Serverless

Improve public speaking skills using a generative AI-based virtual assistant with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 15, 2024

In the following sections, we walk you through constructing a scalable, serverless, end-to-end Public Speaking Mentor AI Assistant with Amazon Bedrock, Amazon Transcribe , and AWS Step Functions using provided sample code. The following diagram shows our solution architecture.

Generative AI

Generative AI Virtualization Technical Advisors AWS

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The architecture is complemented by essential supporting services, including AWS Key Management Service (AWS KMS) for security and Amazon CloudWatch for monitoring, creating a resilient, serverless container environment that alleviates the need to manage underlying infrastructure while maintaining robust security and high availability.

AWS

AWS Generative AI Linux Groups

Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock

AWS Machine Learning - AI

FEBRUARY 9, 2024

Because Amazon Bedrock is serverless, you don’t have to manage any infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. The following diagram summarizes the solution architecture and key components.

Lambda

Lambda Generative AI AWS Microservices

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

This enables sales teams to interact with our internal sales enablement collateral, including sales plays and first-call decks, as well as customer references, customer- and field-facing incentive programs, and content on the AWS website, including blog posts and service documentation.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

Moreover, Amazon Bedrock offers integration with other AWS services like Amazon SageMaker , which streamlines the deployment process, and its scalable architecture makes sure the solution can adapt to increasing call volumes effortlessly. This is powered by the web app portion of the architecture diagram (provided in the next section).

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

AWS Machine Learning - AI

MARCH 3, 2025

Mistral developed a novel architecture for Pixtral 12B, optimized for both computational efficiency and performance. This architecture supports processing an arbitrary number of images of varying sizes within a large context window of 128k tokens. Refer to Requesting a quota increase for access to GPU instances.

Insurance

Insurance AWS eCommerce Software Review

Revisiting “Serverless Architectures”

Mike Roberts

MAY 22, 2018

I started writing “ Serverless Architectures ” in May 2016. Fast forward to two years later and the article has had more than half a million visits, regularly appears in the top five Google search results for “Serverless”, and helped launched Symphonia ?—?my What is Serverless? I thought a few folks might be interested.

Serverless

Serverless Architecture Lambda Microservices

Revisiting “Serverless Architectures”

Mike Roberts

MAY 22, 2018

I started writing “ Serverless Architectures ” in May 2016. Fast forward to two years later and the article has had more than half a million visits, regularly appears in the top five Google search results for “Serverless”, and helped launched Symphonia ?—?my What is Serverless? I thought a few folks might be interested.

Serverless

Serverless Architecture Lambda Microservices

Putting the stack in JAMstack

Stackery

AUGUST 11, 2020

Serverless + JAMstack is where web app architectures are going. These are often referred to as static site generators, but I’m a fan of PayPal’s Jamund Ferguson rephrasing the term as static apps in the recent talk Bringing JAMstack to the Enterprise. Meaning, these are applications with dynamic interactivity.

Serverless

Serverless Lambda AWS Architecture

Building Generative AI prompt chaining workflows with human in the loop

AWS Machine Learning - AI

MAY 17, 2024

The application uses event-driven architecture (EDA), a powerful software design pattern that you can use to build decoupled systems by communicating through events. The second task then asks the LLM to compare the generated response to the reference response using the rules and generate an evaluation score.

Generative AI

Generative AI Artificial Inteligence Systems Review Software Review

GenAI for Aerospace: Empowering the workforce with expert knowledge on Amazon Q and Amazon Bedrock

AWS Machine Learning - AI

SEPTEMBER 26, 2024

This domain knowledge is traditionally captured in reference manuals, service bulletins, quality ticketing systems, engineering drawings, and more, but the quantity and complexity of documents is growing and takes time to learn. In RAG, these knowledge sources are often referred to as a knowledge base. Try it out!

Artificial Inteligence

Artificial Inteligence Generative AI Knowledge Base AWS

Running Serverless in Production: 7 Best Practices for DevOps

DevOps.com

FEBRUARY 8, 2023

Serverless in production refers to the deployment and use of serverless architecture in a live, production environment. In this context, serverless refers to a cloud computing paradigm where the cloud provider manages the infrastructure and allocates resources as needed to run and scale applications and services.

Serverless

Serverless DevOps Architecture Infrastructure

GenASL: Generative AI-powered American Sign Language avatars

AWS Machine Learning - AI

AUGUST 26, 2024

In this post, we dive into the architecture and implementation details of GenASL, which uses AWS generative AI capabilities to create human-like ASL avatar videos. The following diagram shows a high-level overview of the architecture. This tool is essential for building and deploying serverless applications.

Generative AI

Generative AI AWS 3D Video

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. However, building and deploying trustworthy AI assistants requires a robust ground truth and evaluation framework. 201% $12.2B

Generative AI

Generative AI Systems Review Software Review Artificial Inteligence

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Before we dive deep into the deployment of the AI agent, lets walk through the key steps of the architecture, as shown in the following diagram. Use the following AWS CloudFormation template , and refer to Create a stack from the CloudFormation console to launch the stack in your preferred AWS Region.

Generative AI

Generative AI Systems Review System Lambda

Accelerate AWS Well-Architected reviews with Generative AI

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Webinars

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Build a multi-tenant generative AI environment for your enterprise on AWS

Boost productivity by using AI in cloud operational health management

Serverless is more than AWS Lambda

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

The Future of Serverless is … Functionless?

Video security analysis for privileged access management using generative AI and Amazon Bedrock

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

Generative AI operating models in enterprise organizations with Amazon Bedrock

5 Questions to Ask Before Going Serverless

A serverless glossary

Capsule gets $1.5M to build ‘super simple’ decentralized social media

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

Empower your generative AI application with a comprehensive custom observability solution

How do we set up a proper serverless development workflow?

The Anatomy of a Secure Serverless Platform, pt. I — Design

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services

Future of Software Development

AoAD2 Practice: Evolutionary System Architecture

Implementing Knowledge Bases for Amazon Bedrock in support of GDPR (right to be forgotten) requests

Improve public speaking skills using a generative AI-based virtual assistant with Amazon Bedrock

Getting started with computer use in Amazon Bedrock Agents

Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock

How AWS sales uses Amazon Q Business for customer engagement

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

Revisiting “Serverless Architectures”

Revisiting “Serverless Architectures”

Putting the stack in JAMstack

Building Generative AI prompt chaining workflows with human in the loop

GenAI for Aerospace: Empowering the workforce with expert knowledge on Amazon Q and Amazon Bedrock

Running Serverless in Production: 7 Best Practices for DevOps

GenASL: Generative AI-powered American Sign Language avatars

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Stay Connected