To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. In this post, we explore a generative AI solution leveraging Amazon Bedrock to streamline the Well-Architected Framework Review (WAFR) process.
Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Software-as-a-service (SaaS) applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers.
While organizations continue to discover the powerful applications of generative AI, adoption is often slowed down by team silos and bespoke workflows. To move faster, enterprises need robust operating models and a holistic approach that simplifies the generative AI lifecycle.
Now that we have covered AI agents, we can see that agentic AI refers to the concept of AI systems being capable of independent action and goal achievement, while AI agents are the individual components within such a system that perform each specific task.
Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. In the context of Amazon Bedrock, observability and evaluation become even more crucial.
The road ahead for IT leaders in turning the promise of generative AI into business value remains steep and daunting, but the key components of the gen AI roadmap (data, platform, and skills) are evolving and becoming better defined, as discussed at a recent MIT event moderated by Lan Guan, CAIO at Accenture.
Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock, an AWS managed service to build and scale generative AI applications with foundation models (FMs). See a walkthrough of Steps 4-6 in the animated image below.
AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that answer questions based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.
Generative AI can revolutionize organizations by enabling the creation of innovative applications that offer enhanced customer and employee experiences. In this post, we evaluate different generative AI operating model architectures that could be adopted.
In this post, we share how Hearst, one of the nation’s largest global, diversified information, services, and media companies, overcame these challenges by creating a self-service generative AI conversational assistant for business units seeking guidance from their Cloud Center of Excellence (CCoE).
Generative AI has emerged as a game changer, offering unprecedented opportunities for game designers to push boundaries and create immersive virtual worlds. At the forefront of this revolution is Stability AI’s cutting-edge text-to-image AI model, Stable Diffusion 3.5 Large (SD3.5 Large).
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. In our tests, we’ve seen substantial improvements in scaling times for generative AI model endpoints across various frameworks.
David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.
Asure anticipated that generative AI could help contact center leaders understand their teams’ support performance, identify gaps and pain points in their products, and recognize the most effective strategies for training customer support representatives using call transcripts, according to Yasmine Rodriguez, CTO of Asure.
Generative AI agents offer a powerful solution by automatically interfacing with company systems, executing tasks, and delivering instant insights, helping organizations scale operations without scaling complexity. The following diagram illustrates the generative AI agent solution workflow.
However, to describe what is occurring in the video from what can be visually observed, we can harness the image analysis capabilities of generative AI. We explain the end-to-end solution workflow and the prompts needed to produce the transcript and perform security analysis, and provide a deployable solution architecture.
Generative AI has seen faster and more widespread adoption than any other technology today, with many companies already seeing ROI and scaling up use cases into wide adoption. Vendors are adding gen AI across the board to enterprise software products, and AI developers haven’t been idle this year either.
Refer to Supported Regions and models for batch inference for the currently supported AWS Regions and models. For instructions on how to start your Amazon Bedrock batch inference job, refer to Enhance call center efficiency using batch inference for transcript summarization with Amazon Bedrock.
Generative AI question-answering applications are pushing the boundaries of enterprise productivity. These assistants can be powered by various backend architectures, including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques.
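The core of the RAG pattern mentioned above can be sketched in a few lines: retrieve the passages most relevant to a question, then assemble them into a grounded prompt for an LLM. The corpus, the keyword-overlap scoring, and the prompt template below are illustrative assumptions, not any specific product's implementation.

```python
# Minimal RAG sketch: keyword-overlap retrieval plus prompt assembly.
# A production system would use vector embeddings and a real LLM call.

def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Rank documents by how many query words they share, keep the top k."""
    q_words = set(query.lower().split())
    scored = sorted(
        corpus,
        key=lambda doc: len(q_words & set(doc.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(query: str, passages: list[str]) -> str:
    """Instruct the model to answer only from the retrieved context."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

corpus = [
    "Amazon Bedrock offers foundation models through a single API.",
    "RAG augments an LLM prompt with retrieved documents.",
    "Batch inference processes many requests asynchronously.",
]
question = "How does RAG augment an LLM prompt?"
passages = retrieve(question, corpus)
prompt = build_prompt(question, passages)
print(passages[0])
```

The same retrieve-then-prompt shape underlies agentic and fine-tuned variants; only the retrieval and generation components get more sophisticated.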
Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. You can access your imported custom models on demand and without the need to manage underlying infrastructure.
Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. The following diagram illustrates the solution architecture. For more information, refer to the following GitHub repo , which contains sample code.
This post shows how MuleSoft introduced a generative AI-powered assistant using Amazon Q Business to enhance their internal Cloud Central dashboard. For more on MuleSoft’s journey to cloud computing, refer to Why a Cloud Operating Model? Every organization has unique needs when it comes to AI. Want to take it further?
We’re not at step one of that journey because, as an insurance company, we have been leveraging AI for many years, but we are thinking about generative AI in the sense of: how do we empower our employees and augment their work to help them have more capacity for higher, more complex work?
With Amazon Bedrock and other AWS services, you can build a generative AI-based email support solution to streamline email management and enhance overall customer satisfaction and operational efficiency. AI integration accelerates response times and increases the accuracy and relevance of communications.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.
Amazon Bedrock streamlines the integration of state-of-the-art generative AI capabilities for developers, offering pre-trained models that can be customized and deployed without the need for extensive model training from scratch.
We will deep dive into the MCP architecture later in this post. Developed by Anthropic as an open protocol, the MCP provides a standardized way to connect AI models to virtually any data source or tool. The following diagram illustrates this workflow.
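Since MCP is built on JSON-RPC 2.0, the messages a client exchanges with a server are plain JSON objects. The sketch below builds a request invoking a server-side tool via the protocol's `tools/call` method; the tool name `search_docs` and its arguments are made-up examples, and a real client would send this over stdio or HTTP rather than just serializing it.

```python
import json

# Minimal sketch of an MCP tool-invocation request (JSON-RPC 2.0 framing).
# Hypothetical tool name and arguments; transport is out of scope here.

def make_tool_call(request_id: int, tool: str, arguments: dict) -> str:
    """Serialize a JSON-RPC request for the MCP tools/call method."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    })

msg = make_tool_call(1, "search_docs", {"query": "knowledge base"})
parsed = json.loads(msg)
print(parsed["method"])
```

Because every data source speaks the same message shape, an AI application can swap tools and servers without changing how it frames requests.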
Amazon Bedrock Model Distillation is generally available, and it addresses the fundamental challenge many organizations face when deploying generative AI: how to maintain high performance while reducing costs and latency. For the most current list of supported models, refer to the Amazon Bedrock documentation.
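The idea behind distillation in general can be shown with a toy example: a large "teacher" model's outputs are softened with a temperature and used as training targets for a smaller "student", which learns the teacher's relative preferences rather than just its top answer. This is a conceptual sketch with made-up logits, not how the managed Bedrock feature is implemented.

```python
import math

# Temperature-scaled softmax, the core of classic knowledge distillation.

def softmax(logits: list[float], temperature: float = 1.0) -> list[float]:
    """Convert logits to probabilities; higher temperature flattens them."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

teacher_logits = [4.0, 1.0, 0.5]          # hypothetical teacher outputs
hard_target = softmax(teacher_logits)     # near one-hot at T=1
soft_target = softmax(teacher_logits, temperature=4.0)  # flatter at T=4

# The softened distribution exposes the teacher's ranking of the
# non-top classes, which is the extra signal the student trains on.
print(max(soft_target) < max(hard_target))
```

The student is then trained against `soft_target` (often mixed with the ground-truth labels), letting a much smaller model approximate the teacher's behavior at lower cost and latency.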
Powered by Precision AI™ – our proprietary AI system – this solution combines machine learning, deep learning, and generative AI to deliver advanced, real-time protection. Generative AI enhances the user experience with a natural language interface, making the system more intuitive and intelligent.
Generative AI is a type of artificial intelligence (AI) that can be used to create new content, including conversations, stories, images, videos, and music. Like all AI, generative AI works by using machine learning models, in this case very large models that are pretrained on vast amounts of data, called foundation models (FMs).
Governments and public service agencies understand the enormous potential of generative AI. Recent research by McGuire Research Services for Avanade shows 82% of government employees are using AI on a daily or weekly basis, while 84% of organisations plan to increase their IT investments by up to 24% to take advantage of AI.
Resilience plays a pivotal role in the development of any workload, and generative AI workloads are no different. There are unique considerations when engineering generative AI workloads through a resilience lens. This pattern achieves a statically stable architecture, which is a resiliency best practice.
Generative AI and transformer-based large language models (LLMs) have been in the top headlines recently. These models demonstrate impressive performance in question answering, text summarization, and code and text generation. Fact-checking and rules evaluation require special coverage and will be discussed in an upcoming post.
Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. In just a few weeks, we were able to cut over to Amazon Q and significantly reduce the complexity of our service architecture and operations.
By modern, I refer to an engineering-driven methodology that fully capitalizes on automation and software engineering best practices. This approach is repeatable, minimizes dependence on manual controls, harnesses technology and AI for data management, and integrates seamlessly into the digital product development process.
As generative AI adoption accelerates across enterprises, maintaining safe, responsible, and compliant AI interactions has never been more critical. Amazon Bedrock Guardrails provides configurable safeguards that help organizations build generative AI applications with industry-leading safety protections.
According to BMC research in partnership with Forbes Insight, more than 80% of IT leaders trust AI output and see a significant role for AI, including but not limited to generative AI outputs. Research respondents believe AI will positively impact IT complexity and improve business outcomes.
Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. The biggest concern we hear from customers as they explore the advantages of generative AI is how to protect their highly sensitive data and investments.
The rise of foundation models (FMs), and the fascinating world of generative AI that we live in, is incredibly exciting and opens doors to imagine and build what wasn’t previously possible. Users can input audio, video, or text into GenASL, which generates an ASL avatar video that interprets the provided data.
The increased usage of generative AI models has offered tailored experiences with minimal technical expertise, and organizations are increasingly using these powerful models to drive innovation and enhance their services across various domains, from natural language processing (NLP) to content generation.
Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among other tasks.
This post presents a solution that uses generative artificial intelligence (AI) to standardize air quality data from low-cost sensors in Africa, specifically addressing the data integration problems these sensors present. Qiong (Jo) Zhang, PhD, is a Senior Partner Solutions Architect at AWS, specializing in AI/ML.
By integrating generative AI, they can now analyze call transcripts to better understand customer pain points and improve agent productivity. Additionally, they are using generative AI to extract key call drivers, optimize agent workflows, and gain deeper insights into customer sentiment.
With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. AWS HealthScribe combines speech recognition and generative AI trained specifically for healthcare documentation to accelerate clinical documentation and enhance the consultation experience.