Architecture, AWS and Reference

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. You can use AWS services such as Application Load Balancer to implement this approach. API Gateway also provides a WebSocket API.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

In these uses case, we have enough reference implementations to point to and say, Theres value to be had here.' Weve seen so many reference implementations, and weve done so many reference implementations, that were going to see massive adoption. Now, it will evolve again, says Malhotra. Agents are the next phase, he says.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

It prevents vendor lock-in, gives a lever for strong negotiation, enables business flexibility in strategy execution owing to complicated architecture or regional limitations in terms of security and legal compliance if and when they rise and promotes portability from an application architecture perspective.

Cloud

Cloud Strategy Architecture Policies

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

By modern, I refer to an engineering-driven methodology that fully capitalizes on automation and software engineering best practices. Organizations must decide on their hosting provider, whether it be an on-prem setup, cloud solutions like AWS, GCP, Azure or specialized data platform providers such as Snowflake and Databricks.

Data

Data Technical Review Software Review Weak Development Team

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Response latency refers to the time between the user finishing their speech and beginning to hear the AI assistants response. AWS Local Zones are a type of edge infrastructure deployment that places select AWS services close to large population and industry centers. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. For more on MuleSofts journey to cloud computing, refer to Why a Cloud Operating Model?

Generative AI

Generative AI AWS Innovation Knowledge Base

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Inteligence AWS Serverless

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services. Event-driven operations management Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads.

Cloud

Cloud AWS Serverless Policies

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

Prerequisites Before you dive into the integration process, make sure you have the following prerequisites in place: AWS account – You’ll need an AWS account to access and use Amazon Bedrock. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.

Generative AI

Generative AI Applications AWS Knowledge Base

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure.

Generative AI

Generative AI Government Technical Review Innovation

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Its improved architecture, based on the Multimodal Diffusion Transformer (MMDiT), combines multiple pre-trained text encoders for enhanced text understanding and uses QK-normalization to improve training stability. Use the us-west-2 AWS Region to run this demo. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. This blog explores the best practices for utilizing AWS CloudFormation to achieve reliable, secure, and efficient infrastructure management. Why Use AWS CloudFormation? Example: 3.

AWS

AWS Software Review Systems Review Policies

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. The following diagram illustrates the solution architecture. AWS CDK CLI, follow instructions here.

AWS

AWS Generative AI Linux Groups

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions.

Case Study

Case Study Artificial Inteligence Study Generative AI

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The general architecture of the metadata pipeline consists of two primary steps: Generate transcriptions of audio tracks: use speech recognition models to generate accurate transcripts of the audio content. To evaluate the metadata quality, the team used reference-free LLM metrics, inspired by LangSmith.

Media

Media Video Artificial Inteligence Generative AI

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isnt just about moving data; it requires careful planning and execution. Need to hire skilled engineers?

AWS

AWS Cloud Weak Development Team DevOps

IBM and AWS Create a Path to Modernization Via Industry-Specific Solutions

CIO

OCTOBER 13, 2022

As part of their partnership, IBM and Amazon Web Services (AWS) are pursuing a variety of industry-specific blueprints and solutions designed to help customers modernize apps for a hybrid IT environment, which includes AWS Cloud. AWS/IBM’s Industry Edge.

AWS

AWS Industry Technical Review Disaster Recovery

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. The following diagram provides a detailed view of the architecture to enhance email support using generative AI.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Large organizations often have many business units with multiple lines of business (LOBs), with a central governing entity, and typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. In this post, we evaluate different generative AI operating model architectures that could be adopted.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Discover insights from Gmail using the Gmail connector for Amazon Q Business

AWS Machine Learning - AI

OCTOBER 31, 2024

The web application that the user uses to retrieve answers is connected to an identity provider (IdP) or AWS IAM Identity Center. The user’s credentials from the IdP or IAM Identity Center are referred to here as the federated user credentials. Refer to How Amazon Q Business connector crawls Gmail ACLs for more information.

AWS

AWS Generative AI Groups Applications

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

For a comprehensive overview of metadata filtering and its benefits, refer to Amazon Bedrock Knowledge Bases now supports metadata filtering to improve retrieval accuracy. The following diagram illustrates high level RAG architecture with dynamic metadata filtering. Finally, the generated response is returned to the user.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning - AI

NOVEMBER 20, 2024

In this post, we explore how you can use Amazon Q Business , the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity. In this post, we use IAM Identity Center as the SAML 2.0-aligned

Data

Data AWS Groups Knowledge Base

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

AWS Machine Learning - AI

APRIL 18, 2024

Tools like Terraform and AWS CloudFormation are pivotal for such transitions, offering infrastructure as code (IaC) capabilities that define and manage complex cloud environments with precision. AWS Landing Zone addresses this need by offering a standardized approach to deploying AWS resources.

AWS

AWS Applications Lambda Knowledge Base

AWS Security Essentials: From Prevention to Detection

Xebia

SEPTEMBER 24, 2024

AWS offers a range of security services like AWS Security Hub, AWS GuardDuty, Amazon Inspector, Amazon Macie etc. This post will dive into how we can monitor these AWS Security services and build a layered security approach, emphasizing the importance of both prevention and detection.

AWS

AWS Lambda Firewall Policies

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

The time taken to determine the root cause is referred to as mean time to detect (MTTD). The failed instance also needs to be isolated and terminated manually, either through the AWS Management Console , AWS Command Line Interface (AWS CLI), or tools like kubectl or eksctl.

Training

Training Artificial Inteligence Hardware Systems Review

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Our proposed architecture provides a scalable and customizable solution for online LLM monitoring, enabling teams to tailor your monitoring solution to your specific use cases and requirements. A modular architecture, where each module can intake model inference data and produce its own metrics, is necessary.

Artificial Inteligence

Artificial Inteligence AWS Lambda Metrics

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Users can access these AI capabilities through their organizations single sign-on (SSO), collaborate with team members, and refine AI applications without needing AWS Management Console access. Before we dive deep into the deployment of the AI agent, lets walk through the key steps of the architecture, as shown in the following diagram.

Generative AI

Generative AI Systems Review System Lambda

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? HPC services on AWS Compute Technically you could design and build your own HPC cluster on AWS, it will work but you will spend time on plumbing and undifferentiated heavy lifting. AWS has two services to support your HPC workload.

AWS

AWS Performance Storage Linux

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Cloud modernization has become a prominent topic for organizations, and AWS plays a crucial role in helping them modernize their IT infrastructure, applications, and services. Overall, discussions on AWS modernization are focused on security, faster releases, efficiency, and steps towards GenAI and improved innovation.

AWS

AWS Strategy Serverless Microservices

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. Amazon Bedrock Data Automation optimizes for available AWS Regional capacity by automatically routing across regions within the same geographic area to maximize throughput at no additional cost.

Data

Data Generative AI Artificial Inteligence Compliance

Can VPC Lattice replace AWS Transit Gateway?

Xebia

AUGUST 29, 2023

VPC Lattice offers a new mechanism to connect microservices across AWS accounts and across VPCs in a developer-friendly way. Or if you have an existing landing zone with AWS Transit Gateway, do you already plan to replace it with VPC Lattice? You can also use AWS PrivateLink to inter-connect your VPCs across accounts.

AWS

AWS Load Balancer Microservices Lambda

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

At AWS, our top priority is safeguarding the security and confidentiality of our customers’ workloads. With the AWS Nitro System , we delivered a first-of-its-kind innovation on behalf of our customers. The Nitro System is an unparalleled computing backbone for AWS, with security and performance at its core.

Generative AI

Generative AI AWS Artificial Inteligence Infrastructure

Building Resilient Public Networking on AWS: Part 4

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Build a multi-tenant generative AI environment for your enterprise on AWS

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

How AWS sales uses Amazon Q Business for customer engagement

12 AI predictions for 2025

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

The future of data: A 5-pillar approach to modern data management

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Boost productivity by using AI in cloud operational health management

Integrate foundation models into your code with Amazon Bedrock

Enable Amazon Bedrock cross-Region inference in multi-account environments

Empower your generative AI application with a comprehensive custom observability solution

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Best Practices for IaC using AWS CloudFormation

Getting started with computer use in Amazon Bedrock Agents

Model customization, RAG, or both: A case study with Amazon Nova

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

IBM and AWS Create a Path to Modernization Via Industry-Specific Solutions

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Generative AI operating models in enterprise organizations with Amazon Bedrock

Discover insights from Gmail using the Gmail connector for Amazon Q Business

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

AWS Security Essentials: From Prevention to Detection

Reduce ML training costs with Amazon SageMaker HyperPod

Techniques and approaches for monitoring large language models on AWS

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

High-performance computing on AWS

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Can VPC Lattice replace AWS Transit Gateway?

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

A secure approach to generative AI with AWS

Stay Connected