Architecture, AWS and Examples

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

For example, a marketing content creation application might need to perform task types such as text generation, text summarization, sentiment analysis, and information extraction as part of producing high-quality, personalized content. An example is a virtual assistant for enterprise business operations.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. It contains services used to onboard, manage, and operate the environment, for example, to onboard and off-board tenants, users, and models, assign quotas to different tenants, and authentication and authorization microservices.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. You can also customize your distributed training.

AWS

AWS Artificial Inteligence Generative AI Training

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. The following screenshot shows an example of an interaction with Field Advisor.

AWS

AWS Generative AI Technical Review Artificial Inteligence

United Airlines sets its flight plan for gen AI success

CIO

DECEMBER 20, 2024

With the core architectural backbone of the airlines gen AI roadmap in place, including United Data Hub and an AI and ML platform dubbed Mars, Birnbaum has released a handful of models into production use for employees and customers alike. CIO Jason Birnbaum has ambitious plans for generative AI at United Airlines.

Airlines

Airlines Generative AI Artificial Inteligence Weak Development Team

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Enter the next phase of Industry 4.0 with edge AI

CIO

DECEMBER 3, 2024

Generally speaking, a healthy application and data architecture is at the heart of successful modernisation. For example, IBM has developed hundreds of tools and approaches (or “journeys”) over the last 25 years which facilitate the modernisation process in organisations and meet a broad range of requirements.

Industry

Industry AWS Banking Agile

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning - AI

MAY 2, 2025

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.

Generative AI

Generative AI AWS Lambda Authentication

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

For example, searching for a specific red leather handbag with a gold chain using text alone can be cumbersome and imprecise, often yielding results that don’t directly match the user’s intent. The AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3.

AWS

AWS Engineering Serverless eCommerce

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. Using a client-server architecture (as illustrated in the following screenshot), MCP helps developers expose their data through lightweight MCP servers while building AI applications as MCP clients that connect to these servers.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

Harness the power of MCP servers with Amazon Bedrock Agents

AWS Machine Learning - AI

APRIL 1, 2025

invoke(input_text=Convert 11am from NYC time to London time) We showcase an example of building an agent to understand your Amazon Web Service (AWS) spend by connecting to AWS Cost Explorer , Amazon CloudWatch , and Perplexity AI through MCP. This gives you an AI agent that can transform the way you manage your AWS spend.

Generative AI

Generative AI AWS Artificial Inteligence Software Review

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

Organizations must decide on their hosting provider, whether it be an on-prem setup, cloud solutions like AWS, GCP, Azure or specialized data platform providers such as Snowflake and Databricks. Not my original quote, but a cardinal sin of cloud-native data architecture is copying data from one location to another.

Data

Data Technical Review Software Review Weak Development Team

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. Red teaming is critical for uncovering vulnerabilities before they are exploited.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

Throughout this post, we provide detailed code examples and explanations for each step, helping you seamlessly integrate Amazon Bedrock FMs into your code base. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more. We walk through a Python example in this post.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Enter the next phase of Industry 4.0 with edge AI

CIO

DECEMBER 9, 2024

Generally speaking, a healthy application and data architecture is at the heart of successful modernisation. For example, IBM has developed hundreds of tools and approaches (or journeys) over the last 25 years which facilitate the modernisation process in organisations and meet a broad range of requirements.

Industry

Industry Banking AWS Agile

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. For example, your agent could take screenshots, create and edit text files, and run built-in Linux commands.

AWS

AWS Generative AI Linux Groups

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. versions, catering to different programming preferences.

Generative AI

Generative AI Applications AWS Knowledge Base

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

TechCrunch

MARCH 9, 2022

As organizations continue to build out their digital architecture, a new category of enterprise software has emerged to help them manage that process. “Enterprise architecture today is very much about the scaffolding in the organization,” he said. This means that you can also then run, for example, scenario analysis.

Architecture

Architecture Enterprise Network Organization

Overcoming the 6 barriers to IT modernization

CIO

NOVEMBER 26, 2024

For instance, Capital One successfully transitioned from mainframe systems to a cloud-first strategy by gradually migrating critical applications to Amazon Web Services (AWS). It adopted a microservices architecture to decouple legacy components, allowing for incremental updates without disrupting the entire system.

Weak Development Team

Weak Development Team Compliance Culture Budget

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Cloudera

DECEMBER 2, 2024

Cloudera is committed to providing the most optimal architecture for data processing, advanced analytics, and AI while advancing our customers’ cloud journeys. Together, Cloudera and AWS empower businesses to optimize performance for data processing, analytics, and AI while minimizing their resource consumption and carbon footprint.

Sustainability

Sustainability AWS Analytics Infrastructure

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

AWS Machine Learning - AI

NOVEMBER 22, 2024

Seamless integration of latest foundation models (FMs), Prompts, Agents, Knowledge Bases, Guardrails, and other AWS services. Prerequisites Before implementing the new capabilities, make sure that you have the following: An AWS account In Amazon Bedrock: Create and test your base prompts for customer service interactions in Prompt Management.

Generative AI

Generative AI Artificial Inteligence Knowledge Base AWS

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock , Amazon SageMaker , Amazon Redshift , Amazon Glue , Amazon Athena , and Amazon Managed Workflows for Apache Airflow (MWAA) , into a comprehensive data and AI development platform.

Generative AI

Generative AI Applications Technical Review Software Review

Trade routes of the digital age: How data gravity shapes cloud strategy

CIO

APRIL 15, 2025

One of the most striking examples is the Silk Road , a vast network of trade routes that connected the East and West for centuries. However, as companies expand their operations and adopt multi-cloud architectures, they are faced with an invisible but powerful challenge: Data gravity.

Strategy

Strategy Cloud Data Technical Review

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

For example, pre-training the Llama 3 70B model with 15 trillion training tokens took 6.5 Based on these examples, its realistic to expect that in a single hour of large-scale distributed training, an instance will fail about 0.02%0.06% million H100 GPU hours. MPT-7B was trained on 1 trillion tokens over the course of 9.5

Training

Training Artificial Inteligence Hardware Systems Review

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

For example, the previous best model, GPT-4o, could only solve 13% of the problems on the International Mathematics Olympiad, while the new reasoning model solved 83%. Take for example the use of AI in deciding whether to approve a loan, a medical procedure, pay an insurance claim or make employment recommendations.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning - AI

APRIL 30, 2025

This advancement makes sophisticated agent architectures more accessible and economically viable across a broader range of applications and scales of deployment. We recommend referring to the Submit a model distillation job in Amazon Bedrock in the official AWS documentation for the most up-to-date and comprehensive information.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services. Some examples of AWS-sourced operational events include: AWS Health events — Notifications related to AWS service availability, operational issues, or scheduled maintenance that might affect your AWS resources.

Cloud

Cloud AWS Serverless Policies

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. The following diagram provides a detailed view of the architecture to enhance email support using generative AI.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

AWS launches no-code service AppFabric with generative AI assistance

CIO

JUNE 28, 2023

Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric, which is available across AWS’ US East (N.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. The following demos are examples of what the Amazon Q Business web experience looks like.

Generative AI

Generative AI AWS Innovation Knowledge Base

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

The result was a compromised availability architecture. For example, the database team we worked with in an organization new to the cloud launched all the AWS RDS database servers from dev through production, incurring a $600K a month cloud bill nine months before the scheduled production launch.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Accelerating modernization As an example of this transformative potential, EXL demonstrated Code Harbor , its generative AI (genAI)-powered code migration tool. And its modular architecture distributes tasks across multiple agents in parallel, increasing the speed and scalability of migrations. Its a driver of transformation.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Building Resilient Public Networking on AWS: Part 4

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Introducing AWS MCP Servers for code assistants (Part 1)

Build a multi-tenant generative AI environment for your enterprise on AWS

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

How AWS sales uses Amazon Q Business for customer engagement

United Airlines sets its flight plan for gen AI success

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Enter the next phase of Industry 4.0 with edge AI

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

Harness the power of MCP servers with Amazon Bedrock Agents

The future of data: A 5-pillar approach to modern data management

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Integrate foundation models into your code with Amazon Bedrock

Enter the next phase of Industry 4.0 with edge AI

Getting started with computer use in Amazon Bedrock Agents

Empower your generative AI application with a comprehensive custom observability solution

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

Overcoming the 6 barriers to IT modernization

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Enable Amazon Bedrock cross-Region inference in multi-account environments

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Trade routes of the digital age: How data gravity shapes cloud strategy

Reduce ML training costs with Amazon SageMaker HyperPod

12 AI predictions for 2025

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Boost productivity by using AI in cloud operational health management

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS launches no-code service AppFabric with generative AI assistance

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Why GreenOps will succeed where FinOps is failing

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AI in action: Stories of how enterprises are transforming and modernizing

Stay Connected