Architecture, AWS and Presentation

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

However, this method presents trade-offs. However, it also presents some trade-offs. Implementation of dynamic routing In this section, we explore different approaches to implementing dynamic routing on AWS, covering both built-in routing features and custom solutions that you can use as a starting point to build your own.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

In addition to cost, performing fine tuning for LLMs at scale presents significant technical challenges. To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. To learn more about Trainium chips and the Neuron SDK, see Welcome to AWS Neuron.

AWS

AWS Artificial Inteligence Generative AI Training

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning - AI

MAY 2, 2025

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.

Generative AI

Generative AI AWS Lambda Authentication

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. Using a client-server architecture (as illustrated in the following screenshot), MCP helps developers expose their data through lightweight MCP servers while building AI applications as MCP clients that connect to these servers.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

The device further processes this response, including text-to-speech (TTS) conversion for voice agents, before presenting it to the user. AWS Local Zones are a type of edge infrastructure deployment that places select AWS services close to large population and industry centers. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Trade routes of the digital age: How data gravity shapes cloud strategy

CIO

APRIL 15, 2025

However, as companies expand their operations and adopt multi-cloud architectures, they are faced with an invisible but powerful challenge: Data gravity. This gravitational effect presents a paradox for IT leaders. The adoption of cloud-native architectures further mitigates the impact of data gravity.

Strategy

Strategy Cloud Data Technical Review

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. The following diagram illustrates the solution architecture.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This post presents a solution where you can upload a recording of your meeting (a feature available in most modern digital communication services such as Amazon Chime ) to a centralized video insights and summarization engine. You can invoke Lambda functions from over 200 AWS services and software-as-a-service (SaaS) applications.

Generative AI

Generative AI Video Engineering Artificial Inteligence

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen , and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The following diagram illustrates the solution architecture.

Education

Education Report Technical Review Generative AI

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

However, managing cloud operational events presents significant challenges, particularly in complex organizational structures. It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services. AWS Cost Anomaly Detection alerts – Notifications about unusual spending patterns or cost spikes.

Cloud

Cloud AWS Serverless Policies

Improve Amazon Nova migration performance with data-aware prompt optimization

AWS Machine Learning - AI

APRIL 29, 2025

In the era of generative AI , new large language models (LLMs) are continually emerging, each with unique capabilities, architectures, and optimizations. In this post, we present an LLM migration paradigm and architecture, including a continuous process of model evaluation, prompt generation using Amazon Bedrock, and data-aware optimization.

Artificial Inteligence

Artificial Inteligence Performance Data Generative AI

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

However, using generative AI models in enterprise environments presents unique challenges. Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. The following diagram is the solution architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. The following diagram illustrates the conceptual architecture of an AI assistant with Amazon Bedrock IDE.

Generative AI

Generative AI Applications Technical Review Software Review

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning - AI

APRIL 30, 2025

This advancement makes sophisticated agent architectures more accessible and economically viable across a broader range of applications and scales of deployment. We recommend referring to the Submit a model distillation job in Amazon Bedrock in the official AWS documentation for the most up-to-date and comprehensive information.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Although these advancements offer remarkable capabilities, they also present significant challenges. In a transformer architecture, such layers are the embedding layers and the multilayer perceptron (MLP) layers. and prior Llama models) and Mistral model architectures for context parallelism. supports the Llama 3.1 (and

Training

Training Artificial Inteligence AWS Machine Learning

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Approach and base model overview In this section, we discuss the differences between a fine-tuning and RAG approach, present common use cases for each approach, and provide an overview of the base model used for experiments. The following diagram illustrates the solution architecture.

Case Study

Case Study Artificial Inteligence Study Generative AI

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Insights in implementing production-ready solutions with generative AI

AWS Machine Learning - AI

APRIL 30, 2025

However, the journey from production-ready solutions to full-scale implementation can present distinct operational and technical considerations. Il Sole 24 Ore leveraged its vast internal knowledge with a Retrieval Augmented Generation (RAG) solution powered by AWS.

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Storm in the stratosphere: how the cloud will be reshuffled

Erik Bernhardsson

NOVEMBER 30, 2021

Here's a theory I have about cloud vendors (AWS, Azure, GCP): Cloud vendors 1 will increasingly focus on the lowest layers in the stack: basically leasing capacity in their data centers through an API. Redshift is a data warehouse (aka OLAP database) offered by AWS. If you're an ambitious person, do you go work at AWS?

Cloud

Cloud AWS Weak Development Team Serverless

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

The failed instance also needs to be isolated and terminated manually, either through the AWS Management Console , AWS Command Line Interface (AWS CLI), or tools like kubectl or eksctl. About the Authors Anoop Saha is a Sr GTM Specialist at Amazon Web Services (AWS) focusing on generative AI model training and inference.

Training

Training Artificial Inteligence Hardware Systems Review

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

AWS Machine Learning - AI

NOVEMBER 21, 2024

The path to creating effective AI models for audio and video generation presents several distinct challenges. We guide you through deploying the necessary infrastructure using AWS CloudFormation , creating an internal labeling workforce, and setting up your first labeling job. We demonstrate how to use Wavesurfer.js

Video

Video Lambda AWS Generative AI

Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Development Support Program

AWS Machine Learning - AI

JULY 31, 2024

Amazon Web Services (AWS) is committed to supporting the development of cutting-edge generative artificial intelligence (AI) technologies by companies and organizations across the globe. Let’s dive in and explore how these organizations are transforming what’s possible with generative AI on AWS.

Artificial Inteligence

Artificial Inteligence AWS Programming Innovation

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

AWS Machine Learning - AI

MARCH 3, 2025

Mistral developed a novel architecture for Pixtral 12B, optimized for both computational efficiency and performance. This architecture supports processing an arbitrary number of images of varying sizes within a large context window of 128k tokens. For more Mistral resources on AWS, check out the GitHub repo.

Insurance

Insurance AWS eCommerce Software Review

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

AWS Machine Learning - AI

APRIL 30, 2025

Because Amazon Bedrock is serverless, you dont have to manage infrastructure to securely integrate and deploy generative AI capabilities into your application, handle spiky traffic patterns, and enable new features like cross-Region inference, which helps provide scalability and reliability across AWS Regions.

Artificial Inteligence

Artificial Inteligence Technical Review Software Review Generative AI

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

The Mozart application rapidly compares policy documents and presents comprehensive change details, such as descriptions, locations, excerpts, in a tracked change format. In this post, we describe the development journey of the generative AI companion for Mozart, the data, the architecture, and the evaluation of the pipeline.

Generative AI

Generative AI Technical Review Insurance Policies

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

Designed with a serverless, cost-optimized architecture, the platform provisions SageMaker endpoints dynamically, providing efficient resource utilization while maintaining scalability. The following diagram illustrates the solution architecture. Click here to open the AWS console and follow along.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Video security analysis for privileged access management using generative AI and Amazon Bedrock

AWS Machine Learning - AI

JANUARY 22, 2025

We explain the end-to-end solution workflow, the prompts needed to produce the transcript and perform security analysis, and provide a deployable solution architecture. This architecture can be used for demonstration purposes and testing with your own video recordings and prompts; however, it is not suitable for a production use.

Generative AI

Generative AI Video Analysis Technical Review

Revolutionizing clinical trials with the power of voice and AI

AWS Machine Learning - AI

MARCH 18, 2025

These reports can be presented to clinical trial teams, regulatory bodies, and safety monitoring committees, supporting informed decision-making processes. Insights and reporting The processed data and insights derived from the LLM are presented through interactive dashboards, visualizations, and reports. An AWS account.

Artificial Inteligence

Artificial Inteligence Technical Review Healthcare Systems Review

How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services

AWS Machine Learning - AI

APRIL 22, 2025

To address these challenges, Infosys partnered with Amazon Web Services (AWS) to develop the Infosys Event AI to unlock the insights generated during events. The services used in the solution are granted least-privilege permissions through AWS Identity and Access Management (IAM) policies for security purposes.

Media

Media Knowledge Base AWS Systems Review

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

AWS Machine Learning - AI

AUGUST 26, 2024

At AWS, we are transforming our seller and customer journeys by using generative artificial intelligence (AI) across the sales lifecycle. Prospecting, opportunity progression, and customer engagement present exciting opportunities to utilize generative AI, using historical data, to drive efficiency and effectiveness.

Generative AI

Generative AI AWS Artificial Inteligence Technical Review

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

Model Variants The current DeepSeek model collection consists of the following models: DeepSeek-V3 An LLM that uses a Mixture-of-Experts (MoE) architecture. These models retain their existing architecture while gaining additional reasoning capabilities through a distillation process. GenAI Data Scientist at AWS.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Can VPC Lattice replace AWS Transit Gateway?

Xebia

AUGUST 29, 2023

VPC Lattice offers a new mechanism to connect microservices across AWS accounts and across VPCs in a developer-friendly way. Or if you have an existing landing zone with AWS Transit Gateway, do you already plan to replace it with VPC Lattice? You can also use AWS PrivateLink to inter-connect your VPCs across accounts.

AWS

AWS Load Balancer Microservices Lambda

The Present and Future of Arm and AWS Graviton at Honeycomb

Honeycomb

MAY 23, 2022

As many of you may have read, Amazon has released C7g instances powered by the highly anticipated AWS Graviton3 Processors. Based on the success we had with this experiment (don’t worry, we discuss it below) we can only expect great things to come out of the new AWS Graviton3 Processors. Background. Reservations[]|.Instances[]'

AWS

AWS Lambda Architecture Testing

FlowFixation: AWS Apache Airflow Service Takeover Vulnerability and Why Neglecting Guardrails Puts Major CSPs at Risk

Tenable

MARCH 21, 2024

Tenable Research discovered a one-click account takeover vulnerability in the AWS Managed Workflows Apache Airflow service that could have allowed full takeover of a victim’s web management panel of the Airflow instance. By abusing the vulnerability, an attacker could have forced victims to use and authenticate the attacker’s known session.

AWS

AWS Azure Authentication Software Review

Build a gen AI–powered financial assistant with Amazon Bedrock multi-agent collaboration

AWS Machine Learning - AI

MAY 2, 2025

This architecture demonstrates the significant advantages of deploying multiple specialized agents, each designed to handle distinct aspects of complex tasks such as financial analysis. Furthermore, the systems modular architecture facilitates seamless maintenance, updates, and scalability.

Real Estate

Real Estate Artificial Inteligence Knowledge Base Lambda

Dynamic video content moderation and policy evaluation using AWS generative AI services

AWS Machine Learning - AI

MAY 30, 2024

In this post, we introduce the Media Analysis and Policy Evaluation solution, which uses AWS AI and generative AI services to provide a framework to streamline video extraction and evaluation processes. This solution, powered by AWS AI and generative AI services, meets these needs.

Generative AI

Generative AI Policies Video AWS

Building Resilient Public Networking on AWS: Part 4

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Webinars

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Build a multi-tenant generative AI environment for your enterprise on AWS

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Trade routes of the digital age: How data gravity shapes cloud strategy

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Build a video insights and summarization engine using generative AI with Amazon Bedrock

How BQA streamlines education quality reporting using Amazon Bedrock

Boost productivity by using AI in cloud operational health management

Improve Amazon Nova migration performance with data-aware prompt optimization

Host concurrent LLMs with LoRAX

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Model customization, RAG, or both: A case study with Amazon Nova

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Insights in implementing production-ready solutions with generative AI

Storm in the stratosphere: how the cloud will be reshuffled

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Reduce ML training costs with Amazon SageMaker HyperPod

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Development Support Program

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Video security analysis for privileged access management using generative AI and Amazon Bedrock

Revolutionizing clinical trials with the power of voice and AI

How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Can VPC Lattice replace AWS Transit Gateway?

The Present and Future of Arm and AWS Graviton at Honeycomb

FlowFixation: AWS Apache Airflow Service Takeover Vulnerability and Why Neglecting Guardrails Puts Major CSPs at Risk

Build a gen AI–powered financial assistant with Amazon Bedrock multi-agent collaboration

Dynamic video content moderation and policy evaluation using AWS generative AI services

Stay Connected