Architecture, AWS and Machine Learning

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. Choose the us-east-1 AWS Region from the top right corner. Choose Manage model access.

Generative AI

Generative AI AWS Artificial Intelligence Applications

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. AWS HealthScribe combines speech recognition and generative AI trained specifically for healthcare documentation to accelerate clinical documentation and enhance the consultation experience.

AWS

AWS Artificial Intelligence Generative AI Machine Learning

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration


Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Implementation of dynamic routing: In this section, we explore different approaches to implementing dynamic routing on AWS, covering both built-in routing features and custom solutions that you can use as a starting point to build your own. The architecture of this system is illustrated in the following figure. 70B and 8B.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Applications

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Intelligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Intelligence

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

We're excited to announce the open source release of AWS MCP Servers for code assistants, a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Intelligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. You can also customize your distributed training.

AWS

AWS Artificial Intelligence Generative AI Training

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock, an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Stability AI backs effort to bring machine learning to biomed

TechCrunch

NOVEMBER 4, 2022

Called OpenBioML, the endeavor’s first projects will focus on machine learning-based approaches to DNA sequencing, protein folding and computational biochemistry. Stability AI’s ethically questionable decisions to date aside, machine learning in medicine is a minefield. Predicting protein structures.

Artificial Intelligence

Artificial Intelligence Machine Learning Biotech Training

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy the AI/ML model efficiently in production environments. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Intelligence AWS

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search: With Amazon Q Business, we've made internal data sources as well as public AWS content available in Field Advisor's index.

AWS

AWS Generative AI Technical Review Artificial Intelligence

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Exclusive to Amazon Bedrock, the Amazon Titan family of models incorporates 25 years of experience innovating with AI and machine learning at Amazon. The AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3. If enabled, its status will display as Access granted.

AWS

AWS Engineering Serverless eCommerce

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

Prerequisites: Before you dive into the integration process, make sure you have the following prerequisites in place: AWS account – You’ll need an AWS account to access and use Amazon Bedrock. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more.

Software Review

Software Review Artificial Intelligence Generative AI AWS

Harness the power of MCP servers with Amazon Bedrock Agents

AWS Machine Learning - AI

APRIL 1, 2025

invoke(input_text="Convert 11am from NYC time to London time") We showcase an example of building an agent to understand your Amazon Web Services (AWS) spend by connecting to AWS Cost Explorer, Amazon CloudWatch, and Perplexity AI through MCP. This gives you an AI agent that can transform the way you manage your AWS spend.

Generative AI

Generative AI AWS Artificial Intelligence Software Review

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. Nemotron-4 15B, with its impressive 15-billion-parameter architecture trained on 8 trillion text tokens, brings powerful multilingual and coding capabilities to Amazon Bedrock. About the authors: James Park is a Solutions Architect at Amazon Web Services.

Artificial Intelligence

Artificial Intelligence Microservices Generative AI AWS

Oracle inks deal with AWS to offer database services

CIO

SEPTEMBER 10, 2024

In continuation of its efforts to help enterprises migrate to the cloud, Oracle said it is partnering with Amazon Web Services (AWS) to offer database services on the latter’s infrastructure. Oracle Database@AWS is expected to be available in preview later in the year with broader availability expected in 2025.

AWS

AWS Azure Database Administration Google Cloud

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

The failed instance also needs to be isolated and terminated manually, either through the AWS Management Console, AWS Command Line Interface (AWS CLI), or tools like kubectl or eksctl. About the Authors: Anoop Saha is a Sr GTM Specialist at Amazon Web Services (AWS) focusing on generative AI model training and inference.

Training

Training Artificial Intelligence Hardware Systems Review

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Intelligence

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview: To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. The following diagram illustrates the solution architecture. At the time of writing, Amazon Nova model fine-tuning is exclusively available in us-east-1.

Case Study

Case Study Artificial Intelligence Study Generative AI

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. versions, catering to different programming preferences.

Generative AI

Generative AI Applications AWS Knowledge Base

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export: Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

The Amazon Bedrock cross-Region inference capability provides organizations with the flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Intelligence

Artificial Intelligence AWS Technical Review Policies

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment: Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. The following diagram illustrates the solution architecture. AWS CDK CLI, follow instructions here.

AWS

AWS Generative AI Linux Groups

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Intelligence AWS Technical Review

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

AWS Machine Learning - AI

NOVEMBER 22, 2024

Seamless integration of latest foundation models (FMs), Prompts, Agents, Knowledge Bases, Guardrails, and other AWS services. Prerequisites: Before implementing the new capabilities, make sure that you have the following: An AWS account. In Amazon Bedrock: Create and test your base prompts for customer service interactions in Prompt Management.

Generative AI

Generative AI Artificial Intelligence Knowledge Base AWS

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency: The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Training

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services.

Artificial Intelligence

Artificial Intelligence Generative AI Travel AWS

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. With its growing feature set, TorchServe is a popular choice for deploying and scaling machine learning models among inference customers.

Generative AI

Generative AI Artificial Intelligence Machine Learning AWS

Marsh McLennan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

In a transformer architecture, such layers are the embedding layers and the multilayer perceptron (MLP) layers. and prior Llama models) and Mistral model architectures for context parallelism. Delving deeper into FP8’s architecture, we discover two distinct subtypes: E4M3 and E5M2. supports the Llama 3.1 (and

Training

Training Artificial Intelligence AWS Machine Learning

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. The following diagram illustrates the solution architecture.

Generative AI

Generative AI Weak Development Team AWS Artificial Intelligence

Insights in implementing production-ready solutions with generative AI

AWS Machine Learning - AI

APRIL 30, 2025

This post explores key insights and lessons learned from AWS customers in Europe, Middle East, and Africa (EMEA) who have successfully navigated this transition, providing a roadmap for others looking to follow suit. For more information, you can watch the AWS Summit Milan 2024 presentation.

Generative AI

Generative AI Artificial Intelligence AWS Machine Learning

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Organizations can now label all Amazon Bedrock models with AWS cost allocation tags, aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.

Generative AI

Generative AI AWS Artificial Intelligence Budget

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. This post covers how to integrate Amazon Q Business into your enterprise setup.

Generative AI

Generative AI AWS Innovation Knowledge Base

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Built on top of EXLerate.AI, EXL's AI orchestration platform, and Amazon Web Services (AWS), Code Harbor eliminates redundant code and optimizes performance, reducing manual assessment, conversion and testing effort by 60% to 80%. The EXLerate.AI

Artificial Intelligence

Artificial Intelligence Enterprise Insurance

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Its improved architecture, based on the Multimodal Diffusion Transformer (MMDiT), combines multiple pre-trained text encoders for enhanced text understanding and uses QK-normalization to improve training stability. Use the us-west-2 AWS Region to run this demo. An Amazon SageMaker domain. Access to Stability AI's SD3.5

Generative AI

Generative AI Games Development AWS

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

Digital transformation started creating a digital presence of everything we do in our lives, and artificial intelligence (AI) and machine learning (ML) advancements in the past decade dramatically altered the data landscape. The choice of vendors should align with the broader cloud or on-premises strategy.

Data

Data Technical Review Software Review Weak Development Team

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The general architecture of the metadata pipeline consists of two primary steps: Generate transcriptions of audio tracks: use speech recognition models to generate accurate transcripts of the audio content. Tom Lauwers is a machine learning engineer on the video personalization team for DPG Media.

Media

Media Video Artificial Intelligence Generative AI

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Hybrid architecture with AWS Local Zones: To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Intelligence Technical Review Systems Review

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. The following diagram is the solution architecture. This is the Amazon Machine Image (AMI) ID you will use. For Name, enter LoRAX - Inference Server.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Storage
