Architecture, Artificial Intelligence and AWS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Intelligence Applications

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Implementation of dynamic routing: In this section, we explore different approaches to implementing dynamic routing on AWS, covering both built-in routing features and custom solutions that you can use as a starting point to build your own. For example, Amazon Bedrock can intelligently route requests between Anthropic’s Claude 3.5

Artificial Intelligence

Artificial Intelligence Generative AI AWS Applications
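
The custom side of dynamic routing mentioned in the excerpt above can be illustrated with a very small rule-based router. This is only a sketch under stated assumptions: the route names, model identifiers, word-count threshold, and keyword hints are all hypothetical placeholders, and it does not represent Amazon Bedrock's built-in routing feature or the approach taken in the original post.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    name: str       # hypothetical label for the route
    model_id: str   # placeholder identifier, not a real model ID
    max_words: int  # prompts up to this many words may use this route


# Hypothetical routing table, ordered from cheapest to most capable.
ROUTES = [
    Route("small", "example-small-model", 50),
    Route("large", "example-large-model", 10**9),
]

# Hypothetical phrases that suggest a prompt needs deeper reasoning.
REASONING_HINTS = ("explain why", "prove", "step by step", "compare")


def choose_route(prompt: str, routes=ROUTES) -> Route:
    """Pick a route from simple heuristics: keyword hints, then prompt length."""
    text = prompt.lower()
    # Prompts that look like reasoning tasks go to the most capable route.
    if any(hint in text for hint in REASONING_HINTS):
        return routes[-1]
    word_count = len(text.split())
    # Otherwise use the first (cheapest) route whose length limit fits.
    for route in routes:
        if word_count <= route.max_words:
            return route
    return routes[-1]


if __name__ == "__main__":
    for prompt in ("What is S3?", "Explain why eventual consistency matters"):
        print(prompt, "->", choose_route(prompt).name)
```

A production router would more likely rely on a classifier or semantic similarity than on keywords; the heuristic here only shows where the routing decision sits in the request path.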

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy the AI/ML model efficiently in production environments. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Intelligence AWS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock, an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Intelligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

Principal wanted to use existing internal FAQs, documentation, and unstructured data and build an intelligent chatbot that could provide quick access to the right information for different roles. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Intelligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases, including AWS-specific knowledge search. With Amazon Q Business, we’ve made internal data sources as well as public AWS content available in Field Advisor’s index.

AWS

AWS Generative AI Technical Review Artificial Intelligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. You can also customize your distributed training.

AWS

AWS Artificial Intelligence Generative AI Training

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Generative and agentic artificial intelligence (AI) are paving the way for this evolution. Built on top of EXLerate.AI, EXL’s AI orchestration platform, and Amazon Web Services (AWS), Code Harbor eliminates redundant code and optimizes performance, reducing manual assessment, conversion and testing effort by 60% to 80%.

Artificial Intelligence

Artificial Intelligence Enterprise Insurance

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

Digital transformation started creating a digital presence of everything we do in our lives, and artificial intelligence (AI) and machine learning (ML) advancements in the past decade dramatically altered the data landscape. The choice of vendors should align with the broader cloud or on-premises strategy.

Data

Data Technical Review Software Review Weak Development Team

Enter the next phase of Industry 4.0 with edge AI

CIO

DECEMBER 3, 2024

Generally speaking, a healthy application and data architecture is at the heart of successful modernisation. IBM and Amazon Web Services (AWS) have partnered up to make this easier. All kinds of things can be automated. The question is, how should businesses go about modernising their own applications effectively?

Industry

Industry AWS Banking Agile

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

The rise of large language models (LLMs) and foundation models (FMs) has revolutionized the field of natural language processing (NLP) and artificial intelligence (AI). You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more. If you don’t have one, you can create a new account.

Software Review

Software Review Artificial Intelligence Generative AI AWS

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. The following diagram provides a detailed view of the architecture to enhance email support using generative AI.

Knowledge Base

Knowledge Base Technical Review Generative AI Lambda

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

AWS Machine Learning - AI

NOVEMBER 22, 2024

Seamless integration of latest foundation models (FMs), Prompts, Agents, Knowledge Bases, Guardrails, and other AWS services. Prerequisites: Before implementing the new capabilities, make sure that you have an AWS account. In Amazon Bedrock, create and test your base prompts for customer service interactions in Prompt Management.

Generative AI

Generative AI Artificial Intelligence Knowledge Base AWS

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

IBM and AWS forge global alliance, streamlining access to AI and hybrid cloud solutions

CIO

MAY 3, 2024

IBM has announced the expansion of its software portfolio to 92 countries in AWS Marketplace, a digital catalog with thousands of software listings from independent software vendors (ISVs). Since AWS is the cloud infra leader with thousands of enterprises and many of them overlap with IBM and RedHat using their SaaS solutions.

AWS

AWS Cloud Artificial Intelligence

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

This surge is driven by the rapid expansion of cloud computing and artificial intelligence, both of which are reshaping industries and enabling unprecedented scalability and innovation. The result was a compromised availability architecture. This lack of engagement results in inertia and minimal progress.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Organizations can now label all Amazon Bedrock models with AWS cost allocation tags, aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.

Generative AI

Generative AI AWS Artificial Intelligence Budget

How I replaced Xebia Leadership with Artificial Intelligence

Xebia

APRIL 20, 2023

That’s right, folks; I replaced the Xebia leadership with artificial intelligence! The payload, which includes the selected Xebian and the question, is sent to the API endpoint at [link]. This endpoint is a CloudFront distribution in front of an AWS Lambda function that acts as an HTTP endpoint. What would you build?

Artificial Intelligence

Artificial Intelligence Leadership Lambda

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Hybrid architecture with AWS Local Zones: To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Intelligence Technical Review Systems Review

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Intelligence AWS Technical Review

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. The following diagram is the solution architecture. aws ec2 describe-images --filters 'Name=name,Values=Deep Learning OSS Nvidia Driver AMI GPU PyTorch 2.5*(Ubuntu*'

Artificial Intelligence

Artificial Intelligence AWS Generative AI Storage

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

The Amazon Bedrock cross-Region inference capability provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Intelligence

Artificial Intelligence AWS Technical Review Policies

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Intelligence

What is enterprise architecture? A framework for transformation

CIO

NOVEMBER 23, 2022

Enterprise architecture definition: Enterprise architecture (EA) is the practice of analyzing, designing, planning, and implementing enterprise analysis to successfully execute on business strategies. Making it easier to evaluate existing architecture against long-term goals.

Architecture

Architecture Enterprise Agile Artificial Intelligence

Creating asynchronous AI agents with Amazon Bedrock

AWS Machine Learning - AI

MARCH 13, 2025

Advancements in multimodal artificial intelligence (AI), where agents can understand and generate not just text but also images, audio, and video, will further broaden their applications. This post will discuss agentic AI driven architecture and ways of implementing.

Artificial Intelligence

Artificial Intelligence Lambda Travel Generative AI

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

In a transformer architecture, such layers are the embedding layers and the multilayer perceptron (MLP) layers. … supports the Llama 3.1 (and prior Llama models) and Mistral model architectures for context parallelism. Delving deeper into FP8’s architecture, we discover two distinct subtypes: E4M3 and E5M2.

Training

Training Artificial Intelligence AWS Machine Learning
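
To make the two FP8 subtypes named in the excerpt above more concrete, the following sketch computes the largest finite value each layout can represent. It assumes the commonly used FP8 conventions, which the excerpt itself does not spell out: E5M2 follows IEEE-style rules (the all-ones exponent is reserved for infinity and NaN), while E4M3 reserves only the single all-ones bit pattern for NaN. The function name and parameters are illustrative.

```python
def fp8_max_finite(exp_bits: int, man_bits: int, ieee_like: bool) -> float:
    """Largest finite magnitude for a 1-sign / exp_bits / man_bits float layout."""
    bias = 2 ** (exp_bits - 1) - 1
    if ieee_like:
        # IEEE-style (assumed for E5M2): the top exponent code is reserved
        # for infinity and NaN, so the largest usable code is one below it.
        max_exp_field = 2 ** exp_bits - 2
        max_mantissa = 2 ** man_bits - 1
    else:
        # Assumed E4M3 convention: the top exponent code is usable, and only
        # the all-ones mantissa under it is taken for NaN.
        max_exp_field = 2 ** exp_bits - 1
        max_mantissa = 2 ** man_bits - 2
    return 2.0 ** (max_exp_field - bias) * (1 + max_mantissa / 2 ** man_bits)


if __name__ == "__main__":
    print("E4M3 max:", fp8_max_finite(4, 3, ieee_like=False))  # 448.0
    print("E5M2 max:", fp8_max_finite(5, 2, ieee_like=True))   # 57344.0
```

The trade-off is visible in the numbers: E4M3 spends an extra bit on precision and tops out at 448, while E5M2 gives up one mantissa bit for a much wider range.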

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Dzone - DevOps

APRIL 9, 2025

Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. By abstracting the complexities of infrastructure, AWS enables teams to focus on innovation. Why Combine AI, ML, and Serverless Computing?

Serverless

Serverless Artificial Intelligence Scalability AWS

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview: To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. The following diagram illustrates the solution architecture. At the time of writing, Amazon Nova model fine-tuning is exclusively available in us-east-1.

Case Study

Case Study Artificial Intelligence Study Generative AI

Enter the next phase of Industry 4.0 with edge AI

CIO

DECEMBER 9, 2024

Generally speaking, a healthy application and data architecture is at the heart of successful modernisation. IBM and Amazon Web Services (AWS) have partnered up to make this easier. All kinds of things can be automated. The question is, how should businesses go about modernising their own applications effectively?

Industry

Industry Banking AWS Agile

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Intelligent document processing, translation and summarization, flexible and insightful responses for customer support agents, personalized marketing content, and image and code generation are a few use cases using generative AI that organizations are rolling out in production.

Generative AI

Generative AI Organization Enterprise Artificial Intelligence

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. versions, catering to different programming preferences.

Generative AI

Generative AI Applications AWS Knowledge Base
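
The decorator-based capture described in the excerpt above can be sketched in a few lines. This is a minimal illustration, not the solution from the original post: the `observe` decorator, the in-memory `LOG` list, and the `fake_model` function are hypothetical names, and a real setup would send records to a logging or observability backend instead of a list.

```python
import functools
import time

# Hypothetical in-memory sink standing in for a real logging backend.
LOG = []


def observe(**custom_metadata):
    """Decorator factory that records prompt, output, run time, and metadata."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(prompt, *args, **kwargs):
            start = time.perf_counter()
            output = func(prompt, *args, **kwargs)
            LOG.append({
                "function": func.__name__,
                "input_prompt": prompt,
                "output": output,
                "run_time_s": time.perf_counter() - start,
                "custom": dict(custom_metadata),
            })
            return output
        return wrapper
    return decorator


@observe(team="demo")
def fake_model(prompt):
    # Stand-in for a real model call; it simply upper-cases the prompt.
    return prompt.upper()


if __name__ == "__main__":
    print(fake_model("hello"))
    print(LOG[-1]["input_prompt"], LOG[-1]["custom"])
```

Keeping the capture logic in a decorator means the application function stays unchanged, which is the ease-of-use property the excerpt highlights.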

9 Best AI Tools for Programming Assistance in 2024

The Crazy Programmer

JUNE 14, 2024

Artificial Intelligence (AI) is revolutionizing software development by enhancing productivity, improving code quality, and automating routine tasks. Amazon CodeWhisperer: Amazon CodeWhisperer is a machine learning-powered code suggestion tool from Amazon Web Services (AWS).

Programming

Programming Tools Software Review Artificial Intelligence

Build your multilingual personal calendar assistant with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

JULY 3, 2024

To solve this problem, this post shows you how to apply AWS services such as Amazon Bedrock, AWS Step Functions, and Amazon Simple Email Service (Amazon SES) to build a fully-automated multilingual calendar artificial intelligence (AI) assistant. It lets you orchestrate multiple steps in the pipeline.

AWS

AWS Artificial Intelligence Generative AI Lambda

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

Agents will begin replacing services: Software has evolved from big, monolithic systems running on mainframes, to desktop apps, to distributed, service-based architectures, web applications, and mobile apps. Agents can be more loosely coupled than services, making these architectures more flexible, resilient and smart.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

The failed instance also needs to be isolated and terminated manually, either through the AWS Management Console, AWS Command Line Interface (AWS CLI), or tools like kubectl or eksctl. About the Authors: Anoop Saha is a Sr GTM Specialist at Amazon Web Services (AWS) focusing on generative AI model training and inference.

Training

Training Artificial Intelligence Hardware Systems Review

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock, AWS Health, AWS Step Functions, and other AWS services. Some examples of AWS-sourced operational events include AWS Health events, which are notifications related to AWS service availability, operational issues, or scheduled maintenance that might affect your AWS resources.

Cloud

Cloud AWS Serverless Policies

Insights in implementing production-ready solutions with generative AI

AWS Machine Learning - AI

APRIL 30, 2025

This post explores key insights and lessons learned from AWS customers in Europe, Middle East, and Africa (EMEA) who have successfully navigated this transition, providing a roadmap for others looking to follow suit. Il Sole 24 Ore leveraged its vast internal knowledge with a Retrieval Augmented Generation (RAG) solution powered by AWS.

Generative AI

Generative AI Artificial Intelligence AWS Machine Learning

Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Development Support Program

AWS Machine Learning - AI

JULY 31, 2024

Amazon Web Services (AWS) is committed to supporting the development of cutting-edge generative artificial intelligence (AI) technologies by companies and organizations across the globe. Let’s dive in and explore how these organizations are transforming what’s possible with generative AI on AWS.

Artificial Intelligence

Artificial Intelligence AWS Programming Innovation

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

In this post, we describe the development journey of the generative AI companion for Mozart, the data, the architecture, and the evaluation of the pipeline. The following diagram illustrates the solution architecture. You can create a decoupled architecture with reusable components.

Generative AI

Generative AI Technical Review Insurance Policies

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. At AWS, our top priority is safeguarding the security and confidentiality of our customers’ workloads. With the AWS Nitro System, we delivered a first-of-its-kind innovation on behalf of our customers.

Generative AI

Generative AI AWS Artificial Intelligence Infrastructure

Taming the cost of AI: Is FinOps the answer?

CIO

APRIL 1, 2025

As artificial intelligence (AI) services, particularly generative AI (genAI), become increasingly integral to modern enterprises, establishing a robust financial operations (FinOps) strategy is essential. Magesh Kasthuri is a Ph.D. in artificial intelligence and the genetic algorithm.

Technical Review

Technical Review Azure Budget Artificial Intelligence

Deep Vision announces its low-latency AI processor for the edge

TechCrunch

NOVEMBER 16, 2020

Hameed and Qadeer developed Deep Vision’s architecture as part of a Ph.D. “They came up with a very compelling architecture for AI that minimizes data movement within the chip,” Annavajjhala explained. In addition, its software optimizes the overall data flow inside the architecture based on the specific workload.

Weak Development Team

Weak Development Team Hardware Architecture Automotive
