Architecture, AWS and Scalability

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. It stores information such as job ID, status, creation time, and other metadata.

Scalability

Scalability Lambda Generative AI AWS

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

This blog post discusses an end-to-end ML pipeline on AWS SageMaker that leverages serverless computing, event-trigger-based data processing, and external API integrations. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Inteligence AWS Artificial Intelligence

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Semantic routing offers several advantages, such as efficiency gained through fast similarity search in vector databases, and scalability to accommodate a large number of task categories and downstream LLMs. This architecture workflow includes the following steps: A user submits a question through a web or mobile application.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

Add to this the escalating costs of maintaining legacy systems, which often act as bottlenecks for scalability. The latter option had emerged as a compelling solution, offering the promise of enhanced agility, reduced operational costs, and seamless scalability. Scalability. Architecture complexity. Legacy infrastructure.

Serverless

Serverless Architecture Microservices Scalability

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In todays fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. Technology modernization strategy : Evaluate the overall IT landscape through the lens of enterprise architecture and assess IT applications through a 7R framework.

Cloud

Cloud Strategy Architecture Policies

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI. And its modular architecture distributes tasks across multiple agents in parallel, increasing the speed and scalability of migrations.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Without a scalable approach to controlling costs, organizations risk unbudgeted usage and cost overruns. Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Vibe Coding: Shaping the Future of Software

Hacker Earth Developers Blog

APRIL 16, 2025

It is important for us to rethink our role as developers and focus on architecture and system design rather than simply on typing code. AI-powered coding tools like GitHub Copilot and AWS’s Q Developer have demonstrated significant productivity gains. The Promise and the Pitfalls I have experienced both sides of vibe coding.

Software

Software Architecture System Design System Architecture

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure. The CCoE implemented AWS Organizations across a substantial number of business units.

Generative AI

Generative AI Government Technical Review Innovation

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. An S3 bucket prepared to store the custom model.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Dzone - DevOps

APRIL 9, 2025

Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. By abstracting the complexities of infrastructure, AWS enables teams to focus on innovation. Why Combine AI, ML, and Serverless Computing?

Serverless

Serverless Artificial Inteligence Scalability AWS

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

CIO

APRIL 15, 2025

Explaining further how Googles strategy differs from rivals, such as AWS and Microsoft, Hinchcliffe said, where Microsoft is optimizing for AI as UX layer and AWS is anchoring on primitives, Google is carving out the middle ground a developer-ready but enterprise-scalable agentic architecture.

Cloud

Cloud Innovation Artificial Inteligence Google Cloud

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. Nemotron-4 15B, with its impressive 15-billion-parameter architecture trained on 8 trillion text tokens, brings powerful multilingual and coding capabilities to the Amazon Bedrock. About the authors James Park is a Solutions Architect at Amazon Web Services.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. The following diagram illustrates the solution architecture.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

This surge is driven by the rapid expansion of cloud computing and artificial intelligence, both of which are reshaping industries and enabling unprecedented scalability and innovation. The result was a compromised availability architecture. Global IT spending is expected to soar in 2025, gaining 9% according to recent estimates.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Large organizations often have many business units with multiple lines of business (LOBs), with a central governing entity, and typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. In this post, we evaluate different generative AI operating model architectures that could be adopted.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Creating asynchronous AI agents with Amazon Bedrock

AWS Machine Learning - AI

MARCH 13, 2025

This post will discuss agentic AI driven architecture and ways of implementing. Agentic AI architecture Agentic AI architecture is a shift in process automation through autonomous agents towards the capabilities of AI, with the purpose of imitating cognitive abilities and enhancing the actions of traditional autonomous agents.

Artificial Inteligence

Artificial Inteligence Lambda Travel Generative AI

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. Amazon Bedrocks broad choice of FMs from leading AI companies, along with its scalability and security features, made it an ideal solution for MaestroQA.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

IaC enables developers to define infrastructure configurations using code, ensuring consistency, automation, and scalability. AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. Why Use AWS CloudFormation? Example: 3. Example: 4.

AWS

AWS Software Review Systems Review Policies

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. The following diagram illustrates the solution architecture. AWS CDK CLI, follow instructions here.

AWS

AWS Generative AI Linux Groups

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. versions, catering to different programming preferences.

Generative AI

Generative AI Applications AWS Knowledge Base

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. The following diagram illustrates the solution architecture. At the time of writing, Amazon Nova model fine-tuning is exclusively available in us-east-1.

Case Study

Case Study Artificial Inteligence Study Generative AI

Marsh McLennan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider. The biggest challenge is data.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

As DPG Media grows, they need a more scalable way of capturing metadata that enhances the consumer experience on online video services and aids in understanding key content characteristics. Irina Radu is a Prototyping Engagement Manager, part of AWS EMEA Prototyping and Cloud Engineering.

Media

Media Video Artificial Inteligence Generative AI

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

Agents will begin replacing services Software has evolved from big, monolithic systems running on mainframes, to desktop apps, to distributed, service-based architectures, web applications, and mobile apps. Agents can be more loosely coupled than services, making these architectures more flexible, resilient and smart.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Insights in implementing production-ready solutions with generative AI

AWS Machine Learning - AI

APRIL 30, 2025

This post explores key insights and lessons learned from AWS customers in Europe, Middle East, and Africa (EMEA) who have successfully navigated this transition, providing a roadmap for others looking to follow suit. Il Sole 24 Ore leveraged its vast internal knowledge with a Retrieval Augmented Generation (RAG) solution powered by AWS.

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

In the current digital environment, migration to the cloud has emerged as an essential tactic for companies aiming to boost scalability, enhance operational efficiency, and reinforce resilience. Get AWS developers A step-by-step AWS migration checklist Mobilunity helps hiring dedicated development teams to businesses worldwide for 14+ years.

AWS

AWS Cloud Weak Development Team DevOps

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Users can access these AI capabilities through their organizations single sign-on (SSO), collaborate with team members, and refine AI applications without needing AWS Management Console access. Before we dive deep into the deployment of the AI agent, lets walk through the key steps of the architecture, as shown in the following diagram.

Generative AI

Generative AI Systems Review System Lambda

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

With Amazon Bedrock Data Automation, enterprises can accelerate AI adoption and develop solutions that are secure, scalable, and responsible. Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. For example, a request made in the US stays within Regions in the US.

Data

Data Generative AI Artificial Inteligence Compliance

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen , and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The following diagram illustrates the solution architecture.

Education

Education Report Technical Review Generative AI

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

This challenge is further compounded by concerns over scalability and cost-effectiveness. Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. The following diagram is the solution architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Cloud modernization has become a prominent topic for organizations, and AWS plays a crucial role in helping them modernize their IT infrastructure, applications, and services. Overall, discussions on AWS modernization are focused on security, faster releases, efficiency, and steps towards GenAI and improved innovation.

AWS

AWS Strategy Serverless Microservices

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Cloudera

JANUARY 7, 2025

A prominent public health organization integrated data from multiple regional health entities within a hybrid multi-cloud environment (AWS, Azure, and on-premise). A leading meal kit provider migrated its data architecture to Cloudera on AWS, utilizing Cloudera’s Open Data Lakehouse capabilities.

Cloud

Cloud Data Scalability Compliance

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock , Amazon SageMaker , Amazon Redshift , Amazon Glue , Amazon Athena , and Amazon Managed Workflows for Apache Airflow (MWAA) , into a comprehensive data and AI development platform. Navigate to the AWS Secrets Manager console and find the secret -api-keys.

Generative AI

Generative AI Applications Technical Review Software Review

Accelerate AWS Well-Architected reviews with Generative AI

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Webinars

Trending Sources

Building a Scalable ML Pipeline and API in AWS

Webinars

Build and deploy a UI for your generative AI applications with AWS and Python

Multi-LLM routing strategies for generative AI applications on AWS

Can serverless fix fintech’s scaling problem?

Introducing AWS MCP Servers for code assistants (Part 1)

Build a multi-tenant generative AI environment for your enterprise on AWS

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

How AWS sales uses Amazon Q Business for customer engagement

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AI in action: Stories of how enterprises are transforming and modernizing

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Vibe Coding: Shaping the Future of Software

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Why GreenOps will succeed where FinOps is failing

Generative AI operating models in enterprise organizations with Amazon Bedrock

Creating asynchronous AI agents with Amazon Bedrock

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Best Practices for IaC using AWS CloudFormation

Getting started with computer use in Amazon Bedrock Agents

Enable Amazon Bedrock cross-Region inference in multi-account environments

Empower your generative AI application with a comprehensive custom observability solution

Model customization, RAG, or both: A case study with Amazon Nova

Marsh McLennan IT reorg lays foundation for gen AI

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

12 AI predictions for 2025

Insights in implementing production-ready solutions with generative AI

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

How BQA streamlines education quality reporting using Amazon Bedrock

Host concurrent LLMs with LoRAX

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Stay Connected