Architecture, AWS and Resources

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Monitoring AWS Container Environments at Scale

Advertiser: Datadog

Particularly well-suited for microservice-oriented architectures and agile workflows, containers help organizations improve developer efficiency, feature velocity, and optimization of resources. Containers power many of the applications we use every day.

AWS

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Such a virtual assistant should support users across various business functions, such as finance, legal, human resources, and operations. This architecture workflow includes the following steps: A user submits a question through a web or mobile application. The architecture of this system is illustrated in the following figure.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

Manually managing such complexity can often be counter-productive and take away valuable resources from your businesses AI development. To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. You can also customize your distributed training. The Neuron 2.20

AWS

AWS Artificial Inteligence Generative AI Training

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning - AI

MAY 2, 2025

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.

Generative AI

Generative AI AWS Lambda Authentication

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

The built-in elasticity in serverless computing architecture makes it particularly appealing for unpredictable workloads and amplifies developers productivity by letting developers focus on writing code and optimizing application design industry benchmarks , providing additional justification for this hypothesis. Architecture complexity.

Serverless

Serverless Architecture Microservices Scalability

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

The following diagram illustrates the solution architecture: The steps of the solution include: Upload data to Amazon S3 : Store the product images in Amazon Simple Storage Service (Amazon S3). The AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3.

AWS

AWS Engineering Serverless eCommerce

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Although tagging is supported on a variety of Amazon Bedrock resources —including provisioned models, custom models, agents and agent aliases, model evaluations, prompts, prompt flows, knowledge bases, batch inference jobs, custom model jobs, and model duplication jobs—there was previously no capability for tagging on-demand foundation models.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

It prevents vendor lock-in, gives a lever for strong negotiation, enables business flexibility in strategy execution owing to complicated architecture or regional limitations in terms of security and legal compliance if and when they rise and promotes portability from an application architecture perspective.

Cloud

Cloud Strategy Architecture Policies

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. Using a client-server architecture (as illustrated in the following screenshot), MCP helps developers expose their data through lightweight MCP servers while building AI applications as MCP clients that connect to these servers.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Trade routes of the digital age: How data gravity shapes cloud strategy

CIO

APRIL 15, 2025

Just as ancient trade routes determined how and where commerce flowed, applications and computing resources today gravitate towards massive datasets. However, as companies expand their operations and adopt multi-cloud architectures, they are faced with an invisible but powerful challenge: Data gravity.

Strategy

Strategy Cloud Data Technical Review

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Cloudera

DECEMBER 2, 2024

Cloudera is committed to providing the most optimal architecture for data processing, advanced analytics, and AI while advancing our customers’ cloud journeys. Together, Cloudera and AWS empower businesses to optimize performance for data processing, analytics, and AI while minimizing their resource consumption and carbon footprint.

Sustainability

Sustainability AWS Analytics Infrastructure

Vibe Coding: Shaping the Future of Software

Hacker Earth Developers Blog

APRIL 16, 2025

It is important for us to rethink our role as developers and focus on architecture and system design rather than simply on typing code. AI-powered coding tools like GitHub Copilot and AWS’s Q Developer have demonstrated significant productivity gains. The Promise and the Pitfalls I have experienced both sides of vibe coding.

Software

Software Architecture System Design System Architecture

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. The following diagram provides a detailed view of the architecture to enhance email support using generative AI.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. versions, catering to different programming preferences.

Generative AI

Generative AI Applications AWS Knowledge Base

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

However, the rapid pace of growth also highlights the urgent need for more sustainable and efficient resource management practices. The result was a compromised availability architecture. Many organizations have turned to FinOps practices to regain control over these escalating costs.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

Whether processing invoices, updating customer records, or managing human resource (HR) documents, these workflows often require employees to manually transfer information between different systems a process thats time-consuming, error-prone, and difficult to scale. The following diagram illustrates the solution architecture.

AWS

AWS Generative AI Linux Groups

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

However, Cloud Center of Excellence (CCoE) teams often can be perceived as bottlenecks to organizational transformation due to limited resources and overwhelming demand for their support. The CCoE implemented AWS Organizations across a substantial number of business units.

Generative AI

Generative AI Government Technical Review Innovation

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

However, training and deploying such models from scratch is a complex and resource-intensive process, often requiring specialized expertise and significant computational resources. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more. We walk through a Python example in this post.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

TechCrunch

MARCH 9, 2022

As organizations continue to build out their digital architecture, a new category of enterprise software has emerged to help them manage that process. “Enterprise architecture today is very much about the scaffolding in the organization,” he said.

Architecture

Architecture Enterprise Network Organization

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. MuleSoft from Salesforce provides the Anypoint platform that gives IT the tools to automate everything.

Generative AI

Generative AI AWS Innovation Knowledge Base

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

CIO

APRIL 15, 2025

With the new TPU, Google said it wants to offer better performance per watt per dollar than any TPUs it released earlier and this is welcome news for CIOs as they often have to do more with less or constrained resources.

Cloud

Cloud Innovation Artificial Inteligence Google Cloud

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Large organizations often have many business units with multiple lines of business (LOBs), with a central governing entity, and typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. In this post, we evaluate different generative AI operating model architectures that could be adopted.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

With a vast array of services and resource footprints spanning hundreds of accounts, organizations can face an overwhelming volume of operational events occurring daily, making manual administration impractical. It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services.

Cloud

Cloud AWS Serverless Policies

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock , Amazon SageMaker , Amazon Redshift , Amazon Glue , Amazon Athena , and Amazon Managed Workflows for Apache Airflow (MWAA) , into a comprehensive data and AI development platform. Navigate to the AWS Secrets Manager console and find the secret -api-keys.

Generative AI

Generative AI Applications Technical Review Software Review

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. This blog explores the best practices for utilizing AWS CloudFormation to achieve reliable, secure, and efficient infrastructure management. Why Use AWS CloudFormation?

AWS

AWS Software Review Systems Review Policies

AWS launches no-code service AppFabric with generative AI assistance

CIO

JUNE 28, 2023

Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric, which is available across AWS’ US East (N.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

Once a failure occurs, time (idle GPUs) is spent on detecting (MTD), replacing (MTT Replace), and continuing (MTR Restart) a training run, often wasting time and expensive resources. Summary Training frontier models is a complex, resource-intensive process that is particularly vulnerable to hardware failures.

Training

Training Artificial Inteligence Hardware Systems Review

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

First, it can lead to lower costs to convergence, allowing for more efficient use of resources during the training process. In a transformer architecture, such layers are the embedding layers and the multilayer perceptron (MLP) layers. and prior Llama models) and Mistral model architectures for context parallelism.

Training

Training Artificial Inteligence AWS Machine Learning

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Similarly, organizations are fine-tuning generative AI models for domains such as finance, sales, marketing, travel, IT, human resources (HR), procurement, healthcare and life sciences, and customer service. Why LoRAX for LoRA deployment on AWS? This challenge is further compounded by concerns over scalability and cost-effectiveness.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Accelerate AWS Well-Architected reviews with Generative AI

Building Resilient Public Networking on AWS: Part 4

Webinars

Trending Sources

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Monitoring AWS Container Environments at Scale

Build a multi-tenant generative AI environment for your enterprise on AWS

Introducing AWS MCP Servers for code assistants (Part 1)

Multi-LLM routing strategies for generative AI applications on AWS

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Can serverless fix fintech’s scaling problem?

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Trade routes of the digital age: How data gravity shapes cloud strategy

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Vibe Coding: Shaping the Future of Software

Enable Amazon Bedrock cross-Region inference in multi-account environments

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Empower your generative AI application with a comprehensive custom observability solution

Why GreenOps will succeed where FinOps is failing

Getting started with computer use in Amazon Bedrock Agents

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Integrate foundation models into your code with Amazon Bedrock

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

Generative AI operating models in enterprise organizations with Amazon Bedrock

Boost productivity by using AI in cloud operational health management

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Best Practices for IaC using AWS CloudFormation

AWS launches no-code service AppFabric with generative AI assistance

Reduce ML training costs with Amazon SageMaker HyperPod

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Host concurrent LLMs with LoRAX

Stay Connected