Architecture, AWS and Engineering

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

A reverse image search engine enables users to upload an image to find related information instead of using text-based queries. Solution overview The solution outlines how to build a reverse image search engine to retrieve similar images based on input image queries. Amazon Titan Multimodal Embeddings model access in Amazon Bedrock.

AWS

AWS Engineering Serverless eCommerce

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

What is a cloud architect? A vital role for success in the cloud

CIO

APRIL 30, 2025

Cloud architects are responsible for managing the cloud computing architecture in an organization, especially as cloud technologies grow increasingly complex. At organizations that have already completed their cloud adoption, cloud architects help maintain, oversee, troubleshoot, and optimize cloud architecture over time.

Cloud

Cloud AWS Azure Disaster Recovery

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Implementation of dynamic routing In this section, we explore different approaches to implementing dynamic routing on AWS, covering both built-in routing features and custom solutions that you can use as a starting point to build your own. The architecture of this system is illustrated in the following figure. 70B and 8B.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This post presents a solution where you can upload a recording of your meeting (a feature available in most modern digital communication services such as Amazon Chime ) to a centralized video insights and summarization engine. Solution overview The following diagram illustrates the pipeline for the video insights and summarization engine.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

By modern, I refer to an engineering-driven methodology that fully capitalizes on automation and software engineering best practices. The proposed model illustrates the data management practice through five functional pillars: Data platform; data engineering; analytics and reporting; data science and AI; and data governance.

Data

Data Technical Review Software Review Weak Development Team

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. You can also customize your distributed training.

AWS

AWS Artificial Inteligence Generative AI Training

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning - AI

MAY 2, 2025

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.

Generative AI

Generative AI AWS Lambda Authentication

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

The built-in elasticity in serverless computing architecture makes it particularly appealing for unpredictable workloads and amplifies developers productivity by letting developers focus on writing code and optimizing application design industry benchmarks , providing additional justification for this hypothesis. Architecture complexity.

Serverless

Serverless Architecture Microservices Scalability

Vibe Coding: Shaping the Future of Software

Hacker Earth Developers Blog

APRIL 16, 2025

It is important for us to rethink our role as developers and focus on architecture and system design rather than simply on typing code. AI-powered coding tools like GitHub Copilot and AWS’s Q Developer have demonstrated significant productivity gains. The Promise and the Pitfalls I have experienced both sides of vibe coding.

Software

Software Architecture System Design System Architecture

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

It prevents vendor lock-in, gives a lever for strong negotiation, enables business flexibility in strategy execution owing to complicated architecture or regional limitations in terms of security and legal compliance if and when they rise and promotes portability from an application architecture perspective.

Cloud

Cloud Strategy Architecture Policies

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Cloudera

DECEMBER 2, 2024

Cloudera is committed to providing the most optimal architecture for data processing, advanced analytics, and AI while advancing our customers’ cloud journeys. Together, Cloudera and AWS empower businesses to optimize performance for data processing, analytics, and AI while minimizing their resource consumption and carbon footprint.

Sustainability

Sustainability AWS Analytics Infrastructure

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

However, without a significant commitment from architects and engineers to design more efficient systems, shut down or resize underutilized resources, deploy autoscaling or adopt other cost optimization methods, many efforts fail to achieve meaningful impact. The result was a compromised availability architecture. Standardized metrics.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

IT leaders: What’s the gameplan as tech badly outpaces talent?

CIO

MARCH 13, 2025

Hes seeing the need for professionals who can not only navigate the technology itself, but also manage increasing complexities around its surrounding architectures, data sets, infrastructure, applications, and overall security. The talent shortage is particularly acute in two key areas, says Arun Chandrasekaran at Gartner.

Part-Time VPE

Part-Time VPE Weak Development Team Fractional VPE Fractional CTO

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business can increase productivity across diverse teams, including developers, architects, site reliability engineers (SREs), and product managers. Enterprises provide their developers, engineers, and architects with a range of knowledge bases and documents, such as usage guides, wikis, and tools.

Generative AI

Generative AI AWS Innovation Knowledge Base

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. Using a client-server architecture (as illustrated in the following screenshot), MCP helps developers expose their data through lightweight MCP servers while building AI applications as MCP clients that connect to these servers.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Harness the power of MCP servers with Amazon Bedrock Agents

AWS Machine Learning - AI

APRIL 1, 2025

invoke(input_text=Convert 11am from NYC time to London time) We showcase an example of building an agent to understand your Amazon Web Service (AWS) spend by connecting to AWS Cost Explorer , Amazon CloudWatch , and Perplexity AI through MCP. This gives you an AI agent that can transform the way you manage your AWS spend.

Generative AI

Generative AI AWS Artificial Inteligence Software Review

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. The following diagram illustrates the solution architecture. AWS CDK CLI, follow instructions here.

AWS

AWS Generative AI Linux Groups

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. AI-powered email processing engine – Central to the solution, this engine uses AI to analyze and process emails.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

TechCrunch

MARCH 9, 2022

As organizations continue to build out their digital architecture, a new category of enterprise software has emerged to help them manage that process. “Enterprise architecture today is very much about the scaffolding in the organization,” he said.

Architecture

Architecture Enterprise Network Organization

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The general architecture of the metadata pipeline consists of two primary steps: Generate transcriptions of audio tracks: use speech recognition models to generate accurate transcripts of the audio content. About the Authors Lucas Desard is GenAI Engineer at DPG Media.

Media

Media Video Artificial Inteligence Generative AI

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. MaestroQAs existing rules engine couldnt always answer these types of queries because end-users could ask for the same outcome in many different ways.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. To address this, the team implemented a chat assistant using Amazon Q Business.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

Marsh McLennan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

Prerequisites Before you dive into the integration process, make sure you have the following prerequisites in place: AWS account – You’ll need an AWS account to access and use Amazon Bedrock. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure. The CCoE implemented AWS Organizations across a substantial number of business units. About the Authors Steven Craig is a Sr. Director, Cloud Center of Excellence.

Generative AI

Generative AI Government Technical Review Innovation

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

Each hardware failure can result in wasted GPU hours and requires valuable engineering time to identify and resolve the issue, making the system prone to downtime that can disrupt progress and delay completion. Each failure incurs engineering effort to identify its root cause.

Training

Training Artificial Inteligence Hardware Systems Review

Why Every Engineering Team Should Embrace AWS Graviton4

Honeycomb

JULY 9, 2024

Two years ago, we shared our experiences with adopting AWS Graviton3 and our enthusiasm for the future of AWS Graviton and Arm. Once again, we’re privileged to share our experiences as a launch customer of the Amazon EC2 R8g instances powered by AWS Graviton4, the newest generation of AWS Graviton processors.

AWS

AWS Engineering Metrics Network

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

In a transformer architecture, such layers are the embedding layers and the multilayer perceptron (MLP) layers. Supported models SMP supports context parallelism using NVIDIA Transformer Engine , and it seamlessly integrates with other model parallelism techniques Fully Sharded Data Parallel and Tensor Parallelism.

Training

Training Artificial Inteligence AWS Machine Learning

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. versions, catering to different programming preferences.

Generative AI

Generative AI Applications AWS Knowledge Base

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning - AI

APRIL 30, 2025

This advancement makes sophisticated agent architectures more accessible and economically viable across a broader range of applications and scales of deployment. We recommend referring to the Submit a model distillation job in Amazon Bedrock in the official AWS documentation for the most up-to-date and comprehensive information.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

What is enterprise architecture? A framework for transformation

CIO

NOVEMBER 23, 2022

Enterprise architecture definition Enterprise architecture (EA) is the practice of analyzing, designing, planning, and implementing enterprise analysis to successfully execute on business strategies. Making it easier to evaluate existing architecture against long-term goals.

Architecture

Architecture Enterprise Agile Artificial Inteligence

Creating asynchronous AI agents with Amazon Bedrock

AWS Machine Learning - AI

MARCH 13, 2025

This post will discuss agentic AI driven architecture and ways of implementing. Agentic AI architecture Agentic AI architecture is a shift in process automation through autonomous agents towards the capabilities of AI, with the purpose of imitating cognitive abilities and enhancing the actions of traditional autonomous agents.

Artificial Inteligence

Artificial Inteligence Lambda Travel Generative AI

What is a data engineer? An analytics role in high demand

CIO

AUGUST 9, 2022

What is a data engineer? Data engineers design, build, and optimize systems for data collection, storage, access, and analytics at scale. Data engineers also need communication skills to work across departments and to understand what business leaders want to gain from the company’s large datasets. The data engineer role.

Data Engineering

Data Engineering Analytics Engineering Data

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. Need to hire skilled engineers? AWS migration isnt just about moving data; it requires careful planning and execution.

AWS

AWS Cloud Weak Development Team DevOps

Accelerate AWS Well-Architected reviews with Generative AI

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Webinars

Trending Sources

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

What is a cloud architect? A vital role for success in the cloud

Multi-LLM routing strategies for generative AI applications on AWS

Introducing AWS MCP Servers for code assistants (Part 1)

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Build a multi-tenant generative AI environment for your enterprise on AWS

How AWS sales uses Amazon Q Business for customer engagement

The future of data: A 5-pillar approach to modern data management

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Can serverless fix fintech’s scaling problem?

Vibe Coding: Shaping the Future of Software

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

Why GreenOps will succeed where FinOps is failing

IT leaders: What’s the gameplan as tech badly outpaces talent?

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Harness the power of MCP servers with Amazon Bedrock Agents

Getting started with computer use in Amazon Bedrock Agents

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

Marsh McLennan IT reorg lays foundation for gen AI

Integrate foundation models into your code with Amazon Bedrock

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Reduce ML training costs with Amazon SageMaker HyperPod

Why Every Engineering Team Should Embrace AWS Graviton4

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Empower your generative AI application with a comprehensive custom observability solution

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

What is enterprise architecture? A framework for transformation

Creating asynchronous AI agents with Amazon Bedrock

What is a data engineer? An analytics role in high demand

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Stay Connected