AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture ensures the Python application is hosted and accessible from the internet to authenticated users.
To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.
It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. The generative AI playground is a UI provided to tenants where they can run their one-time experiments, chat with several FMs, and manually test capabilities such as guardrails or model evaluation for exploration purposes.
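As a rough illustration of how such a playground backend could wire these services together, the sketch below shows a Lambda handler behind API Gateway forwarding a user prompt to an Amazon Bedrock model; the model ID and the request/response shapes are assumptions for the example, not details from the original post.

```python
# Minimal sketch (not the original post's implementation): a Lambda handler
# behind API Gateway that forwards a user prompt to an Amazon Bedrock model.
# The model ID and the event/response shapes are illustrative assumptions.
import json
import boto3

bedrock_runtime = boto3.client("bedrock-runtime")

def lambda_handler(event, context):
    # API Gateway proxy integration delivers the request body as a JSON string.
    body = json.loads(event.get("body", "{}"))
    prompt = body.get("prompt", "")

    response = bedrock_runtime.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # assumed model ID
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    answer = response["output"]["message"]["content"][0]["text"]

    return {"statusCode": 200, "body": json.dumps({"answer": answer})}
```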
Cloud architects are responsible for managing the cloud computing architecture in an organization, especially as cloud technologies grow increasingly complex. At organizations that have already completed their cloud adoption, cloud architects help maintain, oversee, troubleshoot, and optimize cloud architecture over time.
Careful model selection, fine-tuning, configuration, and testing might be necessary to balance the impact of latency and cost with the desired classification accuracy. This architecture workflow includes the following steps: A user submits a question through a web or mobile application.
Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock, an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.
Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases, including AWS-specific knowledge search: with Amazon Q Business, we've made internal data sources as well as public AWS content available in Field Advisor's index.
David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.
With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.
The built-in elasticity of serverless computing architecture makes it particularly appealing for unpredictable workloads and amplifies developer productivity by letting developers focus on writing code and optimizing application design; industry benchmarks provide additional justification for this hypothesis.
Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.
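A minimal sketch of that subnet step, assuming the parent VPC has already been extended (opted in) to the Local Zone; the VPC ID, CIDR block, and Local Zone name are placeholders.

```python
# Hypothetical sketch: creating a subnet inside a Local Zone once the parent
# VPC has been extended to it. The VPC ID, CIDR block, and Local Zone name
# (us-east-1-bos-1a) are placeholder assumptions.
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

subnet = ec2.create_subnet(
    VpcId="vpc-0123456789abcdef0",        # existing VPC in the parent Region
    CidrBlock="10.0.8.0/24",               # CIDR carved out for the edge subnet
    AvailabilityZone="us-east-1-bos-1a",   # Local Zone name, not a standard AZ
)
print(subnet["Subnet"]["SubnetId"])
```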
AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.
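As a hedged sketch of that document-grounded chat pattern, the example below queries an Amazon Bedrock knowledge base with retrieve_and_generate; the knowledge base ID and model ARN are placeholder assumptions.

```python
# Rough sketch: answer a question from a customer's documents with a Bedrock
# knowledge base. The knowledge base ID and model ARN are placeholders.
import boto3

agent_runtime = boto3.client("bedrock-agent-runtime")

response = agent_runtime.retrieve_and_generate(
    input={"text": "What is our refund policy?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB123EXAMPLE",  # assumed knowledge base ID
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-haiku-20240307-v1:0",
        },
    },
)
print(response["output"]["text"])
```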
It is important for us to rethink our role as developers and focus on architecture and system design rather than simply on typing code. Teams have been able to test new ideas and validate concepts much faster. AI-powered coding tools like GitHub Copilot and Amazon Q Developer have demonstrated significant productivity gains.
At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. Red teaming is a methodology used to test and evaluate systems by simulating real-world adversarial conditions.
In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. Adjust the inference parameters as needed and write your test prompt.
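The snippet below is an illustrative sketch of invoking a model imported through Amazon Bedrock Custom Model Import and adjusting its inference parameters; the model ARN is a placeholder and the body fields are assumed to follow a Llama-style native inference schema, which may differ for your import.

```python
# Illustrative sketch only: invoking a distilled model imported through
# Amazon Bedrock Custom Model Import. The model ARN is a placeholder, and the
# body fields (prompt, max_gen_len, temperature, top_p) are assumed to match
# the imported model's native inference schema.
import json
import boto3

bedrock_runtime = boto3.client("bedrock-runtime")

response = bedrock_runtime.invoke_model(
    modelId="arn:aws:bedrock:us-east-1:111122223333:imported-model/abc123",  # placeholder ARN
    body=json.dumps(
        {
            "prompt": "Explain the difference between RAG and fine-tuning.",
            "max_gen_len": 512,   # assumed parameter name
            "temperature": 0.6,   # adjust inference parameters as needed
            "top_p": 0.9,
        }
    ),
    contentType="application/json",
    accept="application/json",
)
print(json.loads(response["body"].read()))
```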
Code Harbor automates current-state assessment, code transformation and optimization, as well as code testing and validation by relying on task-specific, fine-tuned AI agents. Its modular architecture distributes tasks across multiple agents in parallel, increasing the speed and scalability of migrations.
Seamless integration of latest foundation models (FMs), Prompts, Agents, Knowledge Bases, Guardrails, and other AWS services. Reduced time and effort in testing and deploying AI workflows with SDK APIs and serverless infrastructure. Test your Flows with the implemented guardrails by entering a prompt in the Test Flow.
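For readers who prefer testing outside the console's Test Flow pane, a hedged sketch of invoking a flow programmatically might look like the following; the flow and alias identifiers, input node name, and prompt are assumptions.

```python
# Hedged sketch: invoke a Bedrock flow from code instead of the console's
# Test Flow pane. Flow/alias IDs and node names are placeholder assumptions.
import boto3

agent_runtime = boto3.client("bedrock-agent-runtime")

response = agent_runtime.invoke_flow(
    flowIdentifier="FLOW123EXAMPLE",        # assumed flow ID
    flowAliasIdentifier="ALIAS123EXAMPLE",  # assumed flow alias ID
    inputs=[
        {
            "nodeName": "FlowInputNode",
            "nodeOutputName": "document",
            "content": {"document": "Summarize our Q3 support metrics."},
        }
    ],
)
# The result arrives as an event stream; print the output node events.
for event in response["responseStream"]:
    if "flowOutputEvent" in event:
        print(event["flowOutputEvent"]["content"]["document"])
```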
Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. The following diagram provides a detailed view of the architecture to enhance email support using generative AI.
It prevents vendor lock-in, provides leverage for stronger negotiation, enables business flexibility in strategy execution when complications from architecture or regional limitations around security and legal compliance arise, and promotes portability from an application architecture perspective.
invoke(input_text="Convert 11am from NYC time to London time") We showcase an example of building an agent to understand your Amazon Web Services (AWS) spend by connecting to AWS Cost Explorer, Amazon CloudWatch, and Perplexity AI through MCP. This gives you an AI agent that can transform the way you manage your AWS spend.
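As a minimal sketch of the kind of Cost Explorer query such an agent could expose as an MCP tool (the date range and grouping are illustrative, not taken from the original example):

```python
# Minimal sketch: a Cost Explorer query an AWS-spend agent could wrap as a
# tool. Date range and grouping are illustrative.
import boto3

ce = boto3.client("ce")

result = ce.get_cost_and_usage(
    TimePeriod={"Start": "2025-01-01", "End": "2025-02-01"},
    Granularity="MONTHLY",
    Metrics=["UnblendedCost"],
    GroupBy=[{"Type": "DIMENSION", "Key": "SERVICE"}],
)
for group in result["ResultsByTime"][0]["Groups"]:
    service = group["Keys"][0]
    amount = group["Metrics"]["UnblendedCost"]["Amount"]
    print(f"{service}: ${float(amount):.2f}")
```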
The company says it can achieve PhD-level performance in challenging benchmark tests in physics, chemistry, and biology. He expects the same to happen in all areas of software development, starting with user requirements research through project management and all the way to testing and quality assurance.
Amazon Bedrock's cross-Region inference capability provides organizations with the flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.
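A hedged example of routing a request through a cross-Region inference profile is shown below; the "us." profile ID is an assumption, since available profile IDs vary by account and Region.

```python
# Hedged example: call a model through a cross-Region inference profile.
# The "us." profile ID is an assumption; check which profiles your account
# and Region actually expose.
import boto3

bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock_runtime.converse(
    modelId="us.anthropic.claude-3-5-sonnet-20240620-v1:0",  # inference profile ID (assumed)
    messages=[{"role": "user", "content": [{"text": "Hello from a cross-Region profile."}]}],
)
print(response["output"]["message"]["content"][0]["text"])
```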
However, as companies expand their operations and adopt multi-cloud architectures, they are faced with an invisible but powerful challenge: Data gravity. Instead of fighting against data gravity, organizations should design architectures that leverage their strengths while mitigating their risks.
Organizations must decide on their hosting provider, whether it be an on-prem setup, cloud solutions like AWS, GCP, Azure or specialized data platform providers such as Snowflake and Databricks. Not my original quote, but a cardinal sin of cloud-native data architecture is copying data from one location to another.
Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.
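A sketch of how that tagging could be applied, assuming the application inference profile API is used to carry the cost allocation tags; the profile name, model ARN, and tag values are placeholders.

```python
# Sketch (names and ARNs are placeholders): attach cost allocation tags to
# Bedrock usage by creating a tagged application inference profile for a
# foundation model, then invoking the model through that profile's ARN.
import boto3

bedrock = boto3.client("bedrock")

profile = bedrock.create_inference_profile(
    inferenceProfileName="claims-dept-claude-haiku",  # assumed name
    modelSource={
        "copyFrom": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-haiku-20240307-v1:0"
    },
    tags=[
        {"key": "CostCenter", "value": "claims"},
        {"key": "BusinessUnit", "value": "insurance"},
    ],
)
print(profile["inferenceProfileArn"])  # pass this ARN as modelId at invocation time
```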
Capital One built Cloud Custodian initially to address the issue of dev/test systems left running with little utilization. The result was a compromised availability architecture. On average, financial services clients we've worked with on cloud migration have had cloud bills 2-3 times the original expectations.
The general architecture of the metadata pipeline consists of two primary steps: Generate transcriptions of audio tracks: use speech recognition models to generate accurate transcripts of the audio content. Irina Radu is a Prototyping Engagement Manager, part of AWS EMEA Prototyping and Cloud Engineering.
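A rough sketch of that transcription step using Amazon Transcribe follows; the bucket, object key, and job name are placeholder assumptions, and the original pipeline may use a different speech recognition model.

```python
# Rough sketch of the transcription step. Bucket, key, and job name are
# placeholder assumptions; the original pipeline may use a different model.
import boto3

transcribe = boto3.client("transcribe")

transcribe.start_transcription_job(
    TranscriptionJobName="episode-042-audio",  # assumed job name
    Media={"MediaFileUri": "s3://example-bucket/audio/episode-042.mp3"},
    MediaFormat="mp3",
    LanguageCode="en-US",
    OutputBucketName="example-bucket",  # transcript JSON is written here
)
```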
Yesterday I attended the AWS Summit 2025 in Amsterdam where I joined a session about AWS Step Functions hosted by Adriaan de Jonge, a former Xebia colleague. I summarized my key takeaways that can help you improve your serverless architectures.
Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric, which is available across AWS’ US East (N.
It uses Amazon Bedrock, AWS Health, AWS Step Functions, and other AWS services. Some examples of AWS-sourced operational events include: AWS Health events — Notifications related to AWS service availability, operational issues, or scheduled maintenance that might affect your AWS resources.
AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. This blog explores the best practices for utilizing AWS CloudFormation to achieve reliable, secure, and efficient infrastructure management. Why Use AWS CloudFormation?
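As a minimal, hypothetical illustration of CloudFormation-driven IaC, the sketch below creates a stack from an inline template containing a single S3 bucket; the stack and bucket names are placeholders.

```python
# Minimal, hypothetical example of driving CloudFormation from code: the
# inline template below creates only an S3 bucket. Stack and bucket names
# are placeholders (bucket names must be globally unique).
import json
import boto3

TEMPLATE = {
    "AWSTemplateFormatVersion": "2010-09-09",
    "Resources": {
        "ArtifactBucket": {
            "Type": "AWS::S3::Bucket",
            "Properties": {"BucketName": "example-artifact-bucket-123456"},
        }
    },
}

cloudformation = boto3.client("cloudformation")
cloudformation.create_stack(
    StackName="example-iac-stack",
    TemplateBody=json.dumps(TEMPLATE),
)
# Wait until the stack finishes creating before using its resources.
waiter = cloudformation.get_waiter("stack_create_complete")
waiter.wait(StackName="example-iac-stack")
```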
The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment: execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. The following diagram illustrates the solution architecture. To set up the AWS CDK CLI, follow the instructions here.
For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isn't just about moving data; it requires careful planning and execution. Need to hire skilled engineers?
This advancement makes sophisticated agent architectures more accessible and economically viable across a broader range of applications and scales of deployment. We recommend referring to Submit a model distillation job in Amazon Bedrock in the official AWS documentation for the most up-to-date and comprehensive information.
Solution overview: To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. Our study used Amazon Nova Micro and Amazon Nova Lite as baseline FMs and tested their performance across different configurations.
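To make the baseline setup concrete, here is a hedged sketch of posing one question to Amazon Nova Micro through the Converse API; the question is an illustrative stand-in for the AWS-specific test set, and the model may need to be addressed through an inference profile in some Regions.

```python
# Hedged sketch: ask a baseline model one question from an evaluation set.
# The question is an illustrative stand-in for the AWS-specific test set.
import boto3

bedrock_runtime = boto3.client("bedrock-runtime")

response = bedrock_runtime.converse(
    modelId="amazon.nova-micro-v1:0",
    messages=[
        {"role": "user", "content": [{"text": "What is the maximum timeout for an AWS Lambda function?"}]}
    ],
    inferenceConfig={"temperature": 0.0, "maxTokens": 256},  # deterministic output for evaluation
)
print(response["output"]["message"]["content"][0]["text"])
```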
Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among other tasks. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.
Here are some real-life examples of innovative capabilities created by major airlines in partnership with Infosys, a company with deep roots in the airline business, and Amazon Web Services (AWS). It all starts with the Infosys Cobalt Airline Cloud on AWS. Click here to learn more.
Model Variants: The current DeepSeek model collection consists of the following models: DeepSeek-V3, an LLM that uses a Mixture-of-Experts (MoE) architecture. These models retain their existing architecture while gaining additional reasoning capabilities through a distillation process. deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Because Amazon Bedrock is serverless, you don't have to manage infrastructure to securely integrate and deploy generative AI capabilities into your application, handle spiky traffic patterns, and enable new features like cross-Region inference, which helps provide scalability and reliability across AWS Regions.
At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. This is a revolutionary new capability within Amazon Bedrock that serves as a centralized hub for discovering, testing, and implementing foundation models (FMs). Prior to joining AWS, Dr. Li held data science roles in the financial and retail industries.
In a transformer architecture, such layers are the embedding layers and the multilayer perceptron (MLP) layers. Context parallelism support covers the Llama 3.1 (and prior Llama models) and Mistral model architectures. Delving deeper into FP8’s architecture, we discover two distinct subtypes: E4M3 and E5M2.
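The difference between the two subtypes can be seen by round-tripping a few values through each dtype; the sketch below assumes a recent PyTorch build that exposes FP8 dtypes and is illustrative only.

```python
# Illustrative only (requires a PyTorch build with FP8 dtypes): cast a few
# values to each FP8 subtype and back. E4M3 keeps more mantissa precision;
# E5M2 trades precision for a wider exponent range.
import torch

x = torch.tensor([0.1234, 3.5, 240.0, 448.0])

e4m3 = x.to(torch.float8_e4m3fn).to(torch.float32)  # higher precision, smaller range
e5m2 = x.to(torch.float8_e5m2).to(torch.float32)    # lower precision, larger range

print("original:", x.tolist())
print("e4m3    :", e4m3.tolist())
print("e5m2    :", e5m2.tolist())
```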
The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen , and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The following diagram illustrates the solution architecture.
Enhancing AWS Support Engineering efficiency The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.
In this post, we describe the development journey of the generative AI companion for Mozart, the data, the architecture, and the evaluation of the pipeline. The following diagram illustrates the solution architecture. Feedback from each round of tests was incorporated in subsequent tests.