Region Evacuation with a Static Anycast IP Approach
Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.
The built-in elasticity of serverless computing architectures makes them particularly appealing for unpredictable workloads, and it amplifies developer productivity by letting developers focus on writing code and optimizing application design. Industry benchmarks provide additional justification for this hypothesis. Architecture complexity.
It also uses a number of other AWS services, such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.
With QnABot on AWS (QnABot), integrated with Microsoft Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.
Particularly well-suited for microservice-oriented architectures and agile workflows, containers help organizations improve developer efficiency, feature velocity, and resource utilization. Key metrics to monitor when leveraging two container orchestration systems.
David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.
The following diagram illustrates the solution architecture. The first step of the solution is to upload data to Amazon S3: store the product images in Amazon Simple Storage Service (Amazon S3). You need the AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3.
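As a rough illustration of that upload step, here is a minimal boto3 sketch; the bucket name, prefix, and local directory are assumptions for illustration, not details from the post (the AWS CLI equivalent is aws s3 cp ./product-images s3://<bucket>/dataset/ --recursive).

```python
# Minimal sketch: upload a local image dataset to Amazon S3 with boto3.
# Bucket and prefix names below are hypothetical placeholders.
import os
import boto3

s3 = boto3.client("s3")
BUCKET = "my-product-images-bucket"  # assumed; replace with your bucket

def upload_dataset(local_dir: str, prefix: str = "dataset/") -> None:
    """Walk local_dir and upload every file under the given S3 prefix."""
    for root, _, files in os.walk(local_dir):
        for name in files:
            path = os.path.join(root, name)
            key = prefix + os.path.relpath(path, local_dir).replace(os.sep, "/")
            s3.upload_file(path, BUCKET, key)  # boto3 handles multipart uploads for large files

upload_dataset("./product-images")
```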
Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.
The result was a compromised availability architecture. For example, a database team we worked with in an organization new to the cloud launched all of its Amazon RDS database servers, from dev through production, nine months before the scheduled production launch, incurring a $600K-per-month cloud bill. Standardized metrics.
The general architecture of the metadata pipeline consists of two primary steps. Generate transcriptions of audio tracks: use speech recognition models to generate accurate transcripts of the audio content. Word information lost (WIL) – This metric quantifies the amount of information lost due to transcription errors.
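For a concrete sense of the metric, here is a small, self-contained sketch that computes WIL with the standard formula WIL = 1 - H^2 / (N1 * N2), where H is the number of matched words and N1, N2 are the reference and hypothesis word counts; the difflib-based alignment is an approximation for illustration, not the article's implementation.

```python
# Illustrative sketch (not from the article): compute word information lost (WIL).
from difflib import SequenceMatcher

def word_information_lost(reference: str, hypothesis: str) -> float:
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words or not hyp_words:
        return 1.0  # all information lost if either side is empty
    # Approximate the alignment hit count H with difflib's matching blocks.
    matcher = SequenceMatcher(None, ref_words, hyp_words)
    hits = sum(block.size for block in matcher.get_matching_blocks())
    # WIL = 1 - H^2 / (N1 * N2)
    return 1.0 - (hits * hits) / (len(ref_words) * len(hyp_words))

print(word_information_lost("the cat sat on the mat", "the cat sat on a mat"))  # ~0.31
```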
In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. Review the model response and metrics provided.
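A hedged sketch of what starting a Custom Model Import job can look like with boto3 follows; the job name, imported model name, role ARN, and S3 URI are placeholders, not values from the post.

```python
# Hedged sketch: start an Amazon Bedrock Custom Model Import job with boto3.
# All names and ARNs below are hypothetical placeholders.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

response = bedrock.create_model_import_job(
    jobName="deepseek-r1-distill-import",              # assumed job name
    importedModelName="deepseek-r1-distill-llama-8b",  # assumed model name
    roleArn="arn:aws:iam::123456789012:role/BedrockImportRole",  # placeholder
    modelDataSource={
        "s3DataSource": {
            # S3 location holding the model weights in a supported format
            "s3Uri": "s3://my-model-artifacts/deepseek-r1-distill/"
        }
    },
)
print(response["jobArn"])
```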
Organizations can now label all Amazon Bedrock models with AWS cost allocation tags, aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track its Amazon Bedrock spend patterns.
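For illustration, tagging a Bedrock resource with boto3 might look like the following; the resource ARN and tag values are hypothetical, and note that tag keys must also be activated as cost allocation tags in the billing console before they appear in cost reports.

```python
# Hedged sketch: attach cost allocation tags to a Bedrock resource with boto3.
# The resource ARN and tag values are hypothetical placeholders.
import boto3

bedrock = boto3.client("bedrock")

bedrock.tag_resource(
    resourceARN="arn:aws:bedrock:us-east-1:123456789012:application-inference-profile/example",  # placeholder
    tags=[
        {"key": "CostCenter", "value": "cc-1234"},
        {"key": "BusinessUnit", "value": "claims"},
        {"key": "Application", "value": "support-chatbot"},
    ],
)
```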
It prevents vendor lock-in, provides leverage for stronger negotiation, enables business flexibility in strategy execution when complicated architectures or regional security and legal compliance limitations arise, and promotes portability from an application architecture perspective.
Hybrid architecture with AWS Local Zones
To minimize the impact of network latency on time to first token (TTFT) for users regardless of their location, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone, as sketched below.
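The following boto3 sketch shows what creating that subnet can look like; the VPC ID, CIDR block, and Local Zone name (a Los Angeles zone here) are assumptions, and the Local Zone group must be opted in beforehand.

```python
# Hedged sketch: create a subnet inside an AWS Local Zone with boto3.
# The VPC ID, CIDR block, and Local Zone name are hypothetical placeholders.
# Prerequisite: opt in to the zone group first (e.g., via modify_availability_zone_group).
import boto3

ec2 = boto3.client("ec2", region_name="us-west-2")

subnet = ec2.create_subnet(
    VpcId="vpc-0123456789abcdef0",        # placeholder VPC in the parent Region
    CidrBlock="10.0.8.0/24",              # assumed CIDR carved from the VPC range
    AvailabilityZone="us-west-2-lax-1a",  # Los Angeles Local Zone name
)
print(subnet["Subnet"]["SubnetId"])
```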
As organizations continue to build out their digital architecture, a new category of enterprise software has emerged to help them manage that process. "Enterprise architecture today is very much about the scaffolding in the organization," he said. This also means that you can run, for example, scenario analyses.
SageMaker Unified Studio combines various AWS services, including Amazon Bedrock, Amazon SageMaker, Amazon Redshift, AWS Glue, Amazon Athena, and Amazon Managed Workflows for Apache Airflow (MWAA), into a comprehensive data and AI development platform.
The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen, and leading universities in Bahrain, including Bahrain Polytechnic and the University of Bahrain. The following diagram illustrates the solution architecture.
At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. The following diagram illustrates the solution architecture.
We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and reduce operational inefficiencies. MaestroQA integrated Amazon Bedrock into their existing architecture using Amazon Elastic Container Service (Amazon ECS).
Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric is initially available in AWS' US East (N. Virginia) Region, among others.
"Our partnership with AWS and our commitment to be early adopters of innovative technologies like Amazon Bedrock underscore our dedication to making advanced HCM technology accessible for businesses of any size. We are thrilled to partner with AWS on this groundbreaking generative AI project," says John Canada, VP of Engineering at Asure.
This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. All of this data is centralized and can be used to improve metrics in scenarios such as sales or call centers.
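A minimal sketch of that extraction flow follows, assuming Amazon Transcribe and Amazon Comprehend as the underlying AI/ML services (the excerpt does not name them explicitly); the bucket, key, and job names are placeholders.

```python
# Illustrative sketch (not the article's code): transcribe a call recording and
# score its sentiment with Amazon Transcribe and Amazon Comprehend via boto3.
# Bucket, key, and job names are hypothetical placeholders.
import boto3

transcribe = boto3.client("transcribe")
comprehend = boto3.client("comprehend")

transcribe.start_transcription_job(
    TranscriptionJobName="call-1234",                            # assumed name
    Media={"MediaFileUri": "s3://my-call-audio/call-1234.wav"},  # placeholder
    MediaFormat="wav",
    LanguageCode="en-US",
)

# Once the job completes and the transcript text has been fetched...
transcript_text = "I waited forty minutes and nobody could answer my question."
sentiment = comprehend.detect_sentiment(Text=transcript_text, LanguageCode="en")
print(sentiment["Sentiment"], sentiment["SentimentScore"])
```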
Although automated metrics are fast and cost-effective, they can only evaluate the correctness of an AI response, without capturing other evaluation dimensions or providing explanations of why an answer is problematic. Human evaluation, although thorough, is time-consuming and expensive at scale.
In the era of generative AI, new large language models (LLMs) are continually emerging, each with unique capabilities, architectures, and optimizations. In this post, we present an LLM migration paradigm and architecture, including a continuous process of model evaluation, prompt generation using Amazon Bedrock, and data-aware optimization.
Our proposed architecture provides a scalable and customizable solution for online LLM monitoring, enabling teams to tailor the monitoring solution to their specific use cases and requirements. Overview of solution
The first thing to consider is that different metrics require different computation considerations.
Solution overview
To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. The following diagram illustrates the solution architecture. To do so, we create a knowledge base. For Job name, enter a name for the fine-tuning job.
Hameed and Qadeer developed Deep Vision's architecture as part of their Ph.D. research. "They came up with a very compelling architecture for AI that minimizes data movement within the chip," Annavajjhala explained. In addition, its software optimizes the overall data flow inside the architecture based on the specific workload.
Model Variants
The current DeepSeek model collection consists of the following models: DeepSeek-V3, an LLM that uses a Mixture-of-Experts (MoE) architecture. These models retain their existing architecture while gaining additional reasoning capabilities through a distillation process. deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Large organizations often have many business units with multiple lines of business (LOBs) and a central governing entity, and they typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. In this post, we evaluate different generative AI operating model architectures that could be adopted.
By monitoring utilization metrics, organizations can quantify the actual productivity gains achieved with Amazon Q Business. Tracking metrics such as time saved and number of queries resolved can provide tangible evidence of the service's impact on overall workplace productivity.
This advancement makes sophisticated agent architectures more accessible and economically viable across a broader range of applications and scales of deployment. For the most up-to-date and comprehensive information, refer to Submit a model distillation job in Amazon Bedrock in the official AWS documentation.
Tuning a model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among other things. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.
For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isn't just about moving data; it requires careful planning and execution. Need to hire skilled engineers?
To evaluate the effectiveness of a RAG system, we focus on three key metrics: Answer relevancy – Measures how well the generated answer addresses the user's query. By implementing dynamic metadata filtering, you can significantly improve these metrics, leading to more accurate and relevant RAG responses.
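To make the idea concrete, here is a hedged sketch of a filtered retrieval against an Amazon Bedrock knowledge base; the knowledge base ID and the metadata key/value are hypothetical, and a real system would derive the filter dynamically from the user's query rather than hardcoding it.

```python
# Hedged sketch: metadata-filtered retrieval from a Bedrock knowledge base.
# The knowledge base ID and metadata key/values are hypothetical placeholders.
import boto3

agent_runtime = boto3.client("bedrock-agent-runtime")

response = agent_runtime.retrieve(
    knowledgeBaseId="KBEXAMPLE01",  # placeholder
    retrievalQuery={"text": "How do I rotate IAM access keys?"},
    retrievalConfiguration={
        "vectorSearchConfiguration": {
            "numberOfResults": 5,
            # Restrict retrieval to documents whose metadata matches the query
            # context; this filtering is what improves answer relevancy in RAG.
            "filter": {"equals": {"key": "service", "value": "iam"}},
        }
    },
)
for result in response["retrievalResults"]:
    print(result["content"]["text"][:120])
```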
Two years ago, we shared our experiences with adopting AWS Graviton3 and our enthusiasm for the future of AWS Graviton and Arm. Once again, we’re privileged to share our experiences as a launch customer of the Amazon EC2 R8g instances powered by AWS Graviton4, the newest generation of AWS Graviton processors.
To assess system reliability, engineering teams often rely on key metrics such as mean time between failures (MTBF), which measures the average operational time between hardware failures and serves as a valuable indicator of system robustness.
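As a worked example, MTBF is simply total operational time divided by the number of failures; the sketch below uses made-up uptime intervals.

```python
# Minimal sketch: MTBF as total operational time divided by the number of failures.
def mtbf(operational_hours: list[float]) -> float:
    """Each entry is the uptime (in hours) between consecutive failures."""
    return sum(operational_hours) / len(operational_hours)

# Three failures observed after 1,200, 950, and 1,450 hours of operation:
# (1200 + 950 + 1450) / 3 = 1200 hours mean time between failures.
print(mtbf([1200.0, 950.0, 1450.0]))
```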
Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. Amazon Bedrock Data Automation optimizes for available AWS Regional capacity by automatically routing across regions within the same geographic area to maximize throughput at no additional cost.
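For illustration, invoking a model through a cross-Region inference profile with boto3 can look like the following; the profile ID shown is an assumption (the "us." prefix denotes a US-geography profile), so check your account for the actual IDs available to you.

```python
# Hedged sketch: invoke a model through a cross-Region inference profile, which
# lets Bedrock route the request across Regions in the same geographic area.
# The profile ID is an assumption; look up the real IDs in the Bedrock console.
import boto3

runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

response = runtime.converse(
    modelId="us.anthropic.claude-3-5-sonnet-20240620-v1:0",  # "us." prefix = cross-Region profile
    messages=[{"role": "user", "content": [{"text": "Summarize MTBF in one sentence."}]}],
)
print(response["output"]["message"]["content"][0]["text"])
```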
This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services.
In this post, we describe the development journey of the generative AI companion for Mozart: the data, the architecture, and the evaluation of the pipeline. The following diagram illustrates the solution architecture. Data: Policy forms
Mozart is designed to author policy forms like coverage and endorsements.
At AWS, we are transforming our seller and customer journeys by using generative artificial intelligence (AI) across the sales lifecycle. It will be able to answer questions, generate content, and facilitate bidirectional interactions, all while continuously using internal AWS and external data to deliver timely, personalized insights.
Because Amazon Bedrock is serverless, you don’t have to manage infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. AWS Prototyping developed an AWS Cloud Development Kit (AWS CDK) stack for deployment following AWS best practices.
Mistral developed a novel architecture for Pixtral 12B, optimized for both computational efficiency and performance. This architecture supports processing an arbitrary number of images of varying sizes within a large context window of 128k tokens. For more Mistral resources on AWS, check out the GitHub repo.
DeepSeek-R1 uses a Mixture of Experts (MoE) architecture and is 671 billion parameters in size. The MoE architecture allows activation of 37 billion parameters, enabling efficient inference by routing queries to the most relevant expert clusters. For details, refer to Create an AWS account.
Yes, the AWS re:Invent season is upon us and, as always, the place to be is Las Vegas! You marked your calendars, you booked your hotel, and you even purchased the airfare. Generative AI is at the heart of the AWS Village this year. And last but not least (and always fun!) are the sessions dedicated to AWS DeepRacer!