
Building Resilient Public Networking on AWS: Part 4

Xebia

Region evacuation with a static anycast IP approach. Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.
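
On AWS, static anycast IPs for public endpoints are typically provided by AWS Global Accelerator; as a minimal sketch (not the article's exact method), draining a region during an evacuation can be done by setting that region's endpoint group traffic dial to zero. The ARN below is a placeholder.

# Sketch: drain traffic from one region's endpoint group during an evacuation.
# Assumes an existing AWS Global Accelerator setup; the ARN is a placeholder.
import boto3

# The Global Accelerator control-plane API is served from us-west-2.
client = boto3.client("globalaccelerator", region_name="us-west-2")

ENDPOINT_GROUP_ARN = "arn:aws:globalaccelerator::123456789012:accelerator/example/listener/example/endpoint-group/example"

# Setting the traffic dial to 0 stops new traffic to this region while the
# static anycast IPs stay unchanged for clients.
client.update_endpoint_group(
    EndpointGroupArn=ENDPOINT_GROUP_ARN,
    TrafficDialPercentage=0.0,
)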

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. You can use AWS services such as Application Load Balancer to implement this approach. API Gateway also provides a WebSocket API.
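
As a rough illustration of the WebSocket path (not the article's implementation), a Lambda function behind an API Gateway WebSocket API can push responses back over the caller's connection; the route handling and payload shown here are assumptions.

# Sketch: Lambda handler for an API Gateway WebSocket API (routes and payload are illustrative).
import json
import boto3

def handler(event, context):
    ctx = event["requestContext"]
    connection_id = ctx["connectionId"]
    route = ctx["routeKey"]

    # Nothing to do for connection lifecycle routes in this sketch.
    if route in ("$connect", "$disconnect"):
        return {"statusCode": 200}

    # Post a reply back over the same WebSocket connection.
    endpoint = f"https://{ctx['domainName']}/{ctx['stage']}"
    api = boto3.client("apigatewaymanagementapi", endpoint_url=endpoint)
    api.post_to_connection(
        ConnectionId=connection_id,
        Data=json.dumps({"message": "ack"}).encode("utf-8"),
    )
    return {"statusCode": 200}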

Trending Sources

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

AWS Trainium and AWS Inferentia-based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant, low-cost framework to run LLMs efficiently in a containerized environment. For more information on how to view and increase your quotas, refer to Amazon EC2 service quotas.
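
To check the relevant quotas programmatically before provisioning Inferentia capacity, the Service Quotas API can be queried; this is a sketch, and the filter on the quota name is an assumption, not guidance from the article.

# Sketch: list EC2 service quotas related to Inf instances before requesting capacity.
import boto3

sq = boto3.client("service-quotas", region_name="us-east-1")

# Page through EC2 quotas and print the ones mentioning "Inf" instances.
paginator = sq.get_paginator("list_service_quotas")
for page in paginator.paginate(ServiceCode="ec2"):
    for quota in page["Quotas"]:
        if "Inf" in quota["QuotaName"]:
            print(quota["QuotaCode"], quota["QuotaName"], quota["Value"])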

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.
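
For the metrics side of observability, one common pattern (not necessarily the one the solution uses) is publishing custom application metrics to Amazon CloudWatch; the namespace, metric name, and dimension below are illustrative placeholders.

# Sketch: publish a custom latency metric to Amazon CloudWatch.
# Namespace, metric name, and dimensions are illustrative, not taken from the article.
import boto3

cloudwatch = boto3.client("cloudwatch")

cloudwatch.put_metric_data(
    Namespace="GenAIApp/Observability",
    MetricData=[{
        "MetricName": "ModelInvocationLatency",
        "Dimensions": [{"Name": "ModelId", "Value": "example-model"}],
        "Value": 412.0,
        "Unit": "Milliseconds",
    }],
)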

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. To do so, we create a knowledge base.
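
On the RAG side of such a comparison, retrieving context from an Amazon Bedrock knowledge base looks roughly like the sketch below; the knowledge base ID, query text, and result count are placeholders, not values from the case study.

# Sketch: query an Amazon Bedrock knowledge base for RAG context.
# Knowledge base ID, query text, and numberOfResults are placeholders.
import boto3

agent_runtime = boto3.client("bedrock-agent-runtime")

response = agent_runtime.retrieve(
    knowledgeBaseId="KBEXAMPLE123",
    retrievalQuery={"text": "What is the maximum size of an S3 object?"},
    retrievalConfiguration={"vectorSearchConfiguration": {"numberOfResults": 3}},
)

# Print a short preview of each retrieved chunk.
for result in response["retrievalResults"]:
    print(result["content"]["text"][:200])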

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

To evaluate transcription quality, the team compared the results against ground truth subtitles on a large test set using the following metrics: Word error rate (WER) – This metric measures the percentage of words that are incorrectly transcribed compared to the ground truth. A lower WER signifies better accuracy.
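
As a reference point, WER can be computed as the word-level edit distance between the hypothesis and the ground truth divided by the number of ground-truth words; a minimal self-contained sketch (not DPG Media's evaluation code):

# Sketch: word error rate (WER) = word-level edit distance / reference length.
def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # Dynamic-programming edit distance over words (substitutions, insertions, deletions).
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

print(wer("the cat sat on the mat", "the cat sat on mat"))  # ~0.167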

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

Response latency refers to the time between the user finishing their speech and beginning to hear the AI assistant's response. AWS Local Zones are a type of edge infrastructure deployment that places select AWS services close to large population and industry centers. Next, create a subnet inside each Local Zone.
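
Opting in to a Local Zone group and creating a subnet pinned to that zone takes a couple of EC2 API calls; the zone names, VPC ID, and CIDR block below are placeholders, not values from the article.

# Sketch: opt in to a Local Zone group and create a subnet in that zone.
# Zone group/name, VPC ID, and CIDR block are placeholders.
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

# Opt in to the Local Zone group (one-time per account and region).
ec2.modify_availability_zone_group(GroupName="us-east-1-bos-1", OptInStatus="opted-in")

# Create a subnet pinned to the Local Zone so compute runs close to end users.
ec2.create_subnet(
    VpcId="vpc-0123456789abcdef0",
    CidrBlock="10.0.128.0/20",
    AvailabilityZone="us-east-1-bos-1a",
)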
