Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. For a detailed breakdown of the features and implementation specifics, refer to the comprehensive documentation in the GitHub repository.
Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. Under Input data, enter the location of the source S3 bucket (training data) and target S3 bucket (model outputs and training metrics), and optionally the location of your validation dataset.
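As a sketch of what those console inputs map to in code, the snippet below starts a Bedrock model customization job with boto3; the job name, role ARN, base model, bucket URIs, and hyperparameter keys are placeholder assumptions rather than values from the post.

import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

response = bedrock.create_model_customization_job(
    jobName="transcript-finetune-demo",                      # placeholder job name
    customModelName="my-custom-model",                       # placeholder model name
    roleArn="arn:aws:iam::123456789012:role/BedrockCustomizationRole",  # placeholder role
    baseModelIdentifier="amazon.titan-text-express-v1",      # example base model
    customizationType="FINE_TUNING",
    trainingDataConfig={"s3Uri": "s3://my-training-bucket/train.jsonl"},                       # source S3 bucket
    validationDataConfig={"validators": [{"s3Uri": "s3://my-training-bucket/validation.jsonl"}]},  # optional validation set
    outputDataConfig={"s3Uri": "s3://my-output-bucket/"},     # target S3 bucket for outputs and metrics
    hyperParameters={"epochCount": "2", "batchSize": "1", "learningRate": "0.00001"},  # model-specific keys, assumed
)
print(response["jobArn"])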
We also provide insights on how to achieve optimal results for different dataset sizes and use cases, backed by experimental data and performance metrics. The evaluation metric is the F1 score, which measures the word-to-word match of the extracted content between the generated output and the ground truth answer.
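As a rough illustration of that metric, here is a minimal word-level F1 sketch, assuming simple whitespace tokenization and multiset word overlap (the post may implement it differently):

from collections import Counter

def word_f1(prediction: str, reference: str) -> float:
    # Tokenize both strings and count overlapping words (multiset intersection).
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    common = Counter(pred_tokens) & Counter(ref_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

print(word_f1("invoice total is 42 USD", "the invoice total is 42 USD"))  # ~0.91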
When speaking of machine learning, we typically discuss data preparation or model building. A fusion of the terms "machine learning" and "operations", MLOps is a set of methods to automate the lifecycle of machine learning algorithms in production, from initial model training to deployment to retraining against new data.
Data analysis and machine learning techniques are great candidates to help secure large-scale streaming platforms. That's up to the machine learning model to discover and avoid such false-positive incidents. For the one-class as well as binary anomaly detection task, such metrics are accuracy, precision, recall, F0.5,
To evaluate transcription accuracy, the team compared the results against ground truth subtitles on a large test set, using the following metrics: Word error rate (WER) – This metric measures the percentage of words that are incorrectly transcribed compared to the ground truth. A lower WER signifies better accuracy.
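For reference, WER is usually computed as the word-level edit distance divided by the number of reference words; a minimal sketch (assuming whitespace tokenization) might look like this:

def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between the first i reference words and first j hypothesis words
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution or match
    return dp[len(ref)][len(hyp)] / len(ref)

print(wer("the cat sat on the mat", "the cat sit on mat"))  # 2 errors / 6 words ≈ 0.33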
For a comprehensive overview of metadata filtering and its benefits, refer to Amazon Bedrock Knowledge Bases now supports metadata filtering to improve retrieval accuracy. To evaluate the effectiveness of a RAG system, we focus on three key metrics: Answer relevancy – Measures how well the generated answer addresses the user’s query.
Shared components refer to the functionality and features shared by all tenants. Refer to Perform AI prompt-chaining with Amazon Bedrock for more details. Additionally, contextual grounding checks can help detect hallucinations in model responses based on a reference source and a user query.
In recognition of the diverse workloads that data scientists face, Cloudera's library of Applied ML Prototypes (AMPs) provides data scientists with pre-built reference examples and end-to-end solutions, using some of the most cutting-edge ML methods, for a variety of common data science projects. AutoML with TPOT.
It is designed to handle the demanding computational and latency requirements of state-of-the-art transformer models, including Llama, Falcon, Mistral, Mixtral, and GPT variants; for a full list of TGI-supported models, refer to supported models. For a complete list of runtime configurations, refer to text-generation-launcher arguments.
For instance, Pixtral Large is highly effective at spotting irregularities or insightful trends within training loss curves or performance metrics, enhancing the accuracy of data-driven decision-making. For more information on generating JSON using the Converse API, refer to Generating JSON with the Amazon Bedrock Converse API.
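For flavor, here is a minimal sketch of the tool-use approach that the linked Converse API post describes for structured JSON output; the model ID, tool name, and schema are placeholder assumptions, not values from the post.

import boto3, json

client = boto3.client("bedrock-runtime", region_name="us-east-1")

tool_config = {
    "tools": [{
        "toolSpec": {
            "name": "record_summary",                     # hypothetical tool name
            "description": "Return a structured summary.",
            "inputSchema": {"json": {
                "type": "object",
                "properties": {
                    "title": {"type": "string"},
                    "anomaly_detected": {"type": "boolean"},
                },
                "required": ["title", "anomaly_detected"],
            }},
        }
    }],
    "toolChoice": {"tool": {"name": "record_summary"}},   # force the model to call the "tool", i.e. emit JSON
}

response = client.converse(
    modelId="mistral.pixtral-large-2502-v1:0",            # placeholder model ID
    messages=[{"role": "user", "content": [{"text": "Summarize this training loss curve: ..."}]}],
    toolConfig=tool_config,
)

# The structured arguments the model passed to the "tool" are the JSON we want.
for block in response["output"]["message"]["content"]:
    if "toolUse" in block:
        print(json.dumps(block["toolUse"]["input"], indent=2))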
We then guide you through getting started with Container Caching, explaining its automatic enablement for SageMaker-provided DLCs and how to reference cached versions. With its growing feature set, TorchServe is a popular choice for deploying and scaling machine learning models among inference customers.
Lack of standardized metrics: Interpersonal skills are inherently difficult to measure, and many organizations lack standardized methods or benchmarks for assessing them. Example: "Imagine you're explaining how machine learning works to a client with no technical background. How would you describe it?"
To assess system reliability, engineering teams often rely on key metrics such as mean time between failures (MTBF), which measures the average operational time between hardware failures and serves as a valuable indicator of system robustness. The time taken to determine the root cause is referred to as mean time to detect (MTTD).
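As a back-of-the-envelope illustration (with made-up timestamps), MTBF can be computed as total operational time divided by the number of failures, and MTTD as the average gap between failure and detection:

from datetime import datetime, timedelta

failures = [  # (failure_time, detection_time) pairs, hypothetical
    (datetime(2024, 1, 3, 2, 0), datetime(2024, 1, 3, 2, 25)),
    (datetime(2024, 2, 17, 9, 0), datetime(2024, 2, 17, 9, 10)),
]
observation_window = timedelta(days=90)

mtbf = observation_window / len(failures)                              # total time / number of failures
mttd = sum(((d - f) for f, d in failures), timedelta()) / len(failures)  # average failure-to-detection gap

print(f"MTBF ≈ {mtbf.days} days, MTTD ≈ {mttd.total_seconds() / 60:.0f} minutes")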
Amazon Bedrock offers fine-tuning capabilities that allow you to customize these pre-trained models using proprietary call transcript data, facilitating high accuracy and relevance without the need for extensive machine learning (ML) expertise. In addition, traditional ML metrics were used for Yes/No answers.
By monitoring utilization metrics, organizations can quantify the actual productivity gains achieved with Amazon Q Business. Tracking metrics such as time saved and number of queries resolved can provide tangible evidence of the service's impact on overall workplace productivity.
Although automated metrics are fast and cost-effective, they can only evaluate the correctness of an AI response, without capturing other evaluation dimensions or providing explanations of why an answer is problematic. Human evaluation, although thorough, is time-consuming and expensive at scale.
Amazon SageMaker Canvas is a no-code machine learning (ML) service that empowers business analysts and domain experts to build, train, and deploy ML models without writing a single line of code. For instructions to catalog the data, refer to Populating the AWS Glue Data Catalog. You can monitor the progress of model creation.
Today, we have AI and machine learning to extract insights, inaudible to human beings, from speech, voices, snoring, music, industrial and traffic noise, and other types of acoustic signals. At the same time, keep in mind that none of these audio files can be fed directly to machine learning models.
To learn more details about these service features, refer to Generative AI foundation model training on Amazon SageMaker. This design simplifies the complexity of distributed training while maintaining the flexibility needed for diverse machine learning (ML) workloads, making it an ideal solution for enterprise AI development.
With Power BI, you can pull data from almost any data source and create dashboards that track the metrics you care about the most. You can also use Power BI to prepare and manage high-quality data to use across the business in other tools, from low-code apps to machine learning.
From human genome mapping to Big Data Analytics, Artificial Intelligence (AI), Machine Learning, Blockchain, Mobile Digital Platforms (digital streets, towns, and villages), Social Networks and Business, Virtual Reality, and so much more. What is Machine Learning? Machine Learning delivers on this need.
Machine learning (ML) history can be traced back to the 1950s, when the first neural networks and ML algorithms appeared. An analysis of more than 16,000 data science papers by MIT Technology Review shows the exponential growth of machine learning over the last 20 years, driven by advances in big data and deep learning.
For automatic model evaluation jobs, you can either use built-in datasets across three predefined metrics (accuracy, robustness, toxicity) or bring your own datasets. It refers to the ability to manage, guide, and constrain AI systems to make sure they operate within desired parameters.
The commissioning of a series or film, which we refer to as a title, is a creative decision. The increasing vastness and diversity of what our members are watching make answering these questions particularly challenging using conventional methods, which draw on a limited set of comparable titles and their respective performance metrics (e.g.,
In this post, we demonstrate a few metrics for online LLM monitoring and their respective architecture for scale using AWS services such as Amazon CloudWatch and AWS Lambda. Overview of solution: The first thing to consider is that different metrics require different computation considerations. The function invokes the modules.
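A minimal sketch of that pattern, with a placeholder metric module and hypothetical namespace, metric name, and event fields, might look like the following Lambda handler:

import boto3

cloudwatch = boto3.client("cloudwatch")

def score_response(prompt: str, completion: str) -> float:
    # Placeholder metric module; swap in toxicity, relevance, or any other score you compute.
    return len(completion.split()) / max(len(prompt.split()), 1)

def lambda_handler(event, context):
    # Compute a metric for one model response and publish it to CloudWatch.
    value = score_response(event["prompt"], event["completion"])
    cloudwatch.put_metric_data(
        Namespace="LLM/Monitoring",                     # hypothetical namespace
        MetricData=[{
            "MetricName": "CompletionToPromptRatio",    # hypothetical metric name
            "Value": value,
            "Unit": "None",
            "Dimensions": [{"Name": "ModelId", "Value": event.get("model_id", "unknown")}],
        }],
    )
    return {"metric": value}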
Ground truth data in AI refers to data that is known to be factual, representing the expected use case outcome for the system being modeled. With deterministic evaluation processes such as the Factual Knowledge and QA Accuracy metrics of FMEval, ground truth generation and evaluation metric implementation are tightly coupled.
Review the model response and metrics provided. Amazon CloudWatch provides metrics for your imported models, helping you track usage patterns and performance. For more information, refer to the Amazon Bedrock User Guide. Consider implementing monitoring and observability. You can monitor costs with AWS Cost Explorer.
For a detailed explanation of the concept, refer to the paper Accelerating Large Language Model Decoding with Speculative Sampling. For details, refer to Creating an AWS account. For more information, refer to Configure the AWS CLI. We use JupyterLab in Amazon SageMaker Studio running on an ml.t3.medium
But a particular category of startup stood out: those applying AI and machine learning to solve problems, especially for business-to-business clients. The platform is powered by large language models (think GPT-3) that reference several sources to find the most likely answers, according to co-founder Michael Royzen.
Beyond this, LibLab monitors and updates the SDK "when the language evolves," according to Ofek, and shows metrics that indicate how the API is being used. But it's true that code-generating systems have become more capable in recent years with the advent of sophisticated machine learning techniques.
How do Amazon Nova Micro and Amazon Nova Lite perform against GPT-4o mini in these same metrics? Vector database FloTorch selected Amazon OpenSearch Service as a vector database for its high-performance metrics. How well do these models handle RAG use cases across different industry domains? Each provisioned node was r7g.4xlarge,
        ... self.config)
        self.next(self.end)

    @step
    def end(self):
        pass

if __name__ == "__main__":
    ConfigurableFlow()

There is a lot going on in the code above; a few highlights: you can refer to configs before they have been defined using config_expr, and from the developer's point of view, Configs behave like dictionary-like artifacts.
An ideal candidate has skills in three fields: mathematics/statistics, machine learning/programming, and business/domain knowledge. Machine Learning and Programming. Apart from programming skills, the candidate should have a good understanding of machine learning concepts like classification and regression.
You can also use this model with Amazon SageMaker JumpStart, a machine learning (ML) hub that provides access to algorithms and models that can be deployed with one click for running inference. Performance metrics and benchmarks: Pixtral 12B is trained to understand both natural images and documents, achieving 52.5%
It often requires managing multiple machine learning (ML) models, designing complex workflows, and integrating diverse data sources into production-ready formats. He leads machine learning initiatives and projects across business domains, leveraging multimodal AI, generative models, computer vision, and natural language processing.
Distillation refers to a process of training smaller, more efficient models to mimic the behavior and reasoning patterns of the larger DeepSeek-R1 model, using it as a teacher model. Solution overview: You can use DeepSeek's distilled models within the AWS managed machine learning (ML) infrastructure.
With the industry moving towards end-to-end ML teams to enable them to implement MLOps practices, it is paramount to look past the model and view the entire system around your machine learning model. Demand forecasting is chosen because it's a very tangible problem and a very suitable application for machine learning.
Customer satisfaction score (CSAT) and Net Promoter Score (NPS) are the most important metrics for any insurance company. But it does need more advanced approaches that mimic human perception and judgment like AI, machine learning, and ML-based Robotic Process Automation. Hire machine learning specialists on the team.
We'll discuss collecting data about a client's relationship with a brand, characteristics of customer behavior that correlate the most with churn, and explore the logic behind selecting the best-performing machine learning models. Identifying at-risk customers with machine learning: problem-solving at a glance.
This feature provides users the ability to explore metrics with natural language. Tableau Pulse will then send insights for that metric directly to the executive’s preferred communications platform: Slack, email, mobile device, etc. Metrics Bootstrapping. Metric Goals.
Machine learning (ML) research has proven that large language models (LLMs) trained with significantly large datasets result in better model quality. For more detailed information, refer to Getting Started with Fully Sharded Data Parallel (FSDP). For more information, refer to Getting started with Llama.
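A minimal FSDP sketch along the lines of the linked getting-started guide follows; the toy model is an assumption, and the script is assumed to be launched with torchrun so the process-group environment variables are already set.

import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

# Toy model for illustration; parameters get sharded across ranks once wrapped in FSDP.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096), torch.nn.GELU(), torch.nn.Linear(4096, 1024)
).cuda()
model = FSDP(model)

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
x = torch.randn(8, 1024, device="cuda")
loss = model(x).pow(2).mean()   # dummy loss; replace with your training objective
loss.backward()
optimizer.step()

dist.destroy_process_group()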
Governance in the context of generative AI refers to the frameworks, policies, and processes that streamline the responsible development, deployment, and use of these technologies. For a comprehensive read about vector store and embeddings, you can refer to The role of vector databases in generative AI applications.
Gen AI takes us from single-use models of machine learning (ML) to AI tools that promise to be a platform with uses in many areas, but you still need to validate that they're appropriate for the problems you want solved, and that your users know how to use gen AI effectively. Now nearly half of code suggestions are accepted.