Artificial Intelligence, Generative AI and Metrics

LLM benchmarking: How to find the right AI model

CIO

MARCH 11, 2025

How do companies decide which large language model (LLM) is right for them? Beneath the glossy surface of advertising promises lurks the crucial question: Which of these technologies really delivers what it promises, and which ones are more likely to cause AI projects to falter?

Artificial Intelligence

Artificial Intelligence How To Metrics Software Review

CIOs’ lack of success metrics dooms many AI projects

CIO

DECEMBER 5, 2024

Many organizations have launched dozens of AI proof-of-concept projects only to see a huge percentage fail, in part because CIOs don’t know whether the POCs are meeting key metrics, according to research firm IDC. Many organizations have launched gen AI projects without cleaning up and organizing their internal data, he adds.

Metrics

Metrics Artificial Intelligence Fractional CTO Strategic Planning

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. 70B model showed significant and consistent improvements in end-to-end (E2E) scaling times.

Generative AI

Generative AI Artificial Intelligence Machine Learning AWS

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

As enterprises increasingly embrace generative AI, they face challenges in managing the associated costs. With demand for generative AI applications surging across projects and multiple lines of business, accurately allocating and tracking spend becomes more complex.

Generative AI

Generative AI AWS Artificial Intelligence Budget

LLMs in Production: Tooling, Process, and Team Structure

Speaker: Dr. Greg Loughnane and Chris Alexiuk

Technology professionals developing generative AI applications are finding that there are big leaps from POCs and MVPs to production-ready applications. However, during development – and even more so once deployed to production – best practices for operating and improving generative AI applications are less understood.

Tools

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. In the context of Amazon Bedrock, observability and evaluation become even more crucial.

Generative AI

Generative AI Applications AWS Knowledge Base

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

While organizations continue to discover the powerful applications of generative AI, adoption is often slowed down by team silos and bespoke workflows. To move faster, enterprises need robust operating models and a holistic approach that simplifies the generative AI lifecycle.

Generative AI

Generative AI AWS Artificial Intelligence Enterprise

How to Use Generative AI and LLMs to Improve Search

TechEmpower CTO

OCTOBER 9, 2023

Artificial Intelligence (AI), and particularly Large Language Models (LLMs), have significantly transformed the search engine as we’ve known it. With Generative AI and LLMs, new avenues for improving operational efficiency and user satisfaction are emerging every day.

Generative AI

Generative AI Artificial Intelligence How To Systems Review

Agentic AI design: An architectural case study

CIO

NOVEMBER 19, 2024

From obscurity to ubiquity, the rise of large language models (LLMs) is a testament to rapid technological advancement. Just a few short years ago, models like GPT-1 (2018) and GPT-2 (2019) barely registered a blip on anyone’s tech radar. In 2024, a new trend called agentic AI emerged. Do you see any issues?

Case Study

Case Study Artificial Intelligence Study Architecture

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation metrics for at-scale production guardrails.

Artificial Intelligence

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. All of this data is centralized and can be used to improve metrics in scenarios such as sales or call centers.

Generative AI

Generative AI Video Engineering Artificial Intelligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

Principal wanted to use existing internal FAQs, documentation, and unstructured data and build an intelligent chatbot that could provide quick access to the right information for different roles. Adherence to responsible and ethical AI practices was a priority for Principal.

Generative AI

Generative AI AWS Groups Artificial Intelligence

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. Building a generative AI application: SageMaker Unified Studio offers tools to discover and build with generative AI.

Generative AI

Generative AI Applications Technical Review Software Review

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Generative AI can revolutionize organizations by enabling the creation of innovative applications that offer enhanced customer and employee experiences. In this post, we evaluate different generative AI operating model architectures that could be adopted.

Generative AI

Generative AI Organization Enterprise Artificial Intelligence

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. With the significant developments in the field of generative AI, intelligent applications powered by foundation models (FMs) can help users map out an itinerary through an intuitive natural conversation interface.

Artificial Intelligence

Artificial Intelligence Generative AI Travel AWS

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The following were some initial challenges in automation: Language diversity – The services host both Dutch and English shows. Some local shows feature Flemish dialects, which can be difficult for some large language models (LLMs) to understand. A lower WER indicates a more accurate transcription.

Media

Media Video Artificial Intelligence Generative AI

5 tips for better business value from gen AI

CIO

DECEMBER 10, 2024

Instead, CIOs must partner with CMOs and other business leaders to help quantify where gen AI can drive other strategic impacts, especially those directly connected to the bottom line. CIOs should return to basics, zero in on metrics that will improve through gen AI investments, and estimate targets and timeframes.

Weak Development Team

Weak Development Team Metrics Software Review Technical Review

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

Retrieval Augmented Generation (RAG) has become a crucial technique for improving the accuracy and relevance of AI-generated responses. The effectiveness of RAG heavily depends on the quality of context provided to the large language model (LLM), which is typically retrieved from vector stores based on user queries.

Artificial Intelligence

Artificial Intelligence Applications Knowledge Base Generative AI

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represents a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Intelligence Study Generative AI

Generative AI – The End of Empty Textboxes

TechEmpower CTO

NOVEMBER 13, 2023

This isn’t just our opinion - our startup metrics prove it! On a different project, we’d just used a Large Language Model (LLM) - in this case OpenAI’s GPT - to provide users with pre-filled text boxes, with content based on choices they’d previously made. Everyone struggles with empty text boxes.

Generative AI

Generative AI Artificial Intelligence Real Estate Education

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

At the forefront of using generative AI in the insurance industry, Verisk’s generative AI-powered solutions, like Mozart, remain rooted in ethical and responsible AI use. Security and governance: Generative AI is a very new technology and brings with it new challenges related to security and compliance.

Generative AI

Generative AI Technical Review Insurance Policies

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning - AI

FEBRUARY 12, 2025

Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate humanlike text. Researchers developed Medusa, a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.

Artificial Intelligence

Artificial Intelligence Training AWS Machine Learning

Improve Amazon Nova migration performance with data-aware prompt optimization

AWS Machine Learning - AI

APRIL 29, 2025

In the era of generative AI, new large language models (LLMs) are continually emerging, each with unique capabilities, architectures, and optimizations. We also discuss the lessons learned and best practices for you to implement the solution for your real-world use cases.

Artificial Intelligence

Artificial Intelligence Performance Data Generative AI

7 ways gen AI can create more work than it saves

CIO

NOVEMBER 13, 2024

One is going through the big areas where we have operational services and look at every process to be optimized using artificial intelligence and large language models. And the second is deploying what we call LLM Suite to almost every employee. “We’re doing two things,” he says.

Weak Development Team

Weak Development Team Artificial Intelligence Technical Review Generative AI

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.

Artificial Intelligence

Artificial Intelligence AWS Lambda Metrics

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

Asure anticipated that generative AI could aid contact center leaders to understand their teams’ support performance, identify gaps and pain points in their products, and recognize the most effective strategies for training customer support representatives using call transcripts. Yasmine Rodriguez, CTO of Asure.

Generative AI

Generative AI Artificial Intelligence Metrics AWS

Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits

AWS Machine Learning - AI

APRIL 22, 2025

Over the past several months, we drove several improvements in intelligent prompt routing based on customer feedback and extensive internal testing. We encourage you to incorporate Amazon Bedrock Intelligent Prompt Routing into your new and existing generative AI applications. Let’s dive in!

Artificial Intelligence

Artificial Intelligence AWS Generative AI Metrics

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Fine-tuning is a powerful approach in natural language processing (NLP) and generative AI, allowing businesses to tailor pre-trained large language models (LLMs) for specific tasks. This process involves updating the model’s weights to improve its performance on targeted applications.

Artificial Intelligence

Artificial Intelligence Generative AI Training Metrics

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

Generative AI question-answering applications are pushing the boundaries of enterprise productivity. These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques.

Generative AI

Generative AI Systems Review Artificial Intelligence Software Review

AI as a catalyst for ESG: Empowering CIOs to drive sustainable innovation

CIO

OCTOBER 10, 2024

Technologies such as artificial intelligence (AI), generative AI (genAI) and blockchain are revolutionizing operations. Aligning IT operations with ESG metrics: CIOs need to ensure that technology systems are energy-efficient and contribute to reducing the company’s carbon footprint.

Sustainability

Sustainability Innovation Blockchain Energy

How ServiceNow gets the most out of generative AI

CIO

NOVEMBER 15, 2023

Competition among software vendors to be “the” platform on which enterprises build their IT infrastructure is intensifying, with the focus of late on how much noise they can make about their implementation of generative AI features. “One reason we’re releasing early is because we’re ready,” says ServiceNow CIO Chris Bedi.

Generative AI

Generative AI Artificial Intelligence Technical Advisors Weak Development Team

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Intelligence

Artificial Intelligence AWS Machine Learning Load Balancer

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning - AI

MAY 2, 2025

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology.

Generative AI

Generative AI AWS Lambda Authentication

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning - AI

APRIL 30, 2025

Amazon Bedrock Model Distillation is generally available, and it addresses the fundamental challenge many organizations face when deploying generative AI: how to maintain high performance while reducing costs and latency. Evaluation metric: We use abstract syntax tree (AST) to evaluate the function calling performance.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Training

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

Generative AI is rapidly reshaping industries worldwide, empowering businesses to deliver exceptional customer experiences, streamline processes, and push innovation at an unprecedented scale. Specifically, we discuss Data Reply’s red teaming solution, a comprehensive blueprint to enhance AI safety and responsible AI practices.

Generative AI

Generative AI Weak Development Team AWS Artificial Intelligence

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Organizations building and deploying AI applications, particularly those using large language models (LLMs) with Retrieval Augmented Generation (RAG) systems, face a significant challenge: how to evaluate AI outputs effectively throughout the application lifecycle.

Knowledge Base

Knowledge Base Applications Artificial Intelligence Generative AI

3 ways to avoid the generative AI ROI doom loop

CIO

NOVEMBER 12, 2024

By Bryan Kirschner, Vice President, Strategy at DataStax. From the Wall Street Journal to the World Economic Forum, it seems like everyone is talking about the urgency of demonstrating ROI from generative AI (genAI). Make ‘soft metrics’ matter: Imagine an experienced manager with an “open door policy.”

Generative AI

Generative AI ChatGPT Meeting Metrics

Deploy large language models for a healthtech use case on Amazon SageMaker

AWS Machine Learning - AI

FEBRUARY 6, 2024

To support overarching pharmacovigilance activities, our pharmaceutical customers want to use the power of machine learning (ML) to automate the adverse event detection from various data sources, such as social media feeds, phone calls, emails, and handwritten notes, and trigger appropriate actions. The training jobs used an ml.p3dn.24xlarge

Artificial Intelligence

Artificial Intelligence Pharmaceuticals Healthcare AWS

Can AI solve your technical debt problem?

CIO

APRIL 29, 2025

Just as generative AI tools are fundamentally changing the ways developers write code, they’re being used to refactor code as well. The file can be passed directly to the LLM with simple instructions like, “Please resolve the rubocop:todos.” And that has significant implications for how IT shops can approach technical debt.

Technical Review

Technical Review Weak Development Team Software Review Systems Review

Unbundling the Graph in GraphRAG

O'Reilly Media - Ideas

NOVEMBER 19, 2024

One popular term encountered in generative AI practice is retrieval-augmented generation (RAG). Reasons for using RAG are clear: large language models (LLMs), which are effectively syntax engines, tend to “hallucinate” by inventing answers from pieces of their training data.

Artificial Intelligence

Artificial Intelligence Construction Open Source Training

AWS launches no-code service AppFabric with generative AI assistance

CIO

JUNE 28, 2023

“When you create an app bundle, AppFabric creates the required AWS Identity and Access Management (IAM) role in your AWS account, which is required to send metrics to Amazon CloudWatch and to access AWS resources such as Amazon Simple Storage Service (Amazon S3) and Amazon Kinesis Data Firehose,” AWS wrote in a blog post.

Generative AI

Generative AI AWS Artificial Intelligence Enterprise

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. Review the model response and metrics provided. You can monitor costs with AWS Cost Explorer.

Generative AI

Generative AI Artificial Intelligence AWS Serverless

Build your gen AI–based text-to-SQL application using RAG, powered by Amazon Bedrock (Claude 3 Sonnet and Amazon Titan for embedding)

AWS Machine Learning - AI

MARCH 18, 2025

Today, generative AI can help bridge this knowledge gap for nontechnical users to generate SQL queries by using a text-to-SQL application. This application allows users to ask questions in natural language and then generates a SQL query for the user’s request. The following diagram illustrates the RAG framework.

Artificial Intelligence

Artificial Intelligence Applications Generative AI Off-The-Shelf

Generative AI in enterprises: LLM orchestration holds the key to success

CIO

DECEMBER 6, 2023

Many enterprises are accelerating their artificial intelligence (AI) plans, and in particular moving quickly to stand up a full generative AI (GenAI) organization, tech stacks, projects, and governance. For readers short on time, you can skip to the section titled Strategies for effective LLM orchestration.

Artificial Intelligence

Artificial Intelligence Generative AI Enterprise Scalability
