Generative AI, Performance and Scalability

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

As enterprises increasingly embrace generative AI , they face challenges in managing the associated costs. With demand for generative AI applications surging across projects and multiple lines of business, accurately allocating and tracking spend becomes more complex.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. However, it also presents some trade-offs.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

In this post, we explore a generative AI solution leveraging Amazon Bedrock to streamline the WAFR process. We demonstrate how to harness the power of LLMs to build an intelligent, scalable system that analyzes architecture documents and generates insightful recommendations based on AWS Well-Architected best practices.

Generative AI

Generative AI Technical Review Software Review Systems Review

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. In the context of Amazon Bedrock , observability and evaluation become even more crucial.

Generative AI

Generative AI Applications AWS Knowledge Base

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

While organizations continue to discover the powerful applications of generative AI , adoption is often slowed down by team silos and bespoke workflows. To move faster, enterprises need robust operating models and a holistic approach that simplifies the generative AI lifecycle.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

The emergence of generative AI has ushered in a new era of possibilities, enabling the creation of human-like text, images, code, and more. Solution overview For this solution, you deploy a demo application that provides a clean and intuitive UI for interacting with a generative AI model, as illustrated in the following screenshot.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Conclusion In this post, we’ve introduced a scalable and efficient solution for automating batch inference jobs in Amazon Bedrock. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. Building a generative AI application SageMaker Unified Studio offers tools to discover and build with generative AI.

Generative AI

Generative AI Applications Technical Review Software Review

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. In our tests, we’ve seen substantial improvements in scaling times for generative AI model endpoints across various frameworks.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Many commercial generative AI solutions available are expensive and require user-based licenses.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal implemented several measures to improve the security, governance, and performance of its conversational AI platform.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

Companies across all industries are harnessing the power of generative AI to address various use cases. Cloud providers have recognized the need to offer model inference through an API call, significantly streamlining the implementation of AI within applications.

Generative AI

Generative AI AWS Technical Review Backup

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more.

Generative AI

Generative AI Lambda Applications AWS

Unlocking the full potential of enterprise AI

CIO

JANUARY 5, 2025

Despite the huge promise surrounding AI, many organizations are finding their implementations are not delivering as hoped. 1] The limits of siloed AI implementations According to SS&C Blue Prism , an expert on AI and automation, the chief issue is that enterprises often implement AI in siloes.

Enterprise

Enterprise Generative AI Weak Development Team Technical Review

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Generative AI can revolutionize organizations by enabling the creation of innovative applications that offer enhanced customer and employee experiences. In this post, we evaluate different generative AI operating model architectures that could be adopted.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

EBSCOlearning scales assessment generation for their online learning content with generative AI

AWS Machine Learning - AI

DECEMBER 11, 2024

In this post, we illustrate how EBSCOlearning partnered with AWS Generative AI Innovation Center (GenAIIC) to use the power of generative AI in revolutionizing their learning assessment process. Visit Generative AI Innovation Center to learn more about our program. Sonnet in Amazon Bedrock.

Generative AI

Generative AI Artificial Inteligence Guidelines Education

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Generative AI has emerged as a game changer, offering unprecedented opportunities for game designers to push boundaries and create immersive virtual worlds. At the forefront of this revolution is Stability AIs cutting-edge text-to-image AI model, Stable Diffusion 3.5 Large (SD3.5

Generative AI

Generative AI Games Development AWS

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

At the forefront of using generative AI in the insurance industry, Verisks generative AI-powered solutions, like Mozart, remain rooted in ethical and responsible AI use. Security and governance Generative AI is very new technology and brings with it new challenges related to security and compliance.

Generative AI

Generative AI Technical Review Insurance Policies

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

Generative AI agents offer a powerful solution by automatically interfacing with company systems, executing tasks, and delivering instant insights, helping organizations scale operations without scaling complexity. The following diagram illustrates the generative AI agent solution workflow.

Generative AI

Generative AI Systems Review System Lambda

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

John Snow Labs

APRIL 2, 2025

John Snow Labs, the AI for healthcare company, today announced the release of Generative AI Lab 7.0. New capabilities include no-code features to streamline the process of auditing and tuning AI models. Domain experts are often best positioned to develop AI-driven solutions tailored to their specific business needs.

Artificial Inteligence

Artificial Inteligence Software Review Generative AI Technical Review

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

Generative AI is rapidly reshaping industries worldwide, empowering businesses to deliver exceptional customer experiences, streamline processes, and push innovation at an unprecedented scale. Specifically, we discuss Data Replys red teaming solution, a comprehensive blueprint to enhance AI safety and responsible AI practices.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

Asure anticipated that generative AI could aid contact center leaders to understand their teams support performance, identify gaps and pain points in their products, and recognize the most effective strategies for training customer support representatives using call transcripts. Yasmine Rodriguez, CTO of Asure.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Generative AI: the Shortcut to Digital Modernisation

CIO

DECEMBER 20, 2023

THE BOOM OF GENERATIVE AI Digital transformation is the bleeding edge of business resilience. Notably, organisations are now turning to Generative AI to navigate the rapidly evolving tech landscape. Notably, organisations are now turning to Generative AI to navigate the rapidly evolving tech landscape.

Generative AI

Generative AI Software Review Technical Review Weak Development Team

Digital transformation 2025: What’s in, what’s out

CIO

JANUARY 7, 2025

Out: Sponsoring moonshot AI innovations lacking business drivers How much patience will boards and executives have with ongoing AI experimentation and long-term investments? 2025 will be the year when generative AI needs to generate value, says Louis Landry, CTO at Teradata.

Technical Cofounder

Technical Cofounder Technical Review Weak Development Team Software Review

Bridging the IT skills gap, Part 1: Assessing current strategies and introducing GenAI as a unified solution

CIO

JANUARY 14, 2025

The gap between emerging technological capabilities and workforce skills is widening, and traditional approaches such as hiring specialized professionals or offering occasional training are no longer sufficient as they often lack the scalability and adaptability needed for long-term success.

Technical Advisors

Technical Advisors Strategy Training Survey

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

CIO

APRIL 15, 2025

During his one hour forty minute-keynote, Thomas Kurian, CEO of Google Cloud showcased updates around most of the companys offerings, including new large language models (LLMs) , a new AI accelerator chip, new open source frameworks around agents, and updates to its data analytics, databases, and productivity tools and services among others.

Cloud

Cloud Innovation Artificial Inteligence Google Cloud

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

Scalable infrastructure – Bedrock Marketplace offers configurable scalability through managed endpoints, allowing organizations to select their desired number of instances, choose appropriate instance types, define custom auto scaling policies that dynamically adjust to workload demands, and optimize costs while maintaining performance.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. 70B-Instruct ), offer different trade-offs between performance and resource requirements.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Enterprises willing to spend up to $250 million on gen AI, but ROI remains elusive

CIO

JANUARY 10, 2025

A sharp rise in enterprise investments in generative AI is poised to reshape business operations, with 68% of companies planning to invest between $50 million and $250 million over the next year, according to KPMGs latest AI Quarterly Pulse Survey. Upskilling and seamless integration into workflows will drive adoption and ROI.

Enterprise

Enterprise Generative AI Survey Metrics

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

Some challenges include data infrastructure that allows scaling and optimizing for AI; data management to inform AI workflows where data lives and how it can be used; and associated data services that help data scientists protect AI workflows and keep their models clean. Performance enhancements.

Artificial Inteligence

Artificial Inteligence Engineering Data Storage

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

AI skills broadly include programming languages, database modeling, data analysis and visualization, machine learning (ML), statistics, natural language processing (NLP), generative AI, and AI ethics.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represent a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Inteligence Study Generative AI

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

Generative AI question-answering applications are pushing the boundaries of enterprise productivity. These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques.

Generative AI

Generative AI Systems Review Software Review Artificial Inteligence

Accelerating generative AI requires the right storage

CIO

AUGUST 9, 2023

It’s an appropriate takeaway for another prominent and high-stakes topic, generative AI. Generative AI “fuel” and the right “fuel tank” Enterprises are in their own race, hastening to embrace generative AI ( another CIO.com article talks more about this). What does this have to do with technology?

Generative AI

Generative AI Storage Scalability Technical Review

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI. AI is no longer just a tool, said Vishal Chhibbar, chief growth officer at EXL. Its a driver of transformation.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Data trends in 2025

Xebia

FEBRUARY 23, 2025

AI agents remain prone to error, and significant missteps could result in public and regulatory scrutiny. The first agents to emerge are expected to perform small, structured internal tasks with some degree of fault-tolerance, such as helping to change passwords on IT systems or book vacation time on HR platforms.

Trends

Trends Data Artificial Inteligence Weak Development Team

Generative AI is electrifying. Charge ahead or get shocked.

CIO

AUGUST 24, 2023

The generative AI differentiator Up until this point, we’ve seen customers create great products, applications, and experiences using predictive AI. But today, every customer and prospect I speak with is thinking about how generative AI (GenAI) can benefit their business. The bottom line?

Generative AI

Generative AI Artificial Inteligence Open Source Scalability

The executive’s guide to generative AI for sustainability

AWS Machine Learning - AI

APRIL 22, 2024

This post serves as a starting point for any executive seeking to navigate the intersection of generative artificial intelligence (generative AI) and sustainability. A roadmap to generative AI for sustainability In the sections that follow, we provide a roadmap for integrating generative AI into sustainability initiatives 1.

Generative AI

Generative AI Sustainability Artificial Inteligence AWS

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

As a result, DPG Media Producers have to run a screening process to consume and understand the content sufficiently to generate the missing metadata, such as brief summaries. For some content, additional screening is performed to generate subtitles and captions. About the Authors Lucas Desard is GenAI Engineer at DPG Media.

Media

Media Video Artificial Inteligence Generative AI

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies, such as AI21 Labs, Anthropic, Cohere, Meta, Mistral, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Designing generative AI workloads for resilience

AWS Machine Learning - AI

FEBRUARY 1, 2024

Resilience plays a pivotal role in the development of any workload, and generative AI workloads are no different. There are unique considerations when engineering generative AI workloads through a resilience lens. If you’re performing prompt engineering, you should persist your prompts to a reliable data store.

Generative AI

Generative AI Disaster Recovery Artificial Inteligence AWS

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. The biggest concern we hear from customers as they explore the advantages of generative AI is how to protect their highly sensitive data and investments.

Generative AI

Generative AI AWS Artificial Inteligence Infrastructure

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

AWS Machine Learning - AI

AUGUST 8, 2024

To optimize its AI/ML infrastructure, Cisco migrated its LLMs to Amazon SageMaker Inference , improving speed, scalability, and price-performance. By integrating generative AI, they can now analyze call transcripts to better understand customer pain points and improve agent productivity.

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

Generative AI has seen faster and more widespread adoption than any other technology today, with many companies already seeing ROI and scaling up use cases into wide adoption. Vendors are adding gen AI across the board to enterprise software products, and AI developers havent been idle this year either.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Multi-LLM routing strategies for generative AI applications on AWS

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Empower your generative AI application with a comprehensive custom observability solution

Build a multi-tenant generative AI environment for your enterprise on AWS

Build and deploy a UI for your generative AI applications with AWS and Python

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Unlocking the full potential of enterprise AI

Generative AI operating models in enterprise organizations with Amazon Bedrock

EBSCOlearning scales assessment generation for their online learning content with generative AI

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Generative AI: the Shortcut to Digital Modernisation

Digital transformation 2025: What’s in, what’s out

Bridging the IT skills gap, Part 1: Assessing current strategies and introducing GenAI as a unified solution

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Enterprises willing to spend up to $250 million on gen AI, but ROI remains elusive

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

9 IT skills where expertise pays the most

Model customization, RAG, or both: A case study with Amazon Nova

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Accelerating generative AI requires the right storage

AI in action: Stories of how enterprises are transforming and modernizing

Data trends in 2025

Generative AI is electrifying. Charge ahead or get shocked.

The executive’s guide to generative AI for sustainability

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Designing generative AI workloads for resilience

A secure approach to generative AI with AWS

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

12 AI predictions for 2025

Stay Connected