Performance, Reference and Serverless

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. In contrast, more complex questions might require the application to summarize a lengthy dissertation by performing deeper analysis, comparison, and evaluation of the research results.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. 70B-Instruct ), offer different trade-offs between performance and resource requirements.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

If you don’t have an AWS account, refer to How do I create and activate a new Amazon Web Services account? If you don’t have an existing knowledge base, refer to Create an Amazon Bedrock knowledge base. Performance optimization The serverless architecture used in this post provides a scalable solution out of the box.

Generative AI

Generative AI Lambda Applications AWS

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

Shared components refer to the functionality and features shared by all tenants. API Gateway is serverless and hence automatically scales with traffic. The advantage of using Application Load Balancer is that it can seamlessly route the request to virtually any managed, serverless or self-hosted component and can also scale well.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Video security analysis for privileged access management using generative AI and Amazon Bedrock

AWS Machine Learning - AI

JANUARY 22, 2025

Security and compliance regulations require that security teams audit the actions performed by systems administrators using privileged credentials. Video recordings cant be easily parsed like log files, requiring security team members to playback the recordings to review the actions performed in them.

Generative AI

Generative AI Video Analysis Technical Review

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning - AI

SEPTEMBER 3, 2024

That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help. In this post, we demonstrate how to leverage the new EMR Serverless integration with SageMaker Studio to streamline your data processing and machine learning workflows.

Serverless

Serverless AWS Artificial Inteligence Big Data

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

Event-driven operations management Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads. Create business intelligence (BI) dashboards for visual representation and analysis of event data.

Cloud

Cloud AWS Serverless Policies

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

Since Amazon Bedrock is serverless, you don’t have to manage any infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. Haiku model to receive answers to an array of questions because it’s a performant, fast, and cost-effective option.

Generative AI

Generative AI AWS Technical Review Backup

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

AWS Machine Learning - AI

APRIL 3, 2024

In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model , available in Amazon Bedrock , with Amazon OpenSearch Serverless. Store embeddings into the Amazon OpenSearch Serverless as the search engine.

Serverless

Serverless Artificial Inteligence Engineering Generative AI

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? For this HPC will bring massive parallel computing, cluster and workload managers and high-performance components to the table. Each job references a job definition. Today’s server hardware is powerful enough to execute most compute tasks.

AWS

AWS Performance Storage Linux

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

The solution presented in this post takes approximately 15–30 minutes to deploy and consists of the following key components: Amazon OpenSearch Service Serverless maintains three indexes : the inventory index, the compatible parts index, and the owner manuals index. On the Agents page, you’ll notice a new agent called car-parts-agent.

Lambda

Lambda Enterprise Automotive Knowledge Base

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning - AI

APRIL 10, 2025

AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. This is particularly beneficial for tasks like automatically processing receipts or invoices, where it can perform calculations and context-aware evaluations, streamlining processes such as expense tracking or financial analysis.

Generative AI

Generative AI AWS Technical Review Artificial Inteligence

Integrating Key Vault Secrets with Azure Synapse Analytics

Apiumhub

DECEMBER 9, 2024

Give each secret a clear name, as youll use these names to reference them in Synapse. Add a Linked Service to the pipeline that references the Key Vault. When setting up a linked service for these sources, reference the names of the secrets stored in Key Vault instead of hard-coding the credentials.

Azure

Azure Analytics Storage Artificial Inteligence

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine

AWS Machine Learning - AI

NOVEMBER 22, 2023

In addition, customers are looking for choices to select the most performant and cost-effective machine learning (ML) model and the ability to perform necessary customization (fine-tuning) to fit their business use cases. For an in-depth understanding, refer to the LangChain documentation. An OpenSearch Serverless collection.

Artificial Inteligence

Artificial Inteligence Serverless Engineering Machine Learning

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

Using Amazon Bedrock Knowledge Base, the sample solution ingests these documents and generates embeddings, which are then stored and indexed in Amazon OpenSearch Serverless. The assessment is also stored in an Amazon DynamoDB table for quick retrieval and future reference. These documents form the foundation of the RAG architecture.

Generative AI

Generative AI Technical Review Software Review Systems Review

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Cost optimization – This solution uses serverless technologies, making it cost-effective for the observability infrastructure. However, some components may incur additional usage-based costs.

Generative AI

Generative AI Applications AWS Knowledge Base

DeltaStream secures cash to build real-time streaming databases

TechCrunch

JULY 13, 2022

DeltaStream provides a serverless streaming database to manage, secure and process data streams. “Serverless” refers to the way DeltaStream abstracts away infrastructure, allowing developers to interact with databases without having to think about servers. .”

Serverless

Serverless Systems Review Storage Technical Review

Securing Serverless Applications with Prisma Cloud

Palo Alto Networks

MARCH 4, 2020

The term serverless typically describes an application operating model where infrastructure is completely abstracted away. Since the release of Lambda by Amazon Web Services (AWS), the term serverless has evolved from referring to function-as-a-service (FaaS) offerings. Why Is Serverless Security Different?

Serverless

Serverless Applications Cloud Lambda

A serverless glossary

Stackery

MAY 22, 2019

With Serverless, it’s not the technology that’s hard, it’s understanding the language of a new culture and operational model. Serverless architecture has coined some new terms and, more confusingly, re-used a few older terms with new meanings. This glossary will clarify some of them. We call it Cloudlocal, try it for yourself.

Serverless

Serverless Lambda AWS Resources

Capsule gets $1.5M to build ‘super simple’ decentralized social media

TechCrunch

MARCH 9, 2021

“We think Capsule’s value will lie in its exceptional user experience, quality, performance, ease of use and high quality engineering that draws on advanced technologies such as TIC and IPFS without saddling bloat,” he says. Kobeissi’s original concept for Capsule, meanwhile, was to create self-hosting microservices.

Media

Media Social Blockchain USP

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning - AI

MAY 30, 2024

Because Amazon Bedrock is serverless, you don’t have to manage infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. CBRE, in parallel, completed UAT testing to confirm it performed as expected.

AWS

AWS Lambda Performance Artificial Inteligence

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 1, 2024

We explore how to build a fully serverless, voice-based contextual chatbot tailored for individuals who need it. The aim of this post is to provide a comprehensive understanding of how to build a voice-based, contextual chatbot that uses the latest advancements in AI and serverless computing. We discuss this later in the post.

Serverless

Serverless Artificial Inteligence AWS Software Review

Top 10 Serverless Deployment Errors (and How to Fix Them)

Stackery

FEBRUARY 5, 2020

However, in the past few years we have witnessed some recurring deployment errors while helping customers on their serverless journeys, so I thought I’d share them and their solutions in hopes of making them a little less common?—or brokenApi : Type : AWS::Serverless::Api. workingApi : Type : AWS::Serverless::Api.

Serverless

Serverless AWS How To Resources

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

Designed with a serverless, cost-optimized architecture, the platform provisions SageMaker endpoints dynamically, providing efficient resource utilization while maintaining scalability. Cost and Performance The solution achieves remarkable throughput by processing 100,000 documents within a 12-hour window.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Modernizing on AWS refers to migrating and transforming traditional applications, workloads, and infrastructure to leverage the benefits of cloud computing and AWS services. Adoption of Cloud-Native Technologies: Companies embrace cloud-native technologies such as containers, serverless computing, and microservices architecture.

AWS

AWS Strategy Serverless Microservices

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. Monitoring – Monitors system performance and user activity to maintain operational reliability and efficiency.

Knowledge Base

Knowledge Base Technical Review Generative AI Lambda

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

This capability enables Anthropics Claude models to identify whats on a screen, understand the context of UI elements, and recognize actions that should be performed such as clicking buttons, typing text, scrolling, and navigating between applications. The following diagram illustrates the solution architecture.

AWS

AWS Generative AI Linux Groups

The Anatomy of a Secure Serverless Platform, pt. I — Design

Stackery

APRIL 1, 2020

While a serverless focus might be justified by improving the overall speed and efficiency of your development workflow, security needs to remain a core element at every step. But serverless design also involves a shift in thinking and the daunting challenge of leveraging the massive suite of AWS tools and services.

Serverless

Serverless AWS Architecture Infrastructure

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

AWS Machine Learning - AI

MARCH 3, 2025

Overview of Pixtral 12B Pixtral 12B, Mistrals inaugural VLM, delivers robust performance across a range of benchmarks, surpassing other open models and rivaling larger counterparts, according to Mistrals evaluation. Mistral developed a novel architecture for Pixtral 12B, optimized for both computational efficiency and performance.

Insurance

Insurance AWS eCommerce Software Review

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

Ground truth data in AI refers to data that is known to be factual, representing the expected use case outcome for the system being modeled. These benchmarks are essential for tracking performance drift over time and for statistically comparing multiple assistants in accomplishing the same task.

Generative AI

Generative AI Systems Review Software Review Artificial Inteligence

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Governance in the context of generative AI refers to the frameworks, policies, and processes that streamline the responsible development, deployment, and use of these technologies. For a comprehensive read about vector store and embeddings, you can refer to The role of vector databases in generative AI applications.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

APRIL 23, 2024

Here are some features which we will cover: AWS CloudFormation support Private network policies for Amazon OpenSearch Serverless Multiple S3 buckets as data sources Service Quotas support Hybrid search, metadata filters, custom prompts for the RetreiveAndGenerate API, and maximum number of retrievals.

Knowledge Base

Knowledge Base Scalability Applications Generative AI

How Cato Networks uses Amazon Bedrock to transform free text search into structured GraphQL queries

AWS Machine Learning - AI

JANUARY 22, 2025

To address this challenge, we recently enabled customers to perform free text searches on the event management page, allowing new users to run queries with minimal product knowledge. This was accomplished by using foundation models (FMs) to transform natural language into structured queries that are compatible with our products GraphQL API.

Network

Network Artificial Inteligence Machine Learning Serverless

Improve public speaking skills using a generative AI-based virtual assistant with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 15, 2024

In the following sections, we walk you through constructing a scalable, serverless, end-to-end Public Speaking Mentor AI Assistant with Amazon Bedrock, Amazon Transcribe , and AWS Step Functions using provided sample code. Prompt chaining is performed with Amazon Bedrock for these prompts. See AWS CDK bootstrapping for more details.

Generative AI

Generative AI Virtualization Technical Advisors AWS

Why Serverless Won’t Replace Traditional Servers

ParkMyCloud

MAY 7, 2019

Curious why serverless is so popular – and why it won’t replace traditional servers in the cloud? Today we’ll take a look at what serverless computing is good for, and what it can’t replace. Today we’ll take a look at what serverless computing is good for, and what it can’t replace. Understanding Serverless.

Serverless

Serverless Lambda Google Cloud Azure

re:Invent Serverless Talks — Serverless SaaS Deep Dive

Stackery

DECEMBER 6, 2019

But after two days of discussing serverless development and AWS tooling with the many awesome folks who have visited the Stackery booth (plus the primer I attended on day one) I was actually feeling pretty limber for the marathon that was “Serverless SaaS Deep Dive: Building Serverless on AWS”. Serverless for SaaS.

Serverless

Serverless Lambda Microservices AWS

Boost team productivity with Amazon Q Business Insights

AWS Machine Learning - AI

APRIL 9, 2025

Overview of key metrics Amazon Q Business Insights (see the following screenshot) offers a comprehensive set of metrics that provide valuable insights into user engagement and system performance. Refer to Monitoring Amazon Q Business and Q Apps for more details. These logs are then queryable using Amazon Athena.

Weak Development Team

Weak Development Team Metrics AWS Systems Review

Running Serverless in Production: 7 Best Practices for DevOps

DevOps.com

FEBRUARY 8, 2023

Serverless in production refers to the deployment and use of serverless architecture in a live, production environment. In this context, serverless refers to a cloud computing paradigm where the cloud provider manages the infrastructure and allocates resources as needed to run and scale applications and services.

Serverless

Serverless DevOps Architecture Infrastructure

The blockchain beyond bitcoin

O'Reilly Media - Data

JUNE 12, 2018

Some even refer to these uses of a blockchain as enterprise resource planning (ERP) 2.0. Fundamentally, a smart contract can be created with nothing more than a microservice with a trigger event, otherwise known as function-as-a-service (FaaS) or a serverless model. A blockchain provides an immutable store of facts.

Blockchain

Blockchain Disaster Recovery Fashion Enterprise

Creating Your Own Serverless Cloud with Fn Project

Gorilla Logic

SEPTEMBER 5, 2019

In this Fn Project tutorial, you will learn the basic features of Fn Project by creating a serverless cloud and installing it on your own infrastructure. This will illustrate some of the most useful concepts of Fn Project and help you get familiarized with this lightweight and simple serverless platform. . What is Serverless? .

Serverless

Serverless Cloud Software Review Azure

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

MARCH 20, 2025

The Lambda function performs the actions by calling the JIRA API or database with the required parameters provided from the agent. Use the following AWS CloudFormation template , and refer to Create a stack from the CloudFormation console to launch the stack in your preferred AWS Region. List recent customer interactions.

Generative AI

Generative AI Systems Review System Lambda

Evaluation of generative AI techniques for clinical report summarization

AWS Machine Learning - AI

MAY 13, 2024

This is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading artificial intelligence (AI) companies like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon through a single API. It’s serverless, so you don’t have to manage any infrastructure.

Generative AI

Generative AI Artificial Inteligence Report Healthcare

Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock

AWS Machine Learning - AI

FEBRUARY 9, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon via a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Lambda

Lambda Generative AI AWS Microservices

App Discovery for VMware Migration: Unlocking Cloud Migration Success with Cloudsphere

CloudSphere

APRIL 16, 2025

Without visibility into how applications function, their dependencies, and resource requirements, organizations risk costly delays, performance issues, and potential business disruptions. All Enterprise Strategy Group research references in this Showcase are from this report unless otherwise noted.

Cloud

Cloud Architecture Serverless Applications

Multi-LLM routing strategies for generative AI applications on AWS

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Webinars

Trending Sources

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Webinars

Build a multi-tenant generative AI environment for your enterprise on AWS

Video security analysis for privileged access management using generative AI and Amazon Bedrock

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

Boost productivity by using AI in cloud operational health management

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

High-performance computing on AWS

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Pixtral Large is now available in Amazon Bedrock

Integrating Key Vault Secrets with Azure Synapse Analytics

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine

Accelerate AWS Well-Architected reviews with Generative AI

Empower your generative AI application with a comprehensive custom observability solution

DeltaStream secures cash to build real-time streaming databases

Securing Serverless Applications with Prisma Cloud

A serverless glossary

Capsule gets $1.5M to build ‘super simple’ decentralized social media

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Build a serverless voice-based contextual chatbot for people with disabilities using Amazon Bedrock

Top 10 Serverless Deployment Errors (and How to Fix Them)

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Getting started with computer use in Amazon Bedrock Agents

The Anatomy of a Secure Serverless Platform, pt. I — Design

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Generative AI operating models in enterprise organizations with Amazon Bedrock

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

How Cato Networks uses Amazon Bedrock to transform free text search into structured GraphQL queries

Improve public speaking skills using a generative AI-based virtual assistant with Amazon Bedrock

Why Serverless Won’t Replace Traditional Servers

re:Invent Serverless Talks — Serverless SaaS Deep Dive

Boost team productivity with Amazon Q Business Insights

Running Serverless in Production: 7 Best Practices for DevOps

The blockchain beyond bitcoin

Creating Your Own Serverless Cloud with Fn Project

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Evaluation of generative AI techniques for clinical report summarization

Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock

App Discovery for VMware Migration: Unlocking Cloud Migration Success with Cloudsphere

Stay Connected