Software-as-a-service (SaaS) applications with tenant tiering: SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that is the closest match.
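As a minimal sketch of how such closest-match routing could work, the following compares the embedding of the user prompt against precomputed embeddings of the reference prompts; the `embed` callable and the reference catalog are illustrative assumptions, not the article's actual implementation.

```python
import numpy as np

def cosine(a, b):
    # Cosine similarity between two embedding vectors.
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def route(user_prompt, references, embed):
    """Return the model ID whose reference prompt best matches the user prompt.

    references: iterable of {"category": str, "model_id": str, "vector": np.ndarray}
    embed: callable mapping text to an embedding vector (assumed, not shown)
    """
    query_vec = embed(user_prompt)
    best = max(references, key=lambda ref: cosine(query_vec, ref["vector"]))
    return best["model_id"]
```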
Shared components refer to the functionality and features shared by all tenants. API Gateway is serverless and hence automatically scales with traffic. The advantage of using Application Load Balancer is that it can seamlessly route requests to virtually any managed, serverless, or self-hosted component and can also scale well.
AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. For more information on generating JSON using the Converse API, refer to Generating JSON with the Amazon Bedrock Converse API. In this post, we discuss the features of Pixtral Large and its possible use cases.
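As a hedged illustration of calling Pixtral Large through the Converse API and asking for JSON output (the model identifier below is an assumption; check the model catalog in your Region):

```python
import json
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="mistral.pixtral-large-2502-v1:0",  # assumed identifier
    messages=[{
        "role": "user",
        "content": [{"text": "List three EU capitals as a JSON array of strings. Return only JSON."}],
    }],
    inferenceConfig={"maxTokens": 256, "temperature": 0},
)

# Assumes the model returned clean JSON, as instructed in the prompt.
text = response["output"]["message"]["content"][0]["text"]
print(json.loads(text))
```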
Give each secret a clear name, as you'll use these names to reference them in Synapse. Add a linked service to the pipeline that references the Key Vault. When setting up a linked service for these sources, reference the names of the secrets stored in Key Vault instead of hard-coding the credentials.
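For illustration, a Synapse linked service that reads its connection string from Key Vault might look like the following (expressed here as a Python dict mirroring the JSON definition; all names are placeholders):

```python
# Sketch of a linked service definition that references a Key Vault secret
# instead of embedding credentials. "MyKeyVaultLS" and "sql-conn-string"
# are hypothetical names.
linked_service = {
    "name": "AzureSqlViaKeyVault",
    "properties": {
        "type": "AzureSqlDatabase",
        "typeProperties": {
            "connectionString": {
                "type": "AzureKeyVaultSecret",
                "store": {
                    "referenceName": "MyKeyVaultLS",  # the Key Vault linked service
                    "type": "LinkedServiceReference",
                },
                "secretName": "sql-conn-string",  # must match the secret name in Key Vault
            }
        },
    },
}
```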
Amazon Bedrock Custom Model Import enables the import and use of your customized models alongside existing FMs through a single serverless, unified API. This serverless approach eliminates the need for infrastructure management while providing enterprise-grade security and scalability.
That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help. In this post, we demonstrate how to use the new EMR Serverless integration with SageMaker Studio to streamline your data processing and machine learning workflows.
If you don’t have an AWS account, refer to How do I create and activate a new Amazon Web Services account? If you don’t have an existing knowledge base, refer to Create an Amazon Bedrock knowledge base. Performance optimization: The serverless architecture used in this post provides a scalable solution out of the box.
Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Cost optimization – This solution uses serverless technologies, making it cost-effective for the observability infrastructure. However, some components may incur additional usage-based costs.
In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model, available in Amazon Bedrock, with Amazon OpenSearch Serverless. Amazon SageMaker Studio is an integrated development environment (IDE) for machine learning (ML).
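A minimal sketch of producing one such multimodal embedding with the Titan model via the Bedrock runtime (the file path and pairing text are illustrative):

```python
import base64
import json
import boto3

bedrock = boto3.client("bedrock-runtime")

# Read a product image and base64-encode it for the request body.
with open("product.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = bedrock.invoke_model(
    modelId="amazon.titan-embed-image-v1",
    body=json.dumps({
        "inputText": "red leather handbag",  # optional text paired with the image
        "inputImage": image_b64,
    }),
)
embedding = json.loads(response["body"].read())["embedding"]
print(len(embedding))  # vector ready to index into OpenSearch Serverless
```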
In addition, customers are looking for choices to select the most performant and cost-effective machine learning (ML) model and the ability to perform necessary customization (fine-tuning) to fit their business use cases. For an in-depth understanding, refer to the LangChain documentation. An OpenSearch Serverless collection.
With the Amazon Bedrock serverless experience, you can get started quickly, privately customize FMs with your own data, and quickly integrate and deploy them into your applications using AWS tools without having to manage the infrastructure.
These services use advanced machine learning (ML) algorithms and computer vision techniques to perform functions like object detection and tracking, activity recognition, and text and audio recognition. The following graphic is a simple example of Windows Server Console activity that could be captured in a video recording.
Event-driven operations management: Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads. Create business intelligence (BI) dashboards for visual representation and analysis of event data.
Designed with a serverless, cost-optimized architecture, the platform provisions SageMaker endpoints dynamically, providing efficient resource utilization while maintaining scalability. References: What is Intelligent Document Processing (IDP)? The following diagram illustrates the solution architecture.
Amazon SageMaker Canvas is a no-code machine learning (ML) service that empowers business analysts and domain experts to build, train, and deploy ML models without writing a single line of code. For instructions to catalog the data, refer to Populating the AWS Glue Data Catalog. For Select a data source, choose Athena.
The architecture is complemented by essential supporting services, including AWS Key Management Service (AWS KMS) for security and Amazon CloudWatch for monitoring, creating a resilient, serverless container environment that alleviates the need to manage underlying infrastructure while maintaining robust security and high availability.
Using Amazon Bedrock Knowledge Base, the sample solution ingests these documents and generates embeddings, which are then stored and indexed in Amazon OpenSearch Serverless. Amazon Textract extracts the content from the uploaded documents, making it machine-readable for further processing.
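A minimal sketch of the extraction step with the synchronous Textract API (bucket and object names are placeholders; multipage PDFs would need the asynchronous StartDocumentTextDetection API instead):

```python
import boto3

textract = boto3.client("textract")

response = textract.detect_document_text(
    Document={"S3Object": {"Bucket": "my-docs-bucket", "Name": "uploads/page-1.png"}}
)

# Keep only LINE blocks to reassemble the machine-readable text.
lines = [block["Text"] for block in response["Blocks"] if block["BlockType"] == "LINE"]
print("\n".join(lines))
```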
We've received much positive feedback from our clients, with Example Corp and AnyCompany Networks among those who have expressed satisfaction with our services. We're more than happy to provide further references upon request. We must also include .$ after our text key to reference a node in this state's JSON input.
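As a small illustration of that convention: in the Amazon States Language, a key ending in `.$` is resolved as a JSONPath against the state's input (the state name, resource, and paths below are placeholders, expressed as a Python dict mirroring the JSON):

```python
state = {
    "GenerateTestimonial": {
        "Type": "Task",
        "Resource": "arn:aws:states:::lambda:invoke",
        "Parameters": {
            "FunctionName": "GenerateTestimonialFn",  # static value, taken literally
            "Payload": {
                # "text.$" is resolved at runtime from the input,
                # e.g. {"review": "Great service!"}
                "text.$": "$.review"
            },
        },
        "End": True,
    }
}
```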
We explore how to build a fully serverless, voice-based contextual chatbot tailored for individuals who need it. The aim of this post is to provide a comprehensive understanding of how to build a voice-based, contextual chatbot that uses the latest advancements in AI and serverless computing. We discuss this later in the post.
Readers will learn the key design decisions, benefits achieved, and lessons learned from Hearst’s innovative CCoE team. This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact.
A serverless, event-driven workflow using Amazon EventBridge and AWS Lambda automates the post-event processing. The chat assistant is powered by Amazon Bedrock and retrieves information from the Amazon OpenSearch Serverless index, enabling seamless access to session insights.
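A minimal sketch of wiring such an event-driven step with boto3 (the rule name, event pattern, and function ARN are placeholders):

```python
import boto3

events = boto3.client("events")

# Rule matching a hypothetical post-event signal.
events.put_rule(
    Name="post-event-processing",
    EventPattern='{"source": ["custom.sessions"], "detail-type": ["SessionEnded"]}',
    State="ENABLED",
)
# Point the rule at the Lambda function that does the processing.
events.put_targets(
    Rule="post-event-processing",
    Targets=[{
        "Id": "process-session",
        "Arn": "arn:aws:lambda:us-east-1:123456789012:function:ProcessSession",
    }],
)
# Note: the Lambda function also needs a resource-based permission
# (lambda add-permission) allowing events.amazonaws.com to invoke it.
```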
Amazon Comprehend provides real-time APIs, such as DetectPiiEntities and DetectEntities, which use natural language processing (NLP) machine learning (ML) models to identify text portions for redaction. For information about deploying the Amazon Q Business application with sample boosting and guardrails, refer to the GitHub repo.
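A minimal sketch of calling DetectPiiEntities to locate spans for redaction:

```python
import boto3

comprehend = boto3.client("comprehend")

text = "Contact Jane Doe at jane@example.com or 555-0100."
response = comprehend.detect_pii_entities(Text=text, LanguageCode="en")

# Each entity carries character offsets, so the span can be masked in place.
for entity in response["Entities"]:
    span = text[entity["BeginOffset"]:entity["EndOffset"]]
    print(entity["Type"], span, round(entity["Score"], 2))
```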
You can also use this model with Amazon SageMaker JumpStart, a machine learning (ML) hub that provides access to algorithms and models that can be deployed with one click for running inference. To learn more about how IAM works with Amazon Bedrock Marketplace, refer to Set up Amazon Bedrock Marketplace.
Because Amazon Bedrock is serverless, you don’t have to manage any infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. For more details and specific model prices, refer to Amazon Bedrock Pricing.
Ground truth data in AI refers to data that is known to be factual, representing the expected use case outcome for the system being modeled. Document Section Targeting - Reference specific sections when the information location is relevant - Example: "In Section [X] of [Document Name], what are the steps for [specific process]?"
The solution presented in this post takes approximately 15–30 minutes to deploy and consists of the following key components: Amazon OpenSearch Serverless maintains three indexes: the inventory index, the compatible parts index, and the owner manuals index.
Here are some of the features we cover: AWS CloudFormation support; private network policies for Amazon OpenSearch Serverless; multiple S3 buckets as data sources; Service Quotas support; and hybrid search, metadata filters, custom prompts for the RetrieveAndGenerate API, and a maximum number of retrievals.
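A hedged sketch of two of these features, hybrid search and a metadata filter, through the RetrieveAndGenerate API (the knowledge base ID, model ARN, and filter field are placeholders):

```python
import boto3

client = boto3.client("bedrock-agent-runtime")

response = client.retrieve_and_generate(
    input={"text": "What is our refund policy?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB123EXAMPLE",
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-sonnet-20240229-v1:0",
            "retrievalConfiguration": {
                "vectorSearchConfiguration": {
                    "numberOfResults": 5,
                    "overrideSearchType": "HYBRID",  # hybrid (keyword + semantic) search
                    "filter": {"equals": {"key": "department", "value": "billing"}},
                }
            },
        },
    },
)
print(response["output"]["text"])
```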
Governance in the context of generative AI refers to the frameworks, policies, and processes that streamline the responsible development, deployment, and use of these technologies. For a comprehensive read about vector store and embeddings, you can refer to The role of vector databases in generative AI applications.
Like all AI, generative AI works by using machine learning models—very large models that are pretrained on vast amounts of data called foundation models (FMs). The second task then asks the LLM to compare the generated response to the reference response using the rules and generate an evaluation score.
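A minimal sketch of that second, LLM-as-judge task using the Bedrock Converse API (the judge model ID and rubric are illustrative, and the two responses are placeholder strings):

```python
import boto3

# Placeholders; in the actual pipeline these come from the first task.
reference_response = "Refunds are issued within 14 days of return receipt."
generated_response = "Customers receive refunds about two weeks after returning items."

client = boto3.client("bedrock-runtime")

prompt = (
    "Rules: rate factual agreement with the reference on a 1-5 scale.\n"
    f"Reference response: {reference_response}\n"
    f"Generated response: {generated_response}\n"
    "Return only the integer score."
)
result = client.converse(
    modelId="anthropic.claude-3-sonnet-20240229-v1:0",  # assumed judge model
    messages=[{"role": "user", "content": [{"text": prompt}]}],
    inferenceConfig={"temperature": 0},
)
print(result["output"]["message"]["content"][0]["text"].strip())
```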
Amazon Bedrock offers fine-tuning capabilities that allow you to customize these pre-trained models using proprietary call transcript data, facilitating high accuracy and relevance without the need for extensive machine learning (ML) expertise. Architecture: The following diagram illustrates the solution architecture. Choose Create new.
With the Amazon Bedrock serverless experience, you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using the Amazon Web Services (AWS) tools without having to manage infrastructure. Data to manage sessions is automatically purged after 24 hours.
Refer to Monitoring Amazon Q Business and Q Apps for more details. Several reference calculators are publicly available online, ranging from basic templates to more sophisticated models, which can serve as a starting point for organizations to build their own ROI analysis tools. These logs are then queryable using Amazon Athena.
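A minimal sketch of running such a query over the exported logs with the Athena API (the database, table, and output location are placeholders):

```python
import boto3

athena = boto3.client("athena")

# Kick off an asynchronous query; results land in the given S3 location.
q = athena.start_query_execution(
    QueryString="SELECT user_id, COUNT(*) AS queries FROM qbusiness_logs GROUP BY user_id",
    QueryExecutionContext={"Database": "analytics"},
    ResultConfiguration={"OutputLocation": "s3://my-athena-results/"},
)
print(q["QueryExecutionId"])
```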
In this post, we illustrate contextually enhancing a chatbot by using Knowledge Bases for Amazon Bedrock, a fully managed serverless service. Knowledge Bases for Amazon Bedrock is a serverless option to build powerful conversational AI systems using RAG. For more information, refer to Model access.
Knowledge Bases is completely serverless, so you don’t need to manage any infrastructure, and when using Knowledge Bases, you’re only charged for the models, vector databases and storage you use. For more information, refer to Model access. For instructions, refer to Manage your knowledge base. The S3 bucket.
This domain knowledge is traditionally captured in reference manuals, service bulletins, quality ticketing systems, engineering drawings, and more, but the quantity and complexity of documents is growing and takes time to learn. In RAG, these knowledge sources are often referred to as a knowledge base. Try it out!
HLE is multi-modal, featuring questions that are either text-only or accompanied by an image reference, and includes both multiple-choice and exact-match questions for automated answer verification. Evaluated models include GPT-4o and OpenAI o1 (more details in this paper).
Amazon Titan Multimodal Embeddings models can be used to search for a style in a database using either a text prompt or a reference image provided by the user to find similar styles. We use the Titan Multimodal Embeddings model to embed each product image and store the embeddings in Amazon OpenSearch Serverless for future retrieval.
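A minimal sketch of the retrieval side, a k-NN query against the OpenSearch Serverless index (the host, index, vector field, and the embedding helper are hypothetical placeholders):

```python
import boto3
from opensearchpy import OpenSearch, RequestsHttpConnection, AWSV4SignerAuth

# SigV4 auth for OpenSearch Serverless uses the "aoss" service name.
credentials = boto3.Session().get_credentials()
auth = AWSV4SignerAuth(credentials, "us-east-1", "aoss")

client = OpenSearch(
    hosts=[{"host": "abc123.us-east-1.aoss.amazonaws.com", "port": 443}],  # placeholder
    http_auth=auth,
    use_ssl=True,
    connection_class=RequestsHttpConnection,
)

# The query vector would come from the Titan Multimodal Embeddings call,
# computed from either a text prompt or a reference image.
query_embedding = get_titan_embedding("red leather handbag")  # hypothetical helper

body = {
    "size": 5,
    "query": {"knn": {"image_vector": {"vector": query_embedding, "k": 5}}},
}
for hit in client.search(index="products", body=body)["hits"]["hits"]:
    print(hit["_id"], hit["_score"])
```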
In this post, we demonstrate how you can build chatbots with QnAIntent, which connects to a knowledge base in Amazon Bedrock (powered by Amazon OpenSearch Serverless as a vector database), and build rich, self-service, conversational experiences for your customers. For more information, refer to Create a knowledge base. Choose Next.
Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. Refer to the GitHub repository for deployment instructions.
Use the following AWS CloudFormation template, and refer to Create a stack from the CloudFormation console to launch the stack in your preferred AWS Region. We don’t focus on defining these services in this post, but we do use them to show use cases for the new Amazon Bedrock features within SageMaker Unified Studio.
By using the AWS CDK, the solution sets up the necessary resources, including an AWS Identity and Access Management (IAM) role, Amazon OpenSearch Serverless collection and index, and knowledge base with its associated data source. For installation instructions, refer to the AWS CDK workshop. The AWS CDK already set up.
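A minimal AWS CDK (Python) sketch of two of those resources, the IAM role and the OpenSearch Serverless vector collection (names are placeholders; the knowledge base and data source are omitted for brevity):

```python
from aws_cdk import Stack, aws_iam as iam, aws_opensearchserverless as aoss
from constructs import Construct

class KnowledgeBaseStack(Stack):
    def __init__(self, scope: Construct, construct_id: str, **kwargs) -> None:
        super().__init__(scope, construct_id, **kwargs)

        # Role the knowledge base assumes to read the collection.
        kb_role = iam.Role(
            self, "KnowledgeBaseRole",
            assumed_by=iam.ServicePrincipal("bedrock.amazonaws.com"),
        )

        # Vector collection backing the knowledge base index.
        collection = aoss.CfnCollection(
            self, "VectorCollection",
            name="kb-vectors",
            type="VECTORSEARCH",
        )
```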
Generative AI empowers organizations to combine their data with the power of machine learning (ML) algorithms to generate human-like content, streamline processes, and unlock innovation. The following diagram illustrates this architecture. The following screenshot shows an example of the conversational interface.
It’s serverless, so you don’t have to manage any infrastructure. Evaluating LLMs is an undervalued part of the machine learning (ML) pipeline. These metrics assess how well a machine-generated summary compares to one or more reference summaries. It is time-consuming but, at the same time, critical.
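A minimal sketch of such a comparison using the open-source rouge-score package (pip install rouge-score); the reference and generated texts are illustrative:

```python
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)

reference = "Amazon Bedrock is a serverless service for foundation models."
generated = "Bedrock is a serverless AWS service offering foundation models."

# score(target, prediction) returns precision/recall/F1 per metric.
scores = scorer.score(reference, generated)
for name, s in scores.items():
    print(name, round(s.fmeasure, 3))
```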
In the following sections, we walk you through constructing a scalable, serverless, end-to-end Public Speaking Mentor AI Assistant with Amazon Bedrock, Amazon Transcribe , and AWS Step Functions using provided sample code. Refer to Configure Amazon SNS to send messages for alerts to other destinations for more information.
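A minimal sketch of the Amazon SNS alerting step (the topic ARN is a placeholder):

```python
import boto3

sns = boto3.client("sns")

# Notify the user once the Step Functions workflow has produced feedback.
sns.publish(
    TopicArn="arn:aws:sns:us-east-1:123456789012:speech-feedback",
    Subject="Your speech feedback is ready",
    Message="Your public speaking analysis has completed. Log in to view the results.",
)
```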