AWS, Generative AI and Reference

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. One example is software-as-a-service (SaaS) applications with tenant tiering: SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers.
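As a rough illustration of the tenant-tiering idea in the excerpt above, the sketch below maps each tier to a different model and falls back to the lowest tier for unknown tenants. The tier names, the model names, and the `select_model` helper are all hypothetical placeholders chosen for this example; they are not taken from the linked post and are not real model identifiers.

```python
# Hypothetical tier-to-model mapping. The tier names and model names are
# placeholders for illustration only, not real model identifiers.
TIER_TO_MODEL = {
    "basic": "small-fast-model",
    "pro": "mid-size-model",
    "enterprise": "large-capable-model",
}

# Assumption: tenants with an unrecognized tier are served by the basic tier.
DEFAULT_TIER = "basic"


def select_model(tenant_tier: str) -> str:
    """Return the model assigned to a tenant tier (case-insensitive)."""
    return TIER_TO_MODEL.get(tenant_tier.lower(), TIER_TO_MODEL[DEFAULT_TIER])


if __name__ == "__main__":
    for tier in ("basic", "Enterprise", "unknown"):
        print(tier, "->", select_model(tier))
```

A static lookup like this is only the simplest form of routing; a real application might also route on task type or prompt content.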

Artificial Intelligence, Generative AI, AWS, Applications

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

The AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. In this post, we explore a generative AI solution that uses Amazon Bedrock to streamline the Well-Architected Framework Review (WAFR) process.

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Build a multi-tenant generative AI environment for your enterprise on AWS

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Empower your generative AI application with a comprehensive custom observability solution

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Generative AI operating models in enterprise organizations with Amazon Bedrock

Generate financial industry-specific insights using generative AI and in-context fine-tuning

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

How AWS sales uses Amazon Q Business for customer engagement

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Add a generative AI experience to your website or web application with Amazon Q embedded

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Model customization, RAG, or both: A case study with Amazon Nova

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Video security analysis for privileged access management using generative AI and Amazon Bedrock

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

A secure approach to generative AI with AWS

Integrate foundation models into your code with Amazon Bedrock

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Pixtral Large is now available in Amazon Bedrock

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Your guide to generative AI and ML at AWS re:Invent 2023

Using responsible AI principles with Amazon Bedrock Batch Inference

Best practices to build generative AI applications on AWS

Host concurrent LLMs with LoRAX

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Getting started with computer use in Amazon Bedrock Agents

AWS empowers sales teams using generative AI solution built on Amazon Bedrock
