AWS, Generative AI and Reference

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs).

Multi-LLM routing strategies for generative AI applications on AWS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Webinars

Trending Sources

Build a multi-tenant generative AI environment for your enterprise on AWS

Webinars

Accelerate AWS Well-Architected reviews with Generative AI

Empower your generative AI application with a comprehensive custom observability solution

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Generative AI operating models in enterprise organizations with Amazon Bedrock

Generate financial industry-specific insights using generative AI and in-context fine-tuning

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

How AWS sales uses Amazon Q Business for customer engagement

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Video security analysis for privileged access management using generative AI and Amazon Bedrock

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Add a generative AI experience to your website or web application with Amazon Q embedded

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Model customization, RAG, or both: A case study with Amazon Nova

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

A secure approach to generative AI with AWS

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Integrate foundation models into your code with Amazon Bedrock

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Your guide to generative AI and ML at AWS re:Invent 2023

Dynamic video content moderation and policy evaluation using AWS generative AI services

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Using responsible AI principles with Amazon Bedrock Batch Inference

Enable Amazon Bedrock cross-Region inference in multi-account environments

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Pixtral Large is now available in Amazon Bedrock

Getting started with computer use in Amazon Bedrock Agents

Expectations vs. reality: A real-world check on generative AI

The future of data: A 5-pillar approach to modern data management

Best practices to build generative AI applications on AWS

The executive’s guide to generative AI for sustainability

Stay Connected