Artificial Inteligence, AWS and Generative AI

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

The emergence of generative AI has ushered in a new era of possibilities, enabling the creation of human-like text, images, code, and more. Solution overview For this solution, you deploy a demo application that provides a clean and intuitive UI for interacting with a generative AI model, as illustrated in the following screenshot.

Multi-LLM routing strategies for generative AI applications on AWS

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Trending Sources

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Webinars

Accelerate AWS Well-Architected reviews with Generative AI

Build a multi-tenant generative AI environment for your enterprise on AWS

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

United Airlines sets its flight plan for gen AI success

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Extends Generative AI Reach to Third-Party IT Platforms

EBSCOlearning scales assessment generation for their online learning content with generative AI

Empower your generative AI application with a comprehensive custom observability solution

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Generate financial industry-specific insights using generative AI and in-context fine-tuning

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Better together? Why AWS is unifying data analytics and AI services in SageMaker

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Model customization, RAG, or both: A case study with Amazon Nova

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AI in action: Stories of how enterprises are transforming and modernizing

Insights in implementing production-ready solutions with generative AI

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Integrate foundation models into your code with Amazon Bedrock

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Generative AI operating models in enterprise organizations with Amazon Bedrock

Introducing AWS MCP Servers for code assistants (Part 1)

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Camelot Secure’s AI wizard eases path to cybersecurity compliance

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Amazon wants to boost ten generative AI startups around the globe

5 ways to deploy your own large language model

How AWS sales uses Amazon Q Business for customer engagement

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

Stay Connected