Generative AI, Reference and Scalability

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

In this post, we explore a generative AI solution leveraging Amazon Bedrock to streamline the WAFR process. We demonstrate how to harness the power of LLMs to build an intelligent, scalable system that analyzes architecture documents and generates insightful recommendations based on AWS Well-Architected best practices.

Multi-LLM routing strategies for generative AI applications on AWS

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Trending Sources

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Webinars

Build a multi-tenant generative AI environment for your enterprise on AWS

Empower your generative AI application with a comprehensive custom observability solution

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Generative AI operating models in enterprise organizations with Amazon Bedrock

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

CIOs contend with gen AI growing pains

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Model customization, RAG, or both: A case study with Amazon Nova

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

12 AI predictions for 2025

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Designing generative AI workloads for resilience

Pixtral Large is now available in Amazon Bedrock

The executive’s guide to generative AI for sustainability

A secure approach to generative AI with AWS

Medical content creation in the age of generative AI

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

Amazon Bedrock Guardrails announces IAM Policy-based enforcement to deliver safe AI interactions

Improving air quality with generative AI

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AI dominates Gartner’s 2025 predictions

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

Building Generative AI prompt chaining workflows with human in the loop

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Benchmarking Amazon Nova and GPT-4o models with FloTorch

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

Safeguard OT Environments with the Power of Precision AI

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

How Vidmob is using generative AI to transform its creative data landscape

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Evaluation of generative AI techniques for clinical report summarization

Best practices to build generative AI applications on AWS

Stay Connected