Artificial Inteligence, Engineering and Generative AI

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Have we reached the end of ‘too expensive’ for enterprise software?

CIO

JANUARY 9, 2025

Generative artificial intelligence ( genAI ) and in particular large language models ( LLMs ) are changing the way companies develop and deliver software. These autoregressive models can ultimately process anything that can be easily broken down into tokens: image, video, sound and even proteins.

Artificial Inteligence

Artificial Inteligence Software Review Software Enterprise

Dulling the impact of AI-fueled cyber threats with AI

CIO

OCTOBER 24, 2024

IT leaders are placing faith in AI. Consider 76 percent of IT leaders believe that generative AI (GenAI) will significantly impact their organizations, with 76 percent increasing their budgets to pursue AI. But when it comes to cybersecurity, AI has become a double-edged sword.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Generative AI Training

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. 70B model showed significant and consistent improvements in end-to-end (E2E) scaling times.

Multi-LLM routing strategies for generative AI applications on AWS

Have we reached the end of ‘too expensive’ for enterprise software?

Webinars

Trending Sources

Dulling the impact of AI-fueled cyber threats with AI

Webinars

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

LLMs in Production: Tooling, Process, and Team Structure

Build and deploy a UI for your generative AI applications with AWS and Python

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AI-native software engineering may be closer than developers think

EBSCOlearning scales assessment generation for their online learning content with generative AI

How to Achieve High-Accuracy Results When Using LLMs

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

AI in the C-suite: Using AI to shape business strategy

Accelerate AWS Well-Architected reviews with Generative AI

Generate financial industry-specific insights using generative AI and in-context fine-tuning

Generative AI Deep Dive: Advancing from Proof of Concept to Production

The key to operational AI: Modern data architecture

Build a multi-tenant generative AI environment for your enterprise on AWS

5 Things To Look For When Evaluating AI Startups

Salesforce Ventures targets new $250M fund at generative AI startups

How MCP can revolutionize the way DevOps teams use AI

How to Use Generative AI and LLMs to Improve Search

Insights in implementing production-ready solutions with generative AI

Empower your generative AI application with a comprehensive custom observability solution

Gen AI graduates to operations in higher ed

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

10 generative AI certs and certificate programs to grow your skills

Layoffs, AI demand create mismatched talent market for IT skills

Model customization, RAG, or both: A case study with Amazon Nova

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

IDC chief research officer: GenAI, from experimentation to adoption

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AI Pact: Simplifying EU AI Act compliance for enterprises

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Generative AI – The End of Empty Textboxes

Integrate foundation models into your code with Amazon Bedrock

7 ways gen AI can create more work than it saves

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

CIOs contend with gen AI growing pains

Top 7 generative AI use cases for business

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Stay Connected