Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

For an MCP implementation, you need scalable infrastructure to host the MCP servers, as well as infrastructure to host the large language model (LLM), which performs actions using the tools implemented by the MCP servers. We dive deeper into the MCP architecture later in this post.
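To make the moving parts concrete, here is a minimal, illustrative MCP server sketch, assuming the open-source MCP Python SDK (the `mcp` package); the server name, the `get_order_status` tool, and its logic are hypothetical and not taken from the post.

```python
# Minimal sketch of an MCP server exposing one tool (assumes the open-source
# MCP Python SDK: pip install mcp). The tool name and logic are illustrative.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("demo-tools")  # server name shown to connecting MCP clients

@mcp.tool()
def get_order_status(order_id: str) -> str:
    """Return the status of an order (placeholder logic)."""
    # In a real deployment this would query a database or downstream API.
    return f"Order {order_id} is in transit."

if __name__ == "__main__":
    mcp.run()  # serves over stdio by default
```

An LLM hosted on SageMaker AI would connect to such a server through an MCP client and decide when to invoke the tool.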

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.
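As a hedged illustration of the routing idea (not the post's specific strategies), the sketch below sends short prompts to a smaller, cheaper model and everything else to a larger one via Amazon Bedrock's Converse API; the model IDs and the length heuristic are placeholders.

```python
# Illustrative multi-LLM router: a trivial heuristic picks between two
# Bedrock models. Model IDs and the threshold are placeholders.
import boto3

bedrock = boto3.client("bedrock-runtime")

SMALL_MODEL = "anthropic.claude-3-haiku-20240307-v1:0"      # fast, low cost
LARGE_MODEL = "anthropic.claude-3-5-sonnet-20240620-v1:0"   # higher capability

def route(prompt: str) -> str:
    """Pick a model ID based on a simple prompt-length heuristic."""
    return SMALL_MODEL if len(prompt) < 200 else LARGE_MODEL

def invoke(prompt: str) -> str:
    response = bedrock.converse(
        modelId=route(prompt),
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    return response["output"]["message"]["content"][0]["text"]
```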

Reimagine application modernisation with the power of generative AI

CIO

GenAI's myriad capabilities enable enterprises to simplify coding and facilitate more intelligent, automated system operations. By leveraging large language models and platforms like Azure OpenAI, for example, organisations can transform outdated code into modern, customised frameworks that support advanced features.
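A minimal sketch of that idea, assuming the `openai` Python SDK (v1+) against an Azure OpenAI deployment; the deployment name, API version, legacy snippet, and prompt are placeholders, not the article's implementation.

```python
# Hedged sketch: ask an Azure OpenAI deployment to propose a modernised
# rewrite of a legacy code fragment. Endpoint, key, deployment name, and
# api_version are placeholders supplied via environment/config.
import os
from openai import AzureOpenAI

client = AzureOpenAI(
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_version="2024-06-01",
)

legacy_code = "DATA DIVISION. 01 TOTAL PIC 9(6)."  # toy legacy fragment

response = client.chat.completions.create(
    model="gpt-4o",  # the Azure *deployment* name, not the model family
    messages=[
        {"role": "system", "content": "You modernise legacy code."},
        {"role": "user", "content": f"Rewrite this in idiomatic Python:\n{legacy_code}"},
    ],
)
print(response.choices[0].message.content)
```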

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. In testing, a 70B-parameter model showed significant and consistent improvements in end-to-end (E2E) scaling times.

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speakers: Maher Hanafi, VP of Engineering at Betterworks, and Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence.

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

The emergence of generative AI has ushered in a new era of possibilities, enabling the creation of human-like text, images, code, and more. For this solution, you deploy a demo application that provides a clean and intuitive UI for interacting with a generative AI model.
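A minimal sketch of such a UI, assuming Streamlit for the front end and Amazon Bedrock's Converse API for inference; the post's actual stack may differ, and the model ID here is a placeholder.

```python
# Minimal chat-style UI sketch: Streamlit front end calling a Bedrock model.
# Run with: streamlit run app.py  (model ID is a placeholder).
import boto3
import streamlit as st

bedrock = boto3.client("bedrock-runtime")
MODEL_ID = "anthropic.claude-3-haiku-20240307-v1:0"

st.title("Generative AI demo")
prompt = st.text_area("Ask something")

if st.button("Generate") and prompt:
    response = bedrock.converse(
        modelId=MODEL_ID,
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    st.write(response["output"]["message"]["content"][0]["text"])
```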

EBSCOlearning scales assessment generation for their online learning content with generative AI

AWS Machine Learning - AI

In this post, we illustrate how EBSCOlearning partnered with AWS Generative AI Innovation Center (GenAIIC) to use the power of generative AI in revolutionizing their learning assessment process. The evaluation process includes three phases: LLM-based guideline evaluation, rule-based checks, and a final evaluation.
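The sketch below illustrates how those three phases could be chained; the function names and pass/fail criteria are hypothetical, not EBSCOlearning's actual implementation.

```python
# Hypothetical three-phase evaluation pipeline for generated assessment items.
from dataclasses import dataclass

@dataclass
class Assessment:
    question: str
    answer: str

def llm_guideline_check(item: Assessment) -> bool:
    """Phase 1: ask an LLM judge whether the item follows authoring guidelines."""
    # Placeholder: call an LLM with a grading prompt and parse its verdict.
    return True

def rule_based_checks(item: Assessment) -> bool:
    """Phase 2: cheap deterministic checks, e.g. length and formatting rules."""
    return 0 < len(item.question) <= 300 and bool(item.answer.strip())

def final_evaluation(item: Assessment) -> bool:
    """Phase 3: only items passing both earlier phases reach the final review."""
    return llm_guideline_check(item) and rule_based_checks(item)

item = Assessment("What does E2E stand for?", "End to end.")
print("accepted" if final_evaluation(item) else "rejected")
```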