Artificial Inteligence, Lambda and Scalability

Artificial Inteligence

Lambda

Scalability

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server. We will deep dive into the MCP architecture later in this post.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Join 49,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Martin Fowler

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. We walk you through our solution, detailing the core logic of the Lambda functions. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.

Artificial Inteligence

Artificial Inteligence AWS Lambda Metrics

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services. An agent uses the power of an LLM to determine which function to execute, and output the result based on the prompt guide.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

National Laboratory has implemented an AI-driven document processing platform that integrates named entity recognition (NER) and large language models (LLMs) on Amazon SageMaker AI. In this post, we discuss how you can build an AI-powered document processing platform with open source NER and LLMs on SageMaker.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

The solution integrates large language models (LLMs) with your organization’s data and provides an intelligent chat assistant that understands conversation context and provides relevant, interactive responses directly within the Google Chat interface. Which LLM you want to use in Amazon Bedrock for text generation.

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Webinars

Techniques and approaches for monitoring large language models on AWS

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Use zero-shot large language models on Amazon Bedrock for custom named entity recognition

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

Creating asynchronous AI agents with Amazon Bedrock

Build a multi-tenant generative AI environment for your enterprise on AWS

Accelerate AWS Well-Architected reviews with Generative AI

WordFinder app: Harnessing generative AI on AWS for aphasia communication

How BQA streamlines education quality reporting using Amazon Bedrock

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Build a gen AI–powered financial assistant with Amazon Bedrock multi-agent collaboration

Introducing AWS MCP Servers for code assistants (Part 1)

Medical content creation in the age of generative AI

Predictive analytics helps Fresenius anticipate dialysis complications

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Empower your generative AI application with a comprehensive custom observability solution

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Building Generative AI prompt chaining workflows with human in the loop

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

How Vidmob is using generative AI to transform its creative data landscape

Create generative AI agents that interact with your companies’ systems in a few clicks using Amazon Bedrock in Amazon SageMaker Unified Studio

Enhance conversational AI with advanced routing techniques with Amazon Bedrock

Automate invoice processing with Streamlit and Amazon Bedrock

Build a contextual chatbot application using Knowledge Bases for Amazon Bedrock

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Building a virtual meteorologist using Amazon Bedrock Agents

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Boost employee productivity with automated meeting summaries using Amazon Transcribe, Amazon SageMaker, and LLMs from Hugging Face

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

Knowledge Bases for Amazon Bedrock now supports advanced parsing, chunking, and query reformulation giving greater control of accuracy in RAG based applications

Transform one-on-one customer interactions: Build speech-capable order processing agents with AWS and generative AI

How healthcare payers and plans can empower members with generative AI

Generating fashion product descriptions by fine-tuning a vision-language model with SageMaker and Amazon Bedrock

Build your multilingual personal calendar assistant with Amazon Bedrock and AWS Step Functions

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

Accenture creates a regulatory document authoring solution using AWS generative AI services

Stay Connected