Artificial Inteligence, Generative AI and Reference

LLM benchmarking: How to find the right AI model

CIO

MARCH 11, 2025

But how do companies decide which large language model (LLM) is right for them? But beneath the glossy surface of advertising promises lurks the crucial question: Which of these technologies really delivers what it promises and which ones are more likely to cause AI projects to falter?

Artificial Inteligence

Artificial Inteligence How To Metrics Software Review

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server. You ask the agent to Book a 5-day trip to Europe in January and we like warm weather.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. 70B model showed significant and consistent improvements in end-to-end (E2E) scaling times.

LLM benchmarking: How to find the right AI model

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Webinars

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Generate financial industry-specific insights using generative AI and in-context fine-tuning

Build a multi-tenant generative AI environment for your enterprise on AWS

Accelerate AWS Well-Architected reviews with Generative AI

How to Use Generative AI and LLMs to Improve Search

Writer deploys home-cooked large language models to power up enterprise copy

Agentic AI design: An architectural case study

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

Empower your generative AI application with a comprehensive custom observability solution

AI dominates Gartner’s 2025 predictions

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Model customization, RAG, or both: A case study with Amazon Nova

Generative AI operating models in enterprise organizations with Amazon Bedrock

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Integrate foundation models into your code with Amazon Bedrock

Gen AI graduates to operations in higher ed

Generative AI – The End of Empty Textboxes

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Top 7 generative AI use cases for business

CIOs contend with gen AI growing pains

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

‘Just-in-time’ AI: Has its moment arrived?

Host concurrent LLMs with LoRAX

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock

WordFinder app: Harnessing generative AI on AWS for aphasia communication

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

UPS delivers customer wins with generative AI

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

Liberty Mutual CIO Monica Caldas on developing a digital-savvy workforce

Medical content creation in the age of generative AI

Five generative AI tips for every business leader

Stability AI backs effort to bring machine learning to biomed

Stay Connected