Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.
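
A minimal sketch of the routing idea, assuming a naive keyword classifier and illustrative Amazon Bedrock model IDs; this is not the article's actual routing logic:

```python
import boto3

# Map task categories to Bedrock model IDs (illustrative choices).
MODEL_BY_TASK = {
    "code": "anthropic.claude-3-5-sonnet-20240620-v1:0",
    "chat": "anthropic.claude-3-haiku-20240307-v1:0",
}

bedrock = boto3.client("bedrock-runtime")

def classify(prompt: str) -> str:
    # Naive keyword classifier; production routers often use an LLM
    # or a trained classifier instead.
    return "code" if "code" in prompt.lower() else "chat"

def route_and_invoke(prompt: str) -> str:
    # Pick a model per request, then call it through the Converse API.
    response = bedrock.converse(
        modelId=MODEL_BY_TASK[classify(prompt)],
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    return response["output"]["message"]["content"][0]["text"]
```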

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

For an MCP implementation, you need scalable infrastructure to host the MCP servers, plus infrastructure to host the large language model (LLM) that performs actions with the tools those servers implement. For example, you might ask the agent to "Book a 5-day trip to Europe in January, and we like warm weather."
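
A sketch of the server side, using the MCP Python SDK's FastMCP; the trip-search tool and its data are hypothetical:

```python
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("travel-tools")

@mcp.tool()
def search_destinations(region: str, month: str) -> list[str]:
    """Return destinations in a region that are warm in the given month."""
    # Placeholder data; a real server would query a weather or travel API.
    if region.lower() == "europe" and month.lower() == "january":
        return ["Canary Islands", "Seville", "Malta"]
    return []

if __name__ == "__main__":
    mcp.run()  # serves the tool over stdio by default
```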

Trending Sources

Can serverless fix fintech’s scaling problem?

CIO

With serverless components, there is no need to manage infrastructure, and the built-in tracing, logging, monitoring, and debugging make it easy to run these workloads in production and maintain service levels. However, serverless architecture is not a silver bullet, and financial services face unique challenges of their own.
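
One reason the operations story is lighter: a function like the sketch below, assuming a payments-style AWS Lambda handler, gets CloudWatch logging for free, and X-Ray tracing is a configuration toggle rather than code:

```python
import json
import logging

logger = logging.getLogger()
logger.setLevel(logging.INFO)

def handler(event, context):
    # Logger output lands in CloudWatch Logs automatically; enabling
    # X-Ray on the function adds distributed tracing with no extra code.
    logger.info("processing payment event: %s", json.dumps(event))
    # ... business logic would go here ...
    return {"statusCode": 200, "body": json.dumps({"status": "accepted"})}
```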

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those whose expertise lies primarily in data science and machine learning. Note also that access to Amazon Bedrock foundation models is not granted by default and must be enabled first.
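
Once model access has been enabled, a pure-Python stack can stay small. A minimal sketch, assuming a Streamlit frontend and an illustrative Bedrock model ID:

```python
import boto3
import streamlit as st

bedrock = boto3.client("bedrock-runtime")

st.title("Generative AI demo")
prompt = st.text_input("Ask something")
if prompt:
    # Call the foundation model through the Bedrock Converse API.
    response = bedrock.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    st.write(response["output"]["message"]["content"][0]["text"])
```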

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

The AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures, and generative AI can accelerate the review process itself. This allows teams to focus more on implementing improvements and optimizing their AWS infrastructure, and the systematic approach leads to more reliable and standardized evaluations.
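
One way such a pipeline can gather review data before handing it to an LLM, sketched with the boto3 Well-Architected Tool client; the workload ID is a placeholder:

```python
import boto3

wa = boto3.client("wellarchitected")

def collect_high_risks(workload_id: str) -> list[str]:
    # Pull answered questions for the standard lens and keep the
    # high-risk ones, e.g. as context for an LLM summarization prompt.
    answers = wa.list_answers(WorkloadId=workload_id, LensAlias="wellarchitected")
    return [
        a["QuestionTitle"]
        for a in answers["AnswerSummaries"]
        if a.get("Risk") == "HIGH"
    ]
```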

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy AI/ML models efficiently in production environments. The downstream architecture ensures scalability, cost efficiency, and real-time access for applications.
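
A common shape for the real-time path is API Gateway in front of a Lambda function that calls a SageMaker endpoint. A sketch, with a placeholder endpoint name:

```python
import json
import boto3

runtime = boto3.client("sagemaker-runtime")

def handler(event, context):
    # Forward the request body from API Gateway to the model endpoint.
    response = runtime.invoke_endpoint(
        EndpointName="my-model-endpoint",  # placeholder
        ContentType="application/json",
        Body=event["body"],
    )
    prediction = json.loads(response["Body"].read())
    return {"statusCode": 200, "body": json.dumps(prediction)}
```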

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

It also uses a number of other AWS services, such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. API Gateway is serverless and therefore scales automatically with traffic. You can use AWS services such as Application Load Balancer to implement this approach.
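
In a setup like this, per-tenant isolation often starts in the Lambda handler. A sketch assuming an API Gateway authorizer has already placed a tenant ID in the request context; the claim name and tenant table are illustrative:

```python
import json

# Illustrative per-tenant configuration.
TENANT_CONFIG = {
    "tenant-a": {"model_id": "anthropic.claude-3-haiku-20240307-v1:0"},
    "tenant-b": {"model_id": "anthropic.claude-3-5-sonnet-20240620-v1:0"},
}

def handler(event, context):
    claims = event["requestContext"]["authorizer"]["claims"]
    tenant_id = claims.get("custom:tenant_id")  # hypothetical custom claim
    config = TENANT_CONFIG.get(tenant_id)
    if config is None:
        return {"statusCode": 403, "body": json.dumps({"error": "unknown tenant"})}
    # ... invoke the tenant's model and tenant-scoped resources here ...
    return {"statusCode": 200, "body": json.dumps({"tenant": tenant_id})}
```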