With serverless components, there is no need to manage infrastructure, and the built-in tracing, logging, monitoring, and debugging make it easy to run these workloads in production and maintain service levels. Financial services face unique challenges, however, and it is important to understand that serverless architecture is not a silver bullet.
We will take a deep dive into the MCP architecture later in this post. For an MCP implementation, you need a scalable infrastructure to host the MCP servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server.
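To make that concrete, here is a minimal sketch of an MCP server built with the official Python SDK's FastMCP helper; the server name and the get_account_balance tool are hypothetical stand-ins for whatever tools a real deployment would expose.

```python
# A minimal MCP server sketch using the official Python SDK (pip install mcp).
# The tool below is a stub for illustration, not a real integration.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("demo-tools")  # server name is illustrative

@mcp.tool()
def get_account_balance(account_id: str) -> str:
    """Return the balance for an account (stubbed for this example)."""
    return f"Balance for {account_id}: $100.00"

if __name__ == "__main__":
    mcp.run()  # defaults to the stdio transport
```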
Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.
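As a hedged illustration of that multi-model pattern, the sketch below routes each task type to a different Bedrock model ID; the task labels and model IDs are assumptions for the example, not recommendations.

```python
# A hypothetical routing layer: pick a model per task type so each request
# goes to the most suitable (or cheapest) LLM. Labels and IDs are examples.
DEFAULT_MODEL = "amazon.titan-text-express-v1"

MODEL_BY_TASK = {
    "code": "anthropic.claude-3-5-sonnet-20240620-v1:0",
    "classification": DEFAULT_MODEL,
}

def pick_model(task: str) -> str:
    """Return the Bedrock model ID to use for a given task type."""
    return MODEL_BY_TASK.get(task, DEFAULT_MODEL)
```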
Augmented data management with AI/ML: Artificial intelligence and machine learning transform traditional data management paradigms by automating labour-intensive processes and enabling smarter decision-making. With machine learning, these processes can be refined over time and anomalies can be predicted before they arise.
National Laboratory has implemented an AI-driven document processing platform that integrates named entity recognition (NER) and large language models (LLMs) on Amazon SageMaker AI. In this post, we discuss how you can build an AI-powered document processing platform with open source NER and LLMs on SageMaker.
With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy AI/ML models efficiently in production environments. The downstream architecture ensures scalability, cost efficiency, and real-time access to applications.
Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. The Streamlit application will now display a button labeled Get LLM Response.
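A minimal sketch of that kind of Streamlit front end, assuming a hypothetical call_llm helper in place of the real backend invocation:

```python
# Run with: streamlit run app.py
import streamlit as st

def call_llm(prompt: str) -> str:
    # Stub for illustration; a real app would call the model backend here.
    return f"(model response to: {prompt})"

prompt = st.text_input("Enter your prompt")
if st.button("Get LLM Response"):
    st.write(call_llm(prompt))
```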
Architecture: The following figure shows the architecture of the solution. Being serverless, it allows secure integration and deployment of generative AI capabilities without managing infrastructure. An agent uses the power of an LLM to determine which function to execute, and outputs the result based on the prompt guide.
To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. The solution incorporates the following key features: Using a Retrieval Augmented Generation (RAG) architecture, the system generates a context-aware detailed assessment.
Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. The resulting distilled models include DeepSeek-R1-Distill-Llama-8B (from the base model Llama-3.1-8B).
This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.
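One way the sentiment step could look, assuming Amazon Comprehend's DetectSentiment API is the mechanism (the transcript text here is illustrative):

```python
# Hedged sketch: sentiment analysis of a call transcript with Amazon Comprehend.
import boto3

comprehend = boto3.client("comprehend")

transcript = "Thanks for resolving my issue so quickly!"  # sample input
result = comprehend.detect_sentiment(Text=transcript, LanguageCode="en")
print(result["Sentiment"], result["SentimentScore"])
```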
API Gateway is serverless and hence automatically scales with traffic, so you don’t have to manage the infrastructure. The advantage of using Application Load Balancer is that it can seamlessly route requests to virtually any managed, serverless, or self-hosted component, and it can also scale well.
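For illustration, a minimal AWS Lambda handler of the kind API Gateway or an Application Load Balancer can route to, using the API Gateway proxy integration response shape:

```python
import json

def handler(event, context):
    # Read an optional ?name= query parameter; default to "world".
    name = (event.get("queryStringParameters") or {}).get("name", "world")
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"message": f"hello {name}"}),
    }
```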
Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. In this article, we delve into serverless AI/ML on AWS, exploring best practices, implementation strategies, and an example to illustrate these concepts in action.
Seamless integration of the latest foundation models (FMs), Prompts, Agents, Knowledge Bases, Guardrails, and other AWS services. Reduced time and effort in testing and deploying AI workflows with SDK APIs and serverless infrastructure. Flexibility to define the workflow based on your business logic.
The solution integrates large language models (LLMs) with your organization’s data and provides an intelligent chat assistant that understands conversation context and provides relevant, interactive responses directly within the Google Chat interface. It can be a local machine or a cloud instance.
With advancement in AI technology, the time is right to address such complexities with large language models (LLMs). Amazon Bedrock has helped democratize access to LLMs, which have been challenging to host and manage. The following diagram illustrates the architecture using AWS services.
By leveraging genAI assistants and large language models, AI search can interpret a user request and deliver results in a business context. Look for an open ecosystem that integrates with all the major AI foundation models and supports your own models so existing investments aren’t wasted.
Amazon Bedrock’s single-API access, regardless of the models you choose, gives you the flexibility to use different FMs and upgrade to the latest model versions with minimal code changes. Amazon Titan FMs provide customers with a breadth of high-performing image, multimodal, and text model choices through a fully managed API.
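A hedged sketch of that single-API point using the Bedrock Converse API: swapping models is a one-line modelId change. The model IDs shown are just examples, not recommendations.

```python
import boto3

client = boto3.client("bedrock-runtime")

def ask(model_id: str, prompt: str) -> str:
    # The Converse API gives a uniform request/response shape across FMs.
    resp = client.converse(
        modelId=model_id,
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    return resp["output"]["message"]["content"][0]["text"]

print(ask("amazon.titan-text-express-v1", "Summarize RAG in one sentence."))
# Upgrading to another FM means changing only the modelId argument.
```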
That’s right, folks; I replaced the Xebia leadership with artificial intelligence! The magic happens through a combination of Serverless, user input, a CloudFront distribution, a Lambda function, and the OpenAI API. The post How I replaced Xebia Leadership with Artificial Intelligence appeared first on Xebia.
In addition, customers are looking for choices to select the most performant and cost-effective machine learning (ML) model and the ability to perform necessary customization (fine-tuning) to fit their business use cases. The LLM generates text, and the IR system retrieves relevant information from a knowledge base.
Generative AI is a type of artificial intelligence (AI) that can be used to create new content, including conversations, stories, images, videos, and music. Like all AI, generative AI works by using machine learning models—very large models that are pretrained on vast amounts of data called foundation models (FMs).
The O’Reilly Data Show Podcast: Eric Jonas on Pywren, scientific computation, and machine learning. Jonas and his collaborators are working on a related project, NumPyWren, a system for linear algebra built on a serverless architecture. Jonas is also affiliated with UC Berkeley’s RISE Lab.
In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model, available in Amazon Bedrock, with Amazon OpenSearch Serverless. Amazon SageMaker Studio is an integrated development environment (IDE) for machine learning (ML).
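A hedged sketch of generating one embedding with the Titan Multimodal Embeddings model on Bedrock; the file name and query text are placeholders, and the request fields follow the model's documented JSON schema.

```python
import base64
import json

import boto3

client = boto3.client("bedrock-runtime")

with open("product.jpg", "rb") as f:  # illustrative local image
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

body = {"inputText": "red canvas sneakers", "inputImage": image_b64}
resp = client.invoke_model(
    modelId="amazon.titan-embed-image-v1",
    body=json.dumps(body),
    contentType="application/json",
    accept="application/json",
)
embedding = json.loads(resp["body"].read())["embedding"]
print(len(embedding))  # vector ready to index in OpenSearch Serverless
```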
These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. To learn more about FMEval, see Evaluate large language models for quality and responsibility of LLMs.
To accomplish this, eSentire built AI Investigator, a natural language query tool for their customers to access security platform data by using AWS generative artificial intelligence (AI) capabilities. Therefore, eSentire decided to build their own LLM using Llama 1 and Llama 2 foundational models.
MaestroQA integrated Amazon Bedrock into their existing architecture using Amazon Elastic Container Service (Amazon ECS). The following architecture diagram demonstrates the request flow for AskAI. The customer interaction transcripts are stored in an Amazon Simple Storage Service (Amazon S3) bucket.
An operating model defines the organizational design, core processes, technologies, roles and responsibilities, governance structures, and financial models that drive a business’s operations. In this post, we evaluate different generative AI operating model architectures that could be adopted.
According to the Unit 42 Cloud Threat Report : The rate of cloud migration shows no sign of slowing down—from $370 billion in 2021, with predictions to reach $830 billion in 2025—with many cloud-native applications and architectures already having had time to mature.
More than 170 tech teams used the latest cloud, machine learning, and artificial intelligence technologies to build 33 solutions. Cost-effective – The solution should only invoke the LLM to generate reusable code on an as-needed basis, instead of manipulating the data directly, to be as cost-effective as possible.
This playground allows AI builders to explore scenarios, perform white hat hacking, and evaluate how models react under adversarial conditions. The following diagram illustrates the solution architecture. To learn more about Data Reply’s work, check out their specialized offerings for red teaming in generative AI and LLMOps.
Predictive analytics tools blend artificial intelligence and business reporting. Full integration with AWS, third-party marketplace, serverless options. Composite AI mixes statistics and machine learning; industry-specific solutions. Supports larger data management architecture; modular options available.
Of late, innovative data integration tools are revolutionising how organisations approach data management, unlocking new opportunities for growth, efficiency, and strategic decision-making by leveraging technical advancements in Artificial Intelligence, Machine Learning, and Natural Language Processing.
These services use advanced machine learning (ML) algorithms and computer vision techniques to perform functions like object detection and tracking, activity recognition, and text and audio recognition. The following diagram illustrates the solution architecture. The transcript is provided in tags.
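One way the object-detection step could look, assuming Amazon Rekognition's DetectLabels API is the service in play; the bucket and object key are placeholders.

```python
import boto3

rekognition = boto3.client("rekognition")

# Detect up to 10 labels with at least 80% confidence in an S3-hosted image.
resp = rekognition.detect_labels(
    Image={"S3Object": {"Bucket": "my-media-bucket", "Name": "frame-001.jpg"}},
    MaxLabels=10,
    MinConfidence=80,
)
for label in resp["Labels"]:
    print(label["Name"], round(label["Confidence"], 1))
```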
When Amazon Q Business became generally available in April 2024, we quickly saw an opportunity to simplify our architecture, because the service was designed to meet the needs of our use case: to provide a conversational assistant that could tap into our vast (sales) domain-specific knowledge bases.
In this post, we describe the development journey of the generative AI companion for Mozart, the data, the architecture, and the evaluation of the pipeline. In the future, Verisk intends to use the Amazon Titan Embeddings V2 model. The following diagram illustrates the solution architecture.
Cost optimization – This solution uses serverless technologies, making it cost-effective for the observability infrastructure. Multiple programming language support – The GitHub repository provides the observability solution in both Python and Node.js. However, some components may incur additional usage-based costs.
Generative artificial intelligence (AI) is rapidly emerging as a transformative force, poised to disrupt and reshape businesses of all sizes and across industries. LLM chain service – This service orchestrates the solution by invoking the LLM models with a fitting prompt and creating the response that is returned to the user.
Amazon SageMaker Canvas is a no-code machine learning (ML) service that empowers business analysts and domain experts to build, train, and deploy ML models without writing a single line of code. You can extend this solution to generative artificial intelligence (AI) use cases as well.
From deriving insights to powering generative artificial intelligence (AI)-driven applications, the ability to efficiently process and analyze large datasets is a vital capability. That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help.
Chatbots use the advanced natural language capabilities of large language models (LLMs) to respond to customer questions. They can understand conversational language and respond naturally. It augments prompts with these relevant chunks to generate an answer using the LLM.
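A minimal sketch of that prompt-augmentation step in RAG: retrieved chunks are stitched into the prompt before the LLM call. The retrieve() stub and the Bedrock model ID are illustrative assumptions, not a specific product's implementation.

```python
import boto3

bedrock = boto3.client("bedrock-runtime")

def retrieve(question: str) -> list[str]:
    # Stubbed retriever; a real system would query a vector store here.
    return ["(relevant chunk 1)", "(relevant chunk 2)"]

def answer(question: str) -> str:
    context = "\n\n".join(retrieve(question))
    prompt = (
        "Use only this context to answer.\n\n"
        f"{context}\n\nQuestion: {question}"
    )
    resp = bedrock.converse(
        modelId="amazon.titan-text-express-v1",  # example model ID
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    return resp["output"]["message"]["content"][0]["text"]
```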
In this article, we will discuss how MentorMate and our partner eLumen leveraged natural language processing (NLP) and machinelearning (ML) for data-driven decision-making to tame the curriculum beast in higher education. The primary data sources used in eLumen Insights are on the left-hand side of the architecture.
Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms. I will highlight the results of a recent survey on machine learning adoption, and along the way describe recent trends in data and machine learning (ML) within companies.
Amazon Bedrock is a fully managed service that makes foundation models (FMs) from leading AI startups and Amazon Web Services available through an API, so you can choose from a wide range of FMs to find the model that is best suited for your use case.
In Part 3, we demonstrate how business analysts and citizen data scientists can create machine learning (ML) models, without code, in Amazon SageMaker Canvas and deploy trained models for integration with Salesforce Einstein Studio to create powerful business applications.