Artificial Intelligence, AWS and Scalability

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy the AI/ML model efficiently in production environments. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Inteligence AWS Artificial Intelligence

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. It stores information such as job ID, status, creation time, and other metadata.

Scalability

Scalability Lambda Generative AI AWS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This scalability allows for more frequent and comprehensive reviews.

Generative AI

Generative AI Technical Review Software Review Systems Review

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Semantic routing offers several advantages, such as efficiency gained through fast similarity search in vector databases, and scalability to accommodate a large number of task categories and downstream LLMs. Before migrating any of the provided solutions to production, we recommend following the AWS Well-Architected Framework.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

Principal wanted to use existing internal FAQs, documentation, and unstructured data and build an intelligent chatbot that could provide quick access to the right information for different roles. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

Artificial Intelligence Average salary: $130,277 Expertise premium: $23,525 (15%) AI tops the list as the skill that can earn you the highest pay bump, earning tech professionals nearly an 18% premium over other tech skills. Read on to find out how such expertise can make you stand out in any industry.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Generative and agentic artificial intelligence (AI) are paving the way for this evolution. AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions.

Generative AI

Generative AI AWS Technical Review Backup

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. On AWS, you can use the fully managed Amazon Bedrock Agents or tools of your choice such as LangChain agents or LlamaIndex agents.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Dzone - DevOps

APRIL 9, 2025

Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. By abstracting the complexities of infrastructure, AWS enables teams to focus on innovation. Why Combine AI, ML, and Serverless Computing?

Serverless

Serverless Artificial Inteligence Scalability AWS

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Without a scalable approach to controlling costs, organizations risk unbudgeted usage and cost overruns. Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. Our study used Amazon Nova Micro and Amazon Nova Lite as baseline FMs and tested their performance across different configurations.

Case Study

Case Study Artificial Inteligence Study Generative AI

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Creating asynchronous AI agents with Amazon Bedrock

AWS Machine Learning - AI

MARCH 13, 2025

Advancements in multimodal artificial intelligence (AI), where agents can understand and generate not just text but also images, audio, and video, will further broaden their applications. Conversely, asynchronous event-driven systems offer greater flexibility and scalability through their distributed nature.

Artificial Inteligence

Artificial Inteligence Lambda Travel Generative AI

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

This challenge is further compounded by concerns over scalability and cost-effectiveness. Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. Two prominent approaches among our customers are LoRAX and vLLM.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Build your multilingual personal calendar assistant with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

JULY 3, 2024

To solve this problem, this post shows you how to apply AWS services such as Amazon Bedrock , AWS Step Functions , and Amazon Simple Email Service (Amazon SES) to build a fully-automated multilingual calendar artificial intelligence (AI) assistant. It lets you orchestrate multiple steps in the pipeline.

AWS

AWS Artificial Inteligence Generative AI Lambda

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services.

Generative AI

Generative AI Applications AWS Knowledge Base

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Intelligent document processing , translation and summarization, flexible and insightful responses for customer support agents, personalized marketing content, and image and code generation are a few use cases using generative AI that organizations are rolling out in production.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

AWS Machine Learning - AI

APRIL 11, 2024

AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019.

Generative AI

Generative AI AWS Artificial Inteligence Innovation

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. For this walkthrough, we will use the AWS CLI to trigger the processing.

Software Review

Software Review Technical Review AWS Artificial Inteligence

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. At AWS, our top priority is safeguarding the security and confidentiality of our customers’ workloads. With the AWS Nitro System , we delivered a first-of-its-kind innovation on behalf of our customers.

Generative AI

Generative AI AWS Artificial Inteligence Infrastructure

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

APRIL 23, 2024

Generative artificial intelligence (AI) has gained significant momentum with organizations actively exploring its potential applications. As successful proof-of-concepts transition into production, organizations are increasingly in need of enterprise scalable solutions.

Knowledge Base

Knowledge Base Scalability Applications Generative AI

Hyperscalers place big bets on AI as cloud spending rolls on

CIO

AUGUST 14, 2024

While enterprise IT budgets have grown, a significant portion of spending is now going to investments related to artificial intelligence (AI). According to a new report from Canalys, the top three cloud providers — AWS, Microsoft Azure, and Google Cloud — collectively grew by 24% this quarter to account for 63% of total spending.

Cloud

Cloud Google Cloud Artificial Intelligence Artificial Inteligence

Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Development Support Program

AWS Machine Learning - AI

JULY 31, 2024

Amazon Web Services (AWS) is committed to supporting the development of cutting-edge generative artificial intelligence (AI) technologies by companies and organizations across the globe. Let’s dive in and explore how these organizations are transforming what’s possible with generative AI on AWS.

Artificial Inteligence

Artificial Inteligence AWS Programming Innovation

AWS and Mistral AI commit to democratizing generative AI with a strengthened collaboration

AWS Machine Learning - AI

APRIL 2, 2024

The generative artificial intelligence (AI) revolution is in full swing, and customers of all sizes and across industries are taking advantage of this transformative technology to reshape their businesses. It’s been amazing to see the number of companies launching innovative generative AI applications on AWS using Amazon Bedrock.

Generative AI

Generative AI Artificial Inteligence AWS Software Review

Bank of America’s big bet on AI started small

CIO

APRIL 25, 2025

BofA has relationships with Microsoft, AWS, Google, and other clouds, but like many bank CIOs, Gopalkrishnan prefers to keep workloads close for cost and security reasons. The mainframe continues to be a very important strategic platform.

Banking

Banking Generative AI Off-The-Shelf Virtualization

EBSCOlearning scales assessment generation for their online learning content with generative AI

AWS Machine Learning - AI

DECEMBER 11, 2024

In this post, we illustrate how EBSCOlearning partnered with AWS Generative AI Innovation Center (GenAIIC) to use the power of generative AI in revolutionizing their learning assessment process. Scalability and robustness With EBSCOlearnings vast content library in mind, the team built scalability into the core of their solution.

Generative AI

Generative AI Artificial Inteligence Guidelines Education

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Use the us-west-2 AWS Region to run this demo. Prerequisites This notebook is designed to run on AWS, using Amazon Bedrock for both Anthropics Claude 3 Sonnet and Stability AI model access. Make sure you have the following set up before moving forward: An AWS account. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

This surge is driven by the rapid expansion of cloud computing and artificial intelligence, both of which are reshaping industries and enabling unprecedented scalability and innovation. Global IT spending is expected to soar in 2025, gaining 9% according to recent estimates. Short-term focus.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. With the significant developments in the field of generative AI , intelligent applications powered by foundation models (FMs) can help users map out an itinerary through an intuitive natural conversation interface.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

An AWS Batch job reads these documents, chunks them into smaller slices, then creates embeddings of the text chunks using the Amazon Titan Text Embeddings model through Amazon Bedrock and stores them in an Amazon OpenSearch Service vector database. In the future, Verisk intends to use the Amazon Titan Embeddings V2 model.

Generative AI

Generative AI Technical Review Insurance Policies

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Kaseya

MAY 13, 2021

The public cloud infrastructure is heavily based on virtualization technologies to provide efficient, scalable computing power and storage. Cloud adoption also provides businesses with flexibility and scalability by not restricting them to the physical limitations of on-premises servers. Amazon Web Services (AWS) Overview.

Google Cloud

Google Cloud Azure AWS Cloud

Balance between innovation and market adaptability key to Tencent Cloud’s AI success

CIO

SEPTEMBER 9, 2024

Tencent Cloud’s expansion in Asia Pacific (APAC) reflects its strategic efforts to capitalize on the growing demand for Artificial Intelligence (AI) and cloud computing services. Nevertheless, Tencent Cloud faces stiff competition from more established cloud providers like AWS, Google Cloud, and Microsoft Azure in the region.

Innovation

Innovation Marketing Azure Artificial Intelligence

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning - AI

MAY 30, 2024

CBRE is unlocking the potential of artificial intelligence (AI) to realize value across the entire commercial real estate lifecycle—from guiding investment decisions to managing buildings. AWS Prototyping developed an AWS Cloud Development Kit (AWS CDK) stack for deployment following AWS best practices.

AWS

AWS Lambda Performance Artificial Inteligence

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning - AI

APRIL 10, 2025

With this launch, you can now access Mistrals frontier-class multimodal model to build, experiment, and responsibly scale your generative AI ideas on AWS. AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. Take a look at the Mistral-on-AWS repo.

Generative AI

Generative AI AWS Technical Review Artificial Inteligence

Cloud Security — Maturing Past the Awkward Teenage Years

Palo Alto Networks

OCTOBER 22, 2024

{{interview_audio_title}} 00:00 00:00 Volume Slider 10s 10s 10s 10s Seek Slider The genesis of cloud computing can be traced back to the 1960s concept of utility computing, but it came into its own with the launch of Amazon Web Services (AWS) in 2006. As a result, another crucial misconception revolves around the shared responsibility model.

Cloud

Cloud Artificial Inteligence Software Review Systems Review

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning - AI

NOVEMBER 22, 2023

Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! are the sessions dedicated to AWS DeepRacer ! Generative AI is at the heart of the AWS Village this year. You marked your calendars, you booked your hotel, and you even purchased the airfare. And last but not least (and always fun!)

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Automate the process to change image backgrounds using Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

MARCH 7, 2024

However, Amazon Bedrock and AWS Step Functions make it straightforward to automate this process at scale. Step Functions allows you to create an automated workflow that seamlessly connects with Amazon Bedrock and other AWS services. The DynamoDB update triggers an AWS Lambda function, which starts a Step Functions workflow.

AWS

AWS Lambda Generative AI Report

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Confirm the AWS Regions where the model is available and quotas. Complete the knowledge base evaluation prerequisites related to AWS Identity and Access Management (IAM) creation and add permissions for an S3 bucket to access and write output data. Selected evaluator and generator models enabled in Amazon Bedrock.

Knowledge Base

Knowledge Base Applications Artificial Inteligence Generative AI

Building a Scalable ML Pipeline and API in AWS

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Build and deploy a UI for your generative AI applications with AWS and Python

Multi-LLM routing strategies for generative AI applications on AWS

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

9 IT skills where expertise pays the most

AI in action: Stories of how enterprises are transforming and modernizing

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Build a multi-tenant generative AI environment for your enterprise on AWS

How AWS sales uses Amazon Q Business for customer engagement

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Model customization, RAG, or both: A case study with Amazon Nova

Enable Amazon Bedrock cross-Region inference in multi-account environments

Creating asynchronous AI agents with Amazon Bedrock

Host concurrent LLMs with LoRAX

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Build your multilingual personal calendar assistant with Amazon Bedrock and AWS Step Functions

Empower your generative AI application with a comprehensive custom observability solution

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

Automate invoice processing with Streamlit and Amazon Bedrock

A secure approach to generative AI with AWS

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

Hyperscalers place big bets on AI as cloud spending rolls on

Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Development Support Program

AWS and Mistral AI commit to democratizing generative AI with a strengthened collaboration

Bank of America’s big bet on AI started small

EBSCOlearning scales assessment generation for their online learning content with generative AI

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Why GreenOps will succeed where FinOps is failing

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Balance between innovation and market adaptability key to Tencent Cloud’s AI success

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Pixtral Large is now available in Amazon Bedrock

Cloud Security — Maturing Past the Awkward Teenage Years

Your guide to generative AI and ML at AWS re:Invent 2023

Automate the process to change image backgrounds using Amazon Bedrock and AWS Step Functions

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

Stay Connected