AWS, Performance and Scalability

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. It stores information such as job ID, status, creation time, and other metadata.

Scalability

Scalability Lambda Generative AI AWS

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. In contrast, more complex questions might require the application to summarize a lengthy dissertation by performing deeper analysis, comparison, and evaluation of the research results.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Implementing a Version Control System for AWS QuickSight

Xebia

OCTOBER 24, 2024

Among the myriads of BI tools available, AWS QuickSight stands out as a scalable and cost-effective solution that allows users to create visualizations, perform ad-hoc analysis, and generate business insights from their data. AWS does not provide a comprehensive list of supported dataset types.

AWS

AWS Systems Review System Azure

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This scalability allows for more frequent and comprehensive reviews.

Generative AI

Generative AI Technical Review Software Review Systems Review

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

Were excited to announce the open source release of AWS MCP Servers for code assistants a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Inteligence

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

there is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment.

AWS

AWS Load Balancer Software Review Artificial Inteligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions.

Generative AI

Generative AI AWS Technical Review Backup

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Without a scalable approach to controlling costs, organizations risk unbudgeted usage and cost overruns. Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? For this HPC will bring massive parallel computing, cluster and workload managers and high-performance components to the table. AWS has two services to support your HPC workload. However, some tasks are very complex and require a different approach.

AWS

AWS Performance Storage Linux

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

CIO

APRIL 15, 2025

Cost-performance optimizations via new chip One of the major updates announced last week was Googles seventh generation Tensor Processing Unit (TPU) chip Ironwood targeted at accelerating AI workloads, especially inferencing.

Cloud

Cloud Innovation Artificial Inteligence Google Cloud

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

Technology leaders in the financial services sector constantly struggle with the daily challenges of balancing cost, performance, and security the constant demand for high availability means that even a minor system outage could lead to significant financial and reputational losses. Scalability. Scalability. Cost forecasting.

Serverless

Serverless Architecture Microservices Scalability

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI. Instead of performing line-by-line migrations, it analyzes and understands the business context of code, increasing efficiency.

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. Through Bedrock Marketplace, organizations can use Nemotron’s advanced capabilities while benefiting from the scalable infrastructure of AWS and NVIDIA’s robust technologies.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

In this post, we explore advanced prompt engineering techniques that can enhance the performance of these models and facilitate the creation of compelling imagery through text-to-image transformations. This post provided practical tips and techniques to optimize performance and elevate the creative possibilities within Stable Diffusion 3.5

Engineering

Engineering AWS 3D Generative AI

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline. Fine-tuning is one such technique, which helps in injecting task-specific or domain-specific knowledge for improving model performance.

Case Study

Case Study Artificial Inteligence Study Generative AI

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

Cloud computing Average salary: $124,796 Expertise premium: $15,051 (11%) Cloud computing has been a top priority for businesses in recent years, with organizations moving storage and other IT operations to cloud data storage platforms such as AWS.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. Post-authentication, users access the UI Layer, a gateway to the Red Teaming Playground built on AWS Amplify and React.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. 70B-Instruct ), offer different trade-offs between performance and resource requirements.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

The agents also automatically call APIs to perform actions and access knowledge bases to provide additional information. Developer tools The solution also uses the following developer tools: AWS Powertools for Lambda – This is a suite of utilities for Lambda functions that generates OpenAPI schemas from your Lambda function code.

Lambda

Lambda Enterprise Automotive Knowledge Base

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

For some content, additional screening is performed to generate subtitles and captions. As DPG Media grows, they need a more scalable way of capturing metadata that enhances the consumer experience on online video services and aids in understanding key content characteristics.

Media

Media Video Artificial Inteligence Generative AI

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies such as AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.

Generative AI

Generative AI Applications AWS Knowledge Base

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

IaC enables developers to define infrastructure configurations using code, ensuring consistency, automation, and scalability. AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. Why Use AWS CloudFormation? Example: 3. Example: 4.

AWS

AWS Software Review Systems Review Policies

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

This capability enables Anthropics Claude models to identify whats on a screen, understand the context of UI elements, and recognize actions that should be performed such as clicking buttons, typing text, scrolling, and navigating between applications. Prerequisites AWS Command Line Interface (CLI), follow instructions here.

AWS

AWS Generative AI Linux Groups

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. Amazon Bedrocks broad choice of FMs from leading AI companies, along with its scalability and security features, made it an ideal solution for MaestroQA.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. We ran 5+ scaling simulations and observed consistent performance with low variations across trials. minutes) to 166 seconds (2.77

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning - AI

MAY 30, 2024

Because Amazon Bedrock is serverless, you don’t have to manage infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. AWS Prototyping developed an AWS Cloud Development Kit (AWS CDK) stack for deployment following AWS best practices.

AWS

AWS Lambda Performance Artificial Inteligence

LambdaTest raises $45 million to build ‘AWS for testers’

TechCrunch

MARCH 29, 2022

But when a developer firm — at a scale — has to perform similar activities, they don’t often have — and in many cases, want to have — the required computing power at their disposal to run such tasks locally. We have built AWS for testers,” he said in an interview with TechCrunch.

AWS

AWS Testing Web Development Operating System

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

In the current digital environment, migration to the cloud has emerged as an essential tactic for companies aiming to boost scalability, enhance operational efficiency, and reinforce resilience. Get AWS developers A step-by-step AWS migration checklist Mobilunity helps hiring dedicated development teams to businesses worldwide for 14+ years.

AWS

AWS Cloud Weak Development Team DevOps

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In todays fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. First, cloud provisioning through automation is better in AWS CloudFormation and Azure Azure Resource Manager compared to the other cloud providers.

Cloud

Cloud Strategy Architecture Policies

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. Install Python 3.7 or later on your local machine.

Software Review

Software Review Technical Review AWS Artificial Inteligence

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Cloudera

JANUARY 7, 2025

Image: The Importance of Hybrid and Multi-Cloud Strategy Key benefits of a hybrid and multi-cloud approach include: Flexible Workload Deployment: The ability to place workloads in environments that best meet performance needs and regulatory requirements allows organizations to optimize operations while maintaining compliance.

Cloud

Cloud Data Scalability Compliance

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Large organizations often have many business units with multiple lines of business (LOBs), with a central governing entity, and typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. LOBs have autonomy over their AI workflows, models, and data within their respective AWS accounts.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Xebia

MAY 21, 2024

Cloud modernization has become a prominent topic for organizations, and AWS plays a crucial role in helping them modernize their IT infrastructure, applications, and services. Overall, discussions on AWS modernization are focused on security, faster releases, efficiency, and steps towards GenAI and improved innovation.

AWS

AWS Strategy Serverless Microservices

Delivering better business outcomes for CIOs

CIO

NOVEMBER 4, 2024

Enterprise cloud computing, while enabling fast deployment and scalability, has also introduced rising operational costs and additional challenges in managing diverse cloud services. Market shifts, mergers, geopolitical events, and the pandemic have further driven IT to deploy point solutions, increasing complexity.

Data Center

Data Center Recruiting Cloud Government

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

BQA reviews the performance of all education and training institutions, including schools, universities, and vocational institutes, thereby promoting the professional advancement of the nations human capital. Solution overview The proposed solution uses Amazon Bedrock and the Amazon Titan Express model to enable IDP functionalities.

Education

Education Report Technical Review Generative AI

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Implementing a Version Control System for AWS QuickSight

Webinars

Accelerate AWS Well-Architected reviews with Generative AI

Build and deploy a UI for your generative AI applications with AWS and Python

Introducing AWS MCP Servers for code assistants (Part 1)

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Build a multi-tenant generative AI environment for your enterprise on AWS

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

How AWS sales uses Amazon Q Business for customer engagement

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

High-performance computing on AWS

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

Can serverless fix fintech’s scaling problem?

AI in action: Stories of how enterprises are transforming and modernizing

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

Model customization, RAG, or both: A case study with Amazon Nova

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

9 IT skills where expertise pays the most

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Empower your generative AI application with a comprehensive custom observability solution

Best Practices for IaC using AWS CloudFormation

Enable Amazon Bedrock cross-Region inference in multi-account environments

Getting started with computer use in Amazon Bedrock Agents

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

LambdaTest raises $45 million to build ‘AWS for testers’

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Automate invoice processing with Streamlit and Amazon Bedrock

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Generative AI operating models in enterprise organizations with Amazon Bedrock

Modernizing on AWS: Strategies, Benefits, and Partnerships with Xebia

Delivering better business outcomes for CIOs

How BQA streamlines education quality reporting using Amazon Bedrock

Stay Connected