AWS, Scalability and Storage

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. It stores information such as job ID, status, creation time, and other metadata.

Scalability

Scalability Lambda Generative AI AWS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This scalability allows for more frequent and comprehensive reviews.

Generative AI

Generative AI Technical Review Software Review Systems Review

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions.

Generative AI

Generative AI AWS Technical Review Backup

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Semantic routing offers several advantages, such as efficiency gained through fast similarity search in vector databases, and scalability to accommodate a large number of task categories and downstream LLMs. Before migrating any of the provided solutions to production, we recommend following the AWS Well-Architected Framework.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. On AWS, you can use the fully managed Amazon Bedrock Agents or tools of your choice such as LangChain agents or LlamaIndex agents.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

Cloud computing Average salary: $124,796 Expertise premium: $15,051 (11%) Cloud computing has been a top priority for businesses in recent years, with organizations moving storage and other IT operations to cloud data storage platforms such as AWS.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

Simplify your storage management by leveraging a universal storage layer

CIO

AUGUST 26, 2024

A universal storage layer can help tame IT complexity One way to resolve this complexity is by architecting a consistent environment on a foundation of software-defined storage services that provide the same capabilities and management interfaces regardless of where a customer’s data resides.

Storage

Storage AWS Compliance Cloud

Spend Smarter, Not More: A Guide to AWS Storage Cost Optimization

Xebia

JANUARY 8, 2024

Introduction With an ever-expanding digital universe, data storage has become a crucial aspect of every organization’s IT strategy. The cloud, particularly Amazon Web Services (AWS), has made storing vast amounts of data more uncomplicated than ever before. The following table gives you an overview of AWS storage costs.

Storage

Storage AWS Backup Policies

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure. The CCoE implemented AWS Organizations across a substantial number of business units.

Generative AI

Generative AI Government Technical Review Innovation

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. Post-authentication, users access the UI Layer, a gateway to the Red Teaming Playground built on AWS Amplify and React.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In todays fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. First, cloud provisioning through automation is better in AWS CloudFormation and Azure Azure Resource Manager compared to the other cloud providers.

Cloud

Cloud Strategy Architecture Policies

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. The following diagram illustrates the end-to-end flow.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

This challenge is further compounded by concerns over scalability and cost-effectiveness. Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. Two prominent approaches among our customers are LoRAX and vLLM.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

The workflow includes the following steps: Documents (owner manuals) are uploaded to an Amazon Simple Storage Service (Amazon S3) bucket. x or later The AWS CDK CLI installed Deploy the solution The following steps outline the process to deploying the solution using the AWS CDK. The following diagram illustrates how it works.

Lambda

Lambda Enterprise Automotive Knowledge Base

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. Our study used Amazon Nova Micro and Amazon Nova Lite as baseline FMs and tested their performance across different configurations. To do so, we create a knowledge base.

Case Study

Case Study Artificial Inteligence Study Generative AI

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

The storage layer uses Amazon Simple Storage Service (Amazon S3) to hold the invoices that business users upload. Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. Install Python 3.7 or later on your local machine.

Software Review

Software Review Technical Review AWS Artificial Inteligence

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? HPC services on AWS Compute Technically you could design and build your own HPC cluster on AWS, it will work but you will spend time on plumbing and undifferentiated heavy lifting. AWS has two services to support your HPC workload.

AWS

AWS Performance Storage Linux

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

With Amazon Bedrock Data Automation, enterprises can accelerate AI adoption and develop solutions that are secure, scalable, and responsible. Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. For example, a request made in the US stays within Regions in the US.

Data

Data Generative AI Artificial Inteligence Compliance

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, run time, and custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. Additionally, you can choose what gets logged.

Generative AI

Generative AI Applications AWS Knowledge Base

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. Amazon Bedrocks broad choice of FMs from leading AI companies, along with its scalability and security features, made it an ideal solution for MaestroQA.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

In the current digital environment, migration to the cloud has emerged as an essential tactic for companies aiming to boost scalability, enhance operational efficiency, and reinforce resilience. Get AWS developers A step-by-step AWS migration checklist Mobilunity helps hiring dedicated development teams to businesses worldwide for 14+ years.

AWS

AWS Cloud Weak Development Team DevOps

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning - AI

NOVEMBER 20, 2024

In this post, we explore how you can use Amazon Q Business , the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity.

Data

Data AWS Groups Knowledge Base

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen , and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The extracted text data is placed into another SQS queue for the next processing step.

Education

Education Report Technical Review Generative AI

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Kaseya

MAY 13, 2021

As the name suggests, a cloud service provider is essentially a third-party company that offers a cloud-based platform for application, infrastructure or storage services. In a public cloud, all of the hardware, software, networking and storage infrastructure is owned and managed by the cloud service provider. What Is a Public Cloud?

Google Cloud

Google Cloud Azure AWS Cloud

Implement Amazon S3 Cross-Region Replication With Terraform

Dzone - DevOps

FEBRUARY 18, 2025

With the information technology element finding its roots in every financial organization and across all industries, strong storage capacity forms the backbone for availability, durability, and scalability. Among these, Amazon S3 is one of the most popular services to meet these needs.

AWS

AWS Storage Scalability Meeting

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

The Asure team was manually analyzing thousands of call transcripts to uncover themes and trends, a process that lacked scalability. Our partnership with AWS and our commitment to be early adopters of innovative technologies like Amazon Bedrock underscore our dedication to making advanced HCM technology accessible for businesses of any size.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Datavail Achieves AWS Service Delivery Designation for Amazon RDS

Datavail

JULY 7, 2021

Datavail has reached an exciting milestone : We’ve achieved the Amazon Web Services (AWS) Service Delivery Designation for Amazon Relational Database Service (Amazon RDS). This achievement recognizes that Datavail follows best practices and has proven success delivering AWS services to end customers.

AWS

AWS Database Administration Case Study Backup

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

AWS Machine Learning - AI

NOVEMBER 21, 2024

We guide you through deploying the necessary infrastructure using AWS CloudFormation , creating an internal labeling workforce, and setting up your first labeling job. Solution overview This audio/video segmentation solution combines several AWS services to create a robust annotation workflow. We demonstrate how to use Wavesurfer.js

Video

Video Lambda AWS Generative AI

Mastering AWS Infrastructure as Code with Pulumi and Python

Perficient

MARCH 27, 2025

What Youll Learn How Pulumi works with AWS Setting up Pulumi with Python Deploying various AWS services with real-world examples Best practices and advanced tips Why Pulumi for AWS? Multi-Cloud and Multi-Language Support Deploy across AWS, Azure, and Google Cloud with Python, TypeScript, Go, or.NET.

AWS

AWS Infrastructure Lambda Load Balancer

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock , Amazon SageMaker , Amazon Redshift , Amazon Glue , Amazon Athena , and Amazon Managed Workflows for Apache Airflow (MWAA) , into a comprehensive data and AI development platform. Navigate to the AWS Secrets Manager console and find the secret -api-keys.

Generative AI

Generative AI Applications Technical Review Software Review

Integrating Key Vault Secrets with Azure Synapse Analytics

Apiumhub

DECEMBER 9, 2024

Azure Key Vault Secrets offers a centralized and secure storage alternative for API keys, passwords, certificates, and other sensitive statistics. Azure Key Vault is a cloud service that provides secure storage and access to confidential information such as passwords, API keys, and connection strings. What is Azure Key Vault Secret?

Azure

Azure Analytics Storage Artificial Inteligence

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

APRIL 23, 2024

As successful proof-of-concepts transition into production, organizations are increasingly in need of enterprise scalable solutions. The AWS Well-Architected Framework provides best practices and guidelines for designing and operating reliable, secure, efficient, and cost-effective systems in the cloud.

Knowledge Base

Knowledge Base Scalability Applications Generative AI

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

Solution overview The policy documents reside in Amazon Simple Storage Service (Amazon S3) storage. This action invokes an AWS Lambda function to retrieve the document embeddings from the OpenSearch Service database and present them to Anthropics Claude 3 Sonnet FM, which is accessed through Amazon Bedrock.

Generative AI

Generative AI Technical Review Insurance Policies

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

Designed with a serverless, cost-optimized architecture, the platform provisions SageMaker endpoints dynamically, providing efficient resource utilization while maintaining scalability. Multiple specialized Amazon Simple Storage Service Buckets (Amazon S3 Bucket) store different types of outputs.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

These recipes include a training stack validated by Amazon Web Services (AWS) , which removes the tedious work of experimenting with different model configurations, minimizing the time it takes for iterative evaluation and testing. You can run these recipes using SageMaker HyperPod or as SageMaker training jobs.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

AWS ECS vs. AWS Lambda: Top 5 Main Differences

Dzone - DevOps

APRIL 24, 2023

Today, most organizations prefer to host applications and services on the cloud due to ease of deployment, high security, scalability, and cheap maintenance costs over on-premise infrastructure. In 2006, Amazon launched its cloud services platform, Amazon Web Services (AWS) , one of the leading cloud providers to date.

AWS

AWS Lambda Software Review Artificial Inteligence

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Confirm the AWS Regions where the model is available and quotas. Complete the knowledge base evaluation prerequisites related to AWS Identity and Access Management (IAM) creation and add permissions for an S3 bucket to access and write output data. Selected evaluator and generator models enabled in Amazon Bedrock.

Knowledge Base

Knowledge Base Applications Artificial Inteligence Generative AI

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

AWS Machine Learning - AI

APRIL 30, 2025

Because Amazon Bedrock is serverless, you dont have to manage infrastructure to securely integrate and deploy generative AI capabilities into your application, handle spiky traffic patterns, and enable new features like cross-Region inference, which helps provide scalability and reliability across AWS Regions.

Artificial Inteligence

Artificial Inteligence Technical Review Software Review Generative AI

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Our proposed architecture provides a scalable and customizable solution for online LLM monitoring, enabling teams to tailor your monitoring solution to your specific use cases and requirements. Through AWS Step Functions orchestration, the function calls Amazon Comprehend to detect the sentiment and toxicity.

Artificial Inteligence

Artificial Inteligence AWS Lambda Metrics

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Accelerate AWS Well-Architected reviews with Generative AI

Webinars

Trending Sources

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Webinars

Multi-LLM routing strategies for generative AI applications on AWS

Build a multi-tenant generative AI environment for your enterprise on AWS

How AWS sales uses Amazon Q Business for customer engagement

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

9 IT skills where expertise pays the most

Simplify your storage management by leveraging a universal storage layer

Spend Smarter, Not More: A Guide to AWS Storage Cost Optimization

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Host concurrent LLMs with LoRAX

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Model customization, RAG, or both: A case study with Amazon Nova

Automate invoice processing with Streamlit and Amazon Bedrock

High-performance computing on AWS

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Empower your generative AI application with a comprehensive custom observability solution

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

How BQA streamlines education quality reporting using Amazon Bedrock

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Implement Amazon S3 Cross-Region Replication With Terraform

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Datavail Achieves AWS Service Delivery Designation for Amazon RDS

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Mastering AWS Infrastructure as Code with Pulumi and Python

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Integrating Key Vault Secrets with Azure Synapse Analytics

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS ECS vs. AWS Lambda: Top 5 Main Differences

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

Techniques and approaches for monitoring large language models on AWS

Stay Connected