AWS, Machine Learning and Reference

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. You can use AWS services such as Application Load Balancer to implement this approach. API Gateway also provides a WebSocket API.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. AWS HealthScribe combines speech recognition and generative AI trained specifically for healthcare documentation to accelerate clinical documentation and enhance the consultation experience.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The user signs in by entering a user name and a password.

Generative AI

Generative AI AWS Lambda Authentication

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment. For more information on how to view and increase your quotas, refer to Amazon EC2 service quotas.

AWS

AWS Load Balancer Software Review Artificial Inteligence

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

Large Medium – This refers to the material or technique used in creating the artwork. This might involve incorporating additional data such as reference images or rough sketches as conditioning inputs alongside your text prompts. She’s passionate about machine learning technologies and environmental sustainability.

Engineering

Engineering AWS 3D Generative AI

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

Prerequisites Before you dive into the integration process, make sure you have the following prerequisites in place: AWS account – You’ll need an AWS account to access and use Amazon Bedrock. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. Prerequisites AWS Command Line Interface (CLI), follow instructions here. Require Python 3.11

AWS

AWS Generative AI Linux Groups

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. We're more than happy to provide further references upon request.

Generative AI

Generative AI AWS Technical Review Backup

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.

Generative AI

Generative AI Applications AWS Knowledge Base

Are you ready for MLOps? 🫵

Xebia

FEBRUARY 28, 2025

… that is not an awful lot. Both the tech and the skills are there: Machine Learning technology is by now easy to use and widely available. So then let me re-iterate: why, still, are teams having troubles launching Machine Learning models into production? Graph refers to Gartner hype cycle.

Technical Review

Technical Review Weak Development Team Artificial Inteligence Machine Learning

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. For more on MuleSofts journey to cloud computing, refer to Why a Cloud Operating Model?

Generative AI

Generative AI AWS Innovation Knowledge Base

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

For a comprehensive overview of metadata filtering and its benefits, refer to Amazon Bedrock Knowledge Bases now supports metadata filtering to improve retrieval accuracy. Prerequisites Before proceeding with this tutorial, make sure you have the following in place: AWS account – You should have an AWS account with access to Amazon Bedrock.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

To evaluate the metadata quality, the team used reference-free LLM metrics, inspired by LangSmith. DPG Media chose Amazon Transcribe for its ease of transcription and low maintenance, with the added benefit of incremental improvements by AWS over the years. About the Authors Lucas Desard is GenAI Engineer at DPG Media.

Media

Media Video Artificial Inteligence Generative AI

Improving Retrieval Augmented Generation accuracy with GraphRAG

AWS Machine Learning - AI

DECEMBER 23, 2024

Lettria , an AWS Partner, demonstrated that integrating graph-based structures into RAG workflows improves answer precision by up to 35% compared to vector-only retrieval methods. In this post, we explore why GraphRAG is more comprehensive and explainable than vector RAG alone, and how you can use this approach using AWS services and Lettria.

Generative AI

Generative AI Artificial Inteligence AWS Knowledge Base

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

Developer tools The solution also uses the following developer tools: AWS Powertools for Lambda – This is a suite of utilities for Lambda functions that generates OpenAPI schemas from your Lambda function code. After deployment, the AWS CDK CLI will output the web application URL. Python 3.9 or later Node.js

Lambda

Lambda Enterprise Automotive Knowledge Base

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

It often requires managing multiple machine learning (ML) models, designing complex workflows, and integrating diverse data sources into production-ready formats. Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions.

Data

Data Generative AI Artificial Inteligence Compliance

Discover insights from Gmail using the Gmail connector for Amazon Q Business

AWS Machine Learning - AI

OCTOBER 31, 2024

The web application that the user uses to retrieve answers is connected to an identity provider (IdP) or AWS IAM Identity Center. The user’s credentials from the IdP or IAM Identity Center are referred to here as the federated user credentials. Refer to How Amazon Q Business connector crawls Gmail ACLs for more information.

AWS

AWS Generative AI Groups Applications

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

AWS Machine Learning - AI

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

Knowledge Base

Knowledge Base Lambda Enterprise AWS

MLOps: Methods and Tools of DevOps for Machine Learning

Altexsoft

JULY 23, 2020

When speaking of machine learning, we typically discuss data preparation or model building. The fusion of terms “machine learning” and “operations”, MLOps is a set of methods to automate the lifecycle of machine learning algorithms in production — from initial model training to deployment to retraining against new data.

Artificial Inteligence

Artificial Inteligence Machine Learning DevOps Tools

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Response latency refers to the time between the user finishing their speech and beginning to hear the AI assistants response. AWS Local Zones are a type of edge infrastructure deployment that places select AWS services close to large population and industry centers. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

These recipes include a training stack validated by Amazon Web Services (AWS) , which removes the tedious work of experimenting with different model configurations, minimizing the time it takes for iterative evaluation and testing. All of this runs under the SageMaker managed environment, providing optimal resource utilization and security.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

It is designed to handle the demanding computational and latency requirements of state-of-the-art transformer models, including Llama, Falcon, Mistral, Mixtral, and GPT variants for a full list of TGI supported models refer to supported models. For a complete list of runtime configurations, please refer to text-generation-launcher arguments.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Use the us-west-2 AWS Region to run this demo. Prerequisites This notebook is designed to run on AWS, using Amazon Bedrock for both Anthropics Claude 3 Sonnet and Stability AI model access. Make sure you have the following set up before moving forward: An AWS account. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure.

Generative AI

Generative AI Government Technical Review Innovation

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services. Event-driven operations management Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads.

Cloud

Cloud AWS Serverless Policies

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

How GoDaddy built a category generation system at scale with batch inference for Amazon Bedrock

AWS Machine Learning - AI

MARCH 13, 2025

The security measures are inherently integrated into the AWS services employed in this architecture. For detailed information, refer to the Security Best Practices section of this post. Refer to the following code: Request: POST /model-invocation-job HTTP/1.1 It shuts down the endpoint when processing is complete.

Artificial Inteligence

Artificial Inteligence Systems Review System Generative AI

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. or later on your local machine. Install Python 3.7

Software Review

Software Review Technical Review AWS Artificial Inteligence

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Launching a machine learning (ML) training cluster with Amazon SageMaker training jobs is a seamless process that begins with a straightforward API call, AWS Command Line Interface (AWS CLI) command, or AWS SDK interaction. Surya Kari is a Senior Generative AI Data Scientist at AWS.

Training

Training Artificial Inteligence AWS Machine Learning

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Precision measures the proportion of generated tokens that match the reference tokens, and recall measures the proportion of reference tokens that are captured by the generated tokens. The precision would be 6/9 (6 matching tokens out of 9 generated tokens), and the recall would be 6/11 (6 matching tokens out of 11 reference tokens).

Artificial Inteligence

Artificial Inteligence Generative AI Training Metrics

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning - AI

NOVEMBER 20, 2024

In this post, we explore how you can use Amazon Q Business , the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity. In this post, we use IAM Identity Center as the SAML 2.0-aligned

Data

Data AWS Groups Knowledge Base

Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2

AWS Machine Learning - AI

DECEMBER 13, 2023

In this post, we showcase fine-tuning a Llama 2 model using a Parameter-Efficient Fine-Tuning (PEFT) method and deploy the fine-tuned model on AWS Inferentia2. We use the AWS Neuron software development kit (SDK) to access the AWS Inferentia2 device and benefit from its high performance.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Software Review

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. For human-in-the-loop evaluation, which can be done by either AWS managed or customer managed teams, you must bring your own dataset.

Applications

Applications Generative AI AWS Artificial Inteligence

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

By using AWS services, our architecture provides real-time visibility into LLM behavior and enables teams to quickly identify and address any issues or anomalies. In this post, we demonstrate a few metrics for online LLM monitoring and their respective architecture for scale using AWS services such as Amazon CloudWatch and AWS Lambda.

Artificial Inteligence

Artificial Inteligence AWS Lambda Metrics

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

AWS Machine Learning - AI

APRIL 11, 2024

AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019.

Generative AI

Generative AI AWS Artificial Inteligence Innovation

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

At AWS, our top priority is safeguarding the security and confidentiality of our customers’ workloads. With the AWS Nitro System , we delivered a first-of-its-kind innovation on behalf of our customers. The Nitro System is an unparalleled computing backbone for AWS, with security and performance at its core.

Generative AI

Generative AI AWS Artificial Inteligence Infrastructure

Generate financial industry-specific insights using generative AI and in-context fine-tuning

AWS Machine Learning - AI

NOVEMBER 12, 2024

You may check out additional reference notebooks on aws-samples for how to use Meta’s Llama models hosted on Amazon Bedrock. You can implement these steps either from the AWS Management Console or using the latest version of the AWS Command Line Interface (AWS CLI). 0 means not expensive, 1 means expensive.

Generative AI

Generative AI Artificial Inteligence Industry Analysis

Unlock the power of data governance and no-code machine learning with Amazon SageMaker Canvas and Amazon DataZone

AWS Machine Learning - AI

AUGUST 21, 2024

Amazon DataZone is a data management service that makes it quick and convenient to catalog, discover, share, and govern data stored in AWS, on-premises, and third-party sources. An Amazon DataZone domain and an associated Amazon DataZone project configured in your AWS account.

Artificial Inteligence

Artificial Inteligence Machine Learning Government Software Review

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

Our partnership with AWS and our commitment to be early adopters of innovative technologies like Amazon Bedrock underscore our dedication to making advanced HCM technology accessible for businesses of any size. We are thrilled to partner with AWS on this groundbreaking generative AI project. John Canada, VP of Engineering at Asure.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Build a multi-tenant generative AI environment for your enterprise on AWS

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Webinars

Trending Sources

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Webinars

Accelerate AWS Well-Architected reviews with Generative AI

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

Integrate foundation models into your code with Amazon Bedrock

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Getting started with computer use in Amazon Bedrock Agents

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

How AWS sales uses Amazon Q Business for customer engagement

Empower your generative AI application with a comprehensive custom observability solution

Are you ready for MLOps? 🫵

Enable Amazon Bedrock cross-Region inference in multi-account environments

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Improving Retrieval Augmented Generation accuracy with GraphRAG

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Discover insights from Gmail using the Gmail connector for Amazon Q Business

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

MLOps: Methods and Tools of DevOps for Machine Learning

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Boost productivity by using AI in cloud operational health management

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

How GoDaddy built a category generation system at scale with batch inference for Amazon Bedrock

Automate invoice processing with Streamlit and Amazon Bedrock

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Best practices and lessons for fine-tuning Anthropic’s Claude 3 Haiku on Amazon Bedrock

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Techniques and approaches for monitoring large language models on AWS

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

A secure approach to generative AI with AWS

Generate financial industry-specific insights using generative AI and in-context fine-tuning

Unlock the power of data governance and no-code machine learning with Amazon SageMaker Canvas and Amazon DataZone

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Stay Connected