AWS and Reference - CTO Universe

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The user signs in by entering a user name and a password.

Generative AI

Generative AI AWS Lambda Authentication

Webinars

Going Beyond Chatbots: Connecting AI to Your Tools, Systems, & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

MORE WEBINARS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. You can use AWS services such as Application Load Balancer to implement this approach. API Gateway also provides a WebSocket API.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation. AWS HealthScribe will then output two files which are also stored on Amazon S3.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. We're more than happy to provide further references upon request.

Generative AI

Generative AI AWS Technical Review Backup

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment. For more information on how to view and increase your quotas, refer to Amazon EC2 service quotas.

AWS

AWS Load Balancer Software Review Artificial Inteligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

Large Medium – This refers to the material or technique used in creating the artwork. This might involve incorporating additional data such as reference images or rough sketches as conditioning inputs alongside your text prompts. You can provide extensive details, such as the gender of a character, their clothing, and the setting.

Engineering

Engineering AWS 3D Generative AI

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Inteligence

Artificial Inteligence AWS Technical Review Policies

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions.

Case Study

Case Study Artificial Inteligence Study Generative AI

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Response latency refers to the time between the user finishing their speech and beginning to hear the AI assistants response. AWS Local Zones are a type of edge infrastructure deployment that places select AWS services close to large population and industry centers. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. For more on MuleSofts journey to cloud computing, refer to Why a Cloud Operating Model?

Generative AI

Generative AI AWS Innovation Knowledge Base

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

Developer tools The solution also uses the following developer tools: AWS Powertools for Lambda – This is a suite of utilities for Lambda functions that generates OpenAPI schemas from your Lambda function code. After deployment, the AWS CDK CLI will output the web application URL. Python 3.9 or later Node.js

Lambda

Lambda Enterprise Automotive Knowledge Base

‘AWS for blockchain’ Alchemy boosts valuation to $3.5B with $250M raise

TechCrunch

OCTOBER 28, 2021

For the unacquainted, Web3 refers to a set of protocols led by blockchain, that intends to reinvent how the Internet is wired in the backend). Put simply, Alchemy wants to do for blockchain and Web3 what AWS (Amazon Web Services) did for the internet. ?? Alchemy raises $80M at a $505M valuation to be the ‘AWS for blockchain’.

Blockchain

Blockchain AWS Internet Comparison

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure.

Generative AI

Generative AI Government Technical Review Innovation

Boost productivity by using AI in cloud operational health management

AWS Machine Learning - AI

OCTOBER 11, 2024

It uses Amazon Bedrock , AWS Health , AWS Step Functions , and other AWS services. Event-driven operations management Operational events refer to occurrences within your organization’s cloud environment that might impact the performance, resilience, security, or cost of your workloads.

Cloud

Cloud AWS Serverless Lambda

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.

Generative AI

Generative AI Applications AWS Knowledge Base

Discover insights from Gmail using the Gmail connector for Amazon Q Business

AWS Machine Learning - AI

OCTOBER 31, 2024

The web application that the user uses to retrieve answers is connected to an identity provider (IdP) or AWS IAM Identity Center. The user’s credentials from the IdP or IAM Identity Center are referred to here as the federated user credentials. Refer to How Amazon Q Business connector crawls Gmail ACLs for more information.

AWS

AWS Generative AI Groups Applications

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning - AI

APRIL 10, 2025

With this launch, you can now access Mistrals frontier-class multimodal model to build, experiment, and responsibly scale your generative AI ideas on AWS. AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model. Additionally, Pixtral Large supports the Converse API and tool usage.

Generative AI

Generative AI AWS Technical Review Artificial Inteligence

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

Prerequisites Before you dive into the integration process, make sure you have the following prerequisites in place: AWS account – You’ll need an AWS account to access and use Amazon Bedrock. You can interact with Amazon Bedrock using AWS SDKs available in Python, Java, Node.js, and more.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Windows Password Recovery with AWS SSM

Perficient

FEBRUARY 25, 2025

The Systems Manager (SSM) streamlines managing Windows instances in AWS. Instead of taking a backup, creating a new instance, and reconfiguring the environmentwhich is time-consuming and impacts business operationswe leverage AWS Systems Manager (SSM) to efficiently recover access without disruption.

Windows

Windows AWS Systems Review Policies

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

The computer use agent demo powered by Amazon Bedrock Agents provides the following benefits: Secure execution environment Execution of computer use tools in a sandbox environment with limited access to the AWS ecosystem and the web. Prerequisites AWS Command Line Interface (CLI), follow instructions here. Require Python 3.11

AWS

AWS Generative AI Linux Groups

Generate financial industry-specific insights using generative AI and in-context fine-tuning

AWS Machine Learning - AI

NOVEMBER 12, 2024

You may check out additional reference notebooks on aws-samples for how to use Meta’s Llama models hosted on Amazon Bedrock. You can implement these steps either from the AWS Management Console or using the latest version of the AWS Command Line Interface (AWS CLI). 0 means not expensive, 1 means expensive.

Generative AI

Generative AI Artificial Inteligence Industry Analysis

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

To evaluate the metadata quality, the team used reference-free LLM metrics, inspired by LangSmith. DPG Media chose Amazon Transcribe for its ease of transcription and low maintenance, with the added benefit of incremental improvements by AWS over the years.

Media

Media Video Artificial Inteligence Generative AI

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. This blog explores the best practices for utilizing AWS CloudFormation to achieve reliable, secure, and efficient infrastructure management. Why Use AWS CloudFormation? Example: 3.

AWS

AWS Software Review Systems Review Policies

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning - AI

NOVEMBER 20, 2024

In this post, we explore how you can use Amazon Q Business , the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity. In this post, we use IAM Identity Center as the SAML 2.0-aligned

Data

Data AWS Groups Knowledge Base

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Use the us-west-2 AWS Region to run this demo. Prerequisites This notebook is designed to run on AWS, using Amazon Bedrock for both Anthropics Claude 3 Sonnet and Stability AI model access. Make sure you have the following set up before moving forward: An AWS account. An Amazon SageMaker domain. Access to Stability AIs SD3.5

Generative AI

Generative AI Games Development AWS

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 14, 2024

Prerequisites To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. If you’re new to Amazon EC2, refer to the Amazon EC2 User Guide.

Software Review

Software Review Technical Review AWS Artificial Inteligence

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. Prerequisites For this guide, you need access to the following prerequisites: An AWS account Proper permissions to deploy EC2 G6 instances.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

IBM and AWS Create a Path to Modernization Via Industry-Specific Solutions

CIO

OCTOBER 13, 2022

As part of their partnership, IBM and Amazon Web Services (AWS) are pursuing a variety of industry-specific blueprints and solutions designed to help customers modernize apps for a hybrid IT environment, which includes AWS Cloud. AWS/IBM’s Industry Edge.

AWS

AWS Industry Technical Review Disaster Recovery

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

For a comprehensive overview of metadata filtering and its benefits, refer to Amazon Bedrock Knowledge Bases now supports metadata filtering to improve retrieval accuracy. Prerequisites Before proceeding with this tutorial, make sure you have the following in place: AWS account – You should have an AWS account with access to Amazon Bedrock.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

High-performance computing on AWS

Xebia

AUGUST 29, 2023

How does High-Performance Computing on AWS differ from regular computing? HPC services on AWS Compute Technically you could design and build your own HPC cluster on AWS, it will work but you will spend time on plumbing and undifferentiated heavy lifting. AWS has two services to support your HPC workload.

AWS

AWS Performance Storage Linux

Add a generative AI experience to your website or web application with Amazon Q embedded

AWS Machine Learning - AI

DECEMBER 19, 2024

If you dont have an existing application, you can create an application integrated with AWS IAM Identity Center or AWS Identity and Access Management (IAM) identity federation. You can find your web experience ID with the list-web-experiences AWS CLI command. Amazon Q Business hosts the web experience on an AWS domain.

Generative AI

Generative AI Applications AWS Examples

AWS Security Essentials: From Prevention to Detection

Xebia

SEPTEMBER 24, 2024

AWS offers a range of security services like AWS Security Hub, AWS GuardDuty, Amazon Inspector, Amazon Macie etc. This post will dive into how we can monitor these AWS Security services and build a layered security approach, emphasizing the importance of both prevention and detection.

AWS

AWS Lambda Firewall Policies

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

By modern, I refer to an engineering-driven methodology that fully capitalizes on automation and software engineering best practices. Organizations must decide on their hosting provider, whether it be an on-prem setup, cloud solutions like AWS, GCP, Azure or specialized data platform providers such as Snowflake and Databricks.

Data

Data Technical Review Software Review Weak Development Team

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. For human-in-the-loop evaluation, which can be done by either AWS managed or customer managed teams, you must bring your own dataset.

Applications

Applications Generative AI AWS Artificial Inteligence

Introducing multi-turn conversation with an agent node for Amazon Bedrock Flows (preview)

AWS Machine Learning - AI

JANUARY 22, 2025

Amazon Bedrock Flows offers an intuitive visual builder and a set of APIs to seamlessly link foundation models (FMs), Amazon Bedrock features, and AWS services to build and automate user-defined generative AI workflows at scale. Amazon Bedrock Agents offers a fully managed solution for creating, deploying, and scaling AI agents on AWS.

Travel

Travel Hotels AWS Generative AI

Mastering AWS IaC with Pulumi and Python – Part 2

Perficient

APRIL 4, 2025

In Part 1 of this series, we learned about the importance of AWS and Pulumi. Now, lets explore the demo part in this practical session, which will create a service on AWS VPC by using Pulumi. AdministratorAccess or a custom policy). us-east-1) Output format (e.g., us-east-1) Output format (e.g., py to create a VPC: Step 3.

AWS

AWS Google Cloud Azure Policies

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Building Resilient Public Networking on AWS: Part 4

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Webinars

Accelerate AWS Well-Architected reviews with Generative AI

Build a multi-tenant generative AI environment for your enterprise on AWS

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

How AWS sales uses Amazon Q Business for customer engagement

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Enable Amazon Bedrock cross-Region inference in multi-account environments

Model customization, RAG, or both: A case study with Amazon Nova

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

‘AWS for blockchain’ Alchemy boosts valuation to $3.5B with $250M raise

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Boost productivity by using AI in cloud operational health management

Empower your generative AI application with a comprehensive custom observability solution

Discover insights from Gmail using the Gmail connector for Amazon Q Business

Pixtral Large is now available in Amazon Bedrock

Integrate foundation models into your code with Amazon Bedrock

Windows Password Recovery with AWS SSM

Getting started with computer use in Amazon Bedrock Agents

Generate financial industry-specific insights using generative AI and in-context fine-tuning

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Best Practices for IaC using AWS CloudFormation

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

Automate invoice processing with Streamlit and Amazon Bedrock

Host concurrent LLMs with LoRAX

IBM and AWS Create a Path to Modernization Via Industry-Specific Solutions

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

High-performance computing on AWS

Add a generative AI experience to your website or web application with Amazon Q embedded

AWS Security Essentials: From Prevention to Detection

The future of data: A 5-pillar approach to modern data management

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Introducing multi-turn conversation with an agent node for Amazon Bedrock Flows (preview)

Mastering AWS IaC with Pulumi and Python – Part 2

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Stay Connected