Applications and AWS - CTO Universe

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. This strategy results in more robust, versatile, and efficient applications that better serve diverse user needs and business objectives. In this post, we provide an overview of common multi-LLM applications.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. Choose the us-east-1 AWS Region from the top right corner. Choose Manage model access.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Cross-Stack RDS User Provisioning and Schema Migrations with AWS Lambda

Xebia

MARCH 4, 2025

Use identity and access management (AWS IAM). You can compare these credentials with the root credentials of a Linux system or the root account for your AWS account. You could use AWS IAM, and this will give us the ability to be more least privileged. Afterward, your user is ready to use your application.

Lambda

Lambda AWS Authentication Linux

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Monitoring AWS Container Environments at Scale

Advertiser: Datadog

Containers power many of the applications we use every day. Particularly well-suited for microservice-oriented architectures and agile workflows, containers help organizations improve developer efficiency, feature velocity, and optimization of resources.

AWS

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock , an AWS managed service to build and scale generative AI applications with foundation models (FMs). The following diagram illustrates the architecture of the application.

Generative AI

Generative AI AWS Lambda Authentication

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. In this post, we set up the custom solution for observability and evaluation of Amazon Bedrock applications.

Generative AI

Generative AI Applications AWS Knowledge Base

AWS Extends Generative AI Reach to Third-Party IT Platforms

DevOps.com

NOVEMBER 26, 2024

Amazon Web Services (AWS) has extended the reach of its generative artificial intelligence (AI) platform for application development to include a set of plug-in extensions, that make it possible to launch natural language queries against data residing in platforms from Datadog and Wiz.

AWS

AWS Generative AI Artificial Inteligence Artificial Intelligence

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

While organizations continue to discover the powerful applications of generative AI , adoption is often slowed down by team silos and bespoke workflows. It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. You can obtain the SageMaker Unified Studio URL for your domains by accessing the AWS Management Console for Amazon DataZone.

Generative AI

Generative AI Applications Technical Review Software Review

Discover, Protect and Respond with AWS and Prisma Cloud

Prisma Clud

NOVEMBER 22, 2024

Unmanaged cloud resources, human error, misconfigurations and the increasing sophistication of cyber threats, including those from AI-powered applications, create vulnerabilities that can expose sensitive data and disrupt business operations. Enhance Security Posture – Proactively identify and mitigate threats to your AWS infrastructure.

AWS

AWS Cloud Network Compliance

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. These dimensions make up the foundation for developing and deploying AI applications in a responsible and safe manner.

Applications

Applications Generative AI AWS Artificial Inteligence

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment. Adjust the following configuration to suit your needs, such as the Amazon EKS version, cluster name, and AWS Region.

AWS

AWS Load Balancer Software Review Artificial Inteligence

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

Cloud providers have recognized the need to offer model inference through an API call, significantly streamlining the implementation of AI within applications. AWS Step Functions is a fully managed service that makes it easier to coordinate the components of distributed applications and microservices using visual workflows.

Generative AI

Generative AI AWS Technical Review Backup

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Artificial Inteligence Generative AI Training

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

Automate actions across enterprise applications using Amazon Q Business plugins

AWS Machine Learning - AI

DECEMBER 10, 2024

In this post, we explore how Amazon Q Business plugins enable seamless integration with enterprise applications through both built-in and custom plugins. This provides a more straightforward and quicker experience for users, who no longer need to use multiple applications to complete tasks. Choose Add plugin.

Applications

Applications Enterprise Authentication AWS

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. This highlights an interest in a more efficient approach to share and deploy applications across multiple App Studio instances.

AWS

AWS Software Review Technical Review Generative AI

From vision to value: A strategic approach to generative AI adoption

CIO

NOVEMBER 22, 2024

Generative AI has the potential to redefine productivity, create novel applications, and reinvent customer experience. However, both AWS and Caylent have helped dozens of organizations adopt generative AI, and Backeberg and Henderson understand that starting this journey can be daunting.

Generative AI

Generative AI AWS Energy Innovation

AWS launches new $30M accelerator program aimed at minority founders

TechCrunch

APRIL 20, 2022

Amazon Web Services (AWS) today launched a new program, AWS Impact Accelerator , that will give up to $30 million to early-stage startups led by Black, Latino, LGBTQIA+ and women founders. But critics contend that AWS Impact Accelerator doesn’t go far enough in supporting historically marginalized entrepreneurs.

AWS

AWS Programming Training Banking

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

Prerequisites To implement the proposed solution, make sure that you have the following: An AWS account and a working knowledge of FMs, Amazon Bedrock , Amazon SageMaker , Amazon OpenSearch Service , Amazon S3 , and AWS Identity and Access Management (IAM). Amazon Titan Multimodal Embeddings model access in Amazon Bedrock.

AWS

AWS Engineering Serverless eCommerce

United Airlines sets its flight plan for gen AI success

CIO

DECEMBER 20, 2024

United claims to be among the earliest users of the Amazon SageMaker ML platform, and it has leveraged its own United Data Hub and AWS Bedrock-based Mars ML platform to create this first batch of production gen AI LLMs. These are prime applications for leveraging AI and many organizations are doing these things, Nag says.

Airlines

Airlines Generative AI Artificial Inteligence Weak Development Team

AWS Lambda Enhances Local IDE Experience With AI Support

Dzone - DevOps

JANUARY 27, 2025

AWS Lambda is enhancing the local IDE experience to make developing Lambda-based applications more efficient. These new features enable developers to author, build, debug, test, and deploy Lambda applications seamlessly within their local IDE using Visual Studio Code (VS Code).

Lambda

Lambda AWS Applications Testing

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Organizations building and deploying AI applications, particularly those using large language models (LLMs) with Retrieval Augmented Generation (RAG) systems, face a significant challenge: how to evaluate AI outputs effectively throughout the application lifecycle.

Knowledge Base

Knowledge Base Applications Artificial Inteligence Generative AI

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

What happens when you leak AWS credentials and how AWS minimizes the damage

Xebia

APRIL 5, 2023

I heard multiple times that AWS scans public GitHub repositories for AWS credentials and informs its users of the leaked credentials. So I am curious to see this for myself, so I decided to intentionally leak AWS credentials to a Public GitHub repository. Below you will find detailed information about every event.

AWS

AWS Policies Testing Linux

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

With demand for generative AI applications surging across projects and multiple lines of business, accurately allocating and tracking spend becomes more complex. This scalable, programmatic approach eliminates inefficient manual processes, reduces the risk of excess spending, and ensures that critical applications receive priority.

Generative AI

Generative AI AWS Artificial Inteligence Budget

Add a generative AI experience to your website or web application with Amazon Q embedded

AWS Machine Learning - AI

DECEMBER 19, 2024

However, adding generative AI assistants to your website or web application requires significant domain knowledge and the technical expertise to build, deploy, and maintain the infrastructure and end-user experience. Prerequisites In this section, we walk through how to set up an Amazon Q Business application, permissions, and user access.

Generative AI

Generative AI Applications AWS Examples

With Neptune Analytics, AWS combines the power of vector search and graph data

TechCrunch

NOVEMBER 29, 2023

There’s been a debate of sorts in AI circles about which database is more important in finding truthful information in generative AI applications: graph or vector databases. AWS decided to leave the debate to others by combining the best of both capabilities in a new service announced today at AWS re:Invent called Neptune Analytics.

AWS

AWS Analytics Generative AI Data

Enter the next phase of Industry 4.0 with edge AI

CIO

DECEMBER 3, 2024

Two things play an essential role in a firm’s ability to adapt successfully: its data and its applications. Which is why modernising applications is so important, especially for traditional businesses – they need to keep pace with the challenges facing trade and commerce nowadays. That’s why the issue is so important today.

Industry

Industry AWS Banking Agile

Oracle inks deal with AWS to offer database services

CIO

SEPTEMBER 10, 2024

In continuation of its efforts to help enterprises migrate to the cloud, Oracle said it is partnering with Amazon Web Services (AWS) to offer database services on the latter’s infrastructure. Oracle Database@AWS is expected to be available in preview later in the year with broader availability expected in 2025.

AWS

AWS Azure Database Administration Google Cloud

What is a cloud architect? A vital role for success in the cloud

CIO

APRIL 30, 2025

Some cloud architect roles are tailored to AWS or Azure while others may be targeted at specific knowledge areas such as infrastructure or blockchain. This credential certifies your ability to manage AWS applications and infrastructure, and the associate level exam is for those with at least one year of hands-on experience with AWS.

Cloud

Cloud AWS Azure Disaster Recovery

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

This blog post discusses an end-to-end ML pipeline on AWS SageMaker that leverages serverless computing, event-trigger-based data processing, and external API integrations. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Inteligence AWS Artificial Intelligence

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. Post-authentication, users access the UI Layer, a gateway to the Red Teaming Playground built on AWS Amplify and React.

Generative AI

Generative AI Weak Development Team AWS Artificial Inteligence

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

ComfyUI is an open source, node-based application that empowers users to generate images, videos, and audio using advanced AI models, offering a highly customizable workflow for creative projects. Start with 28 denoising steps to balance image quality and generation time. For the Guidance Scale (CFG), set it between 3.5–4.5

Engineering

Engineering AWS 3D Generative AI

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

Cloud computing Average salary: $124,796 Expertise premium: $15,051 (11%) Cloud computing has been a top priority for businesses in recent years, with organizations moving storage and other IT operations to cloud data storage platforms such as AWS. Its designed to achieve complex results, with a low learning curve for beginners and new users.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

Chinese firms bypass US export restrictions on AI chips using AWS cloud

CIO

AUGUST 23, 2024

Among these, four entities explicitly named Amazon Web Services (AWS) as their cloud service provider, accessing the services through Chinese intermediaries rather than directly from AWS. The report also shows how US companies are profiting from China’s increasing demand for computing resources.

AWS

AWS Technical Review Cloud Software Review

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

This capability enables Anthropics Claude models to identify whats on a screen, understand the context of UI elements, and recognize actions that should be performed such as clicking buttons, typing text, scrolling, and navigating between applications. Sonnet V2 and Anthropics Claude Sonnet 3.7 models on Amazon Bedrock.

AWS

AWS Generative AI Linux Groups

Amazon unveils Q, an AI-powered chatbot for businesses

TechCrunch

NOVEMBER 28, 2023

Amazon is launching an AI-powered chatbot for AWS customers called Q. Unveiled during a keynote at Amazon’s re:Invent conference in Las Vegas this morning, Q — starting at $20 per user per year — can answer questions like “how do I build a web application using AWS?”

AWS

AWS Conference Training Applications

Multi-LLM routing strategies for generative AI applications on AWS

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Trending Sources

Cross-Stack RDS User Provisioning and Schema Migrations with AWS Lambda

Webinars

Building Resilient Public Networking on AWS: Part 4

Monitoring AWS Container Environments at Scale

Accelerate AWS Well-Architected reviews with Generative AI

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Empower your generative AI application with a comprehensive custom observability solution

AWS Extends Generative AI Reach to Third-Party IT Platforms

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Build a multi-tenant generative AI environment for your enterprise on AWS

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Discover, Protect and Respond with AWS and Prisma Cloud

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

How AWS sales uses Amazon Q Business for customer engagement

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Automate actions across enterprise applications using Amazon Q Business plugins

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

From vision to value: A strategic approach to generative AI adoption

AWS launches new $30M accelerator program aimed at minority founders

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

United Airlines sets its flight plan for gen AI success

AWS Lambda Enhances Local IDE Experience With AI Support

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

What happens when you leak AWS credentials and how AWS minimizes the damage

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Add a generative AI experience to your website or web application with Amazon Q embedded

With Neptune Analytics, AWS combines the power of vector search and graph data

Enter the next phase of Industry 4.0 with edge AI

Oracle inks deal with AWS to offer database services

What is a cloud architect? A vital role for success in the cloud

Building a Scalable ML Pipeline and API in AWS

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

9 IT skills where expertise pays the most

Chinese firms bypass US export restrictions on AI chips using AWS cloud

Getting started with computer use in Amazon Bedrock Agents

Amazon unveils Q, an AI-powered chatbot for businesses

Stay Connected