Applications, AWS and Infrastructure

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. This strategy results in more robust, versatile, and efficient applications that better serve diverse user needs and business objectives. In this post, we provide an overview of common multi-LLM applications.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. Choose the us-east-1 AWS Region from the top right corner. Choose Manage model access.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Cross-Stack RDS User Provisioning and Schema Migrations with AWS Lambda

Xebia

MARCH 4, 2025

Use identity and access management (AWS IAM). You can compare these credentials with the root credentials of a Linux system or the root account for your AWS account. You could use AWS IAM, and this will give us the ability to be more least privileged. Afterward, your user is ready to use your application.

Lambda

Lambda AWS Authentication Linux

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

Building cloud infrastructure based on proven best practices promotes security, reliability and cost efficiency. To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. In this post, we set up the custom solution for observability and evaluation of Amazon Bedrock applications.

Generative AI

Generative AI Applications AWS Knowledge Base

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Discover, Protect and Respond with AWS and Prisma Cloud

Prisma Clud

NOVEMBER 22, 2024

Unmanaged cloud resources, human error, misconfigurations and the increasing sophistication of cyber threats, including those from AI-powered applications, create vulnerabilities that can expose sensitive data and disrupt business operations. Enhance Security Posture – Proactively identify and mitigate threats to your AWS infrastructure.

AWS

AWS Cloud Network Compliance

What is a cloud architect? A vital role for success in the cloud

CIO

APRIL 30, 2025

As organizations continue to implement cloud-based AI services, cloud architects will be tasked with ensuring the proper infrastructure is in place to accommodate growth. Organizations have accelerated cloud adoption now that AI tools are readily available, which has driven a demand for cloud architects to help manage cloud infrastructure.

Cloud

Cloud AWS Azure Disaster Recovery

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. You can obtain the SageMaker Unified Studio URL for your domains by accessing the AWS Management Console for Amazon DataZone.

Generative AI

Generative AI Applications Technical Review Software Review

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

Cloud providers have recognized the need to offer model inference through an API call, significantly streamlining the implementation of AI within applications. AWS Step Functions is a fully managed service that makes it easier to coordinate the components of distributed applications and microservices using visual workflows.

Generative AI

Generative AI AWS Technical Review Backup

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search With Amazon Q Business, weve made internal data sources as well as public AWS content available in Field Advisors index.

AWS

AWS Generative AI Technical Review Artificial Inteligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod.

AWS

AWS Artificial Inteligence Generative AI Training

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

While organizations continue to discover the powerful applications of generative AI , adoption is often slowed down by team silos and bespoke workflows. It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. These dimensions make up the foundation for developing and deploying AI applications in a responsible and safe manner.

Applications

Applications Generative AI AWS Artificial Inteligence

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment. Adjust the following configuration to suit your needs, such as the Amazon EKS version, cluster name, and AWS Region.

AWS

AWS Load Balancer Software Review Artificial Inteligence

Oracle inks deal with AWS to offer database services

CIO

SEPTEMBER 10, 2024

In continuation of its efforts to help enterprises migrate to the cloud, Oracle said it is partnering with Amazon Web Services (AWS) to offer database services on the latter’s infrastructure. This is Oracle’s third partnership with a hyperscaler to offer its database services on the hyperscaler’s infrastructure.

AWS

AWS Azure Database Administration Google Cloud

Choosing a cloud infrastructure provider: A beginner’s guide

TechCrunch

FEBRUARY 6, 2023

Developers at startups thought they could maintain multiple application code bases that work independently with each cloud provider. Deploying cloud infrastructure also involves analyzing tools and software solutions, like application monitoring and activity logging, leading many developers to suffer from analysis paralysis.

Infrastructure

Infrastructure Cloud Minimum Viable Product Weak Development Team

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

Cloud computing Average salary: $124,796 Expertise premium: $15,051 (11%) Cloud computing has been a top priority for businesses in recent years, with organizations moving storage and other IT operations to cloud data storage platforms such as AWS. Its designed to achieve complex results, with a low learning curve for beginners and new users.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

Alchemy raises $80M at a $505M valuation to be the ‘AWS for blockchain’

TechCrunch

APRIL 28, 2021

Alchemy’s goal is to be the starting place for developers considering to build a product on top of a blockchain or mainstream blockchain applications. Its developer platform aims to remove the complexity and costs of building infrastructure while improving applications through “necessary” developer tools. While in Web 2.0,

Blockchain

Blockchain AWS Comparison Industry

Add a generative AI experience to your website or web application with Amazon Q embedded

AWS Machine Learning - AI

DECEMBER 19, 2024

However, adding generative AI assistants to your website or web application requires significant domain knowledge and the technical expertise to build, deploy, and maintain the infrastructure and end-user experience. Amazon Q Business application The Amazon Q embedded feature requires an Amazon Q Business application.

Generative AI

Generative AI Applications AWS Examples

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

AWS Machine Learning - AI

JANUARY 13, 2025

Amazon Q Business as a web experience makes AWS best practices readily accessible, providing cloud-centered recommendations quickly and making it straightforward to access AWS service functions, limits, and implementations. This post covers how to integrate Amazon Q Business into your enterprise setup.

Generative AI

Generative AI AWS Innovation Knowledge Base

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

Although the principles discussed are applicable across various industries, we use an automotive parts retailer as our primary example throughout this post. A web application serves as the frontend interface where users can initiate parts lookup requests. A user interacts with the Car Parts Agent through a web application interface.

Lambda

Lambda Enterprise Automotive Knowledge Base

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

This capability enables Anthropics Claude models to identify whats on a screen, understand the context of UI elements, and recognize actions that should be performed such as clicking buttons, typing text, scrolling, and navigating between applications. Sonnet V2 and Anthropics Claude Sonnet 3.7 models on Amazon Bedrock.

AWS

AWS Generative AI Linux Groups

Marsh McLennan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Elevate customer experience by using the Amazon Q Business custom plugin for New Relic AI

AWS Machine Learning - AI

DECEMBER 3, 2024

Application failures, slow load times, and service unavailability can lead to user frustration, decreased engagement, and revenue loss. 45% of support engineers, application engineers, and SREs use five different monitoring tools on average. It also offers direct links to detailed New Relic interfaces.

Technical Review

Technical Review AWS eCommerce Systems Review

Mastering AWS Infrastructure as Code with Pulumi and Python

Perficient

MARCH 27, 2025

Pulumi is a modern Infrastructure as Code (IaC) tool that allows you to define, deploy, and manage cloud infrastructure using general-purpose programming languages. Multi-Cloud and Multi-Language Support Deploy across AWS, Azure, and Google Cloud with Python, TypeScript, Go, or.NET. A history of deployments and updates.

AWS

AWS Infrastructure Lambda Load Balancer

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In todays fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. Technology modernization strategy : Evaluate the overall IT landscape through the lens of enterprise architecture and assess IT applications through a 7R framework.

Cloud

Cloud Strategy Architecture Policies

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. You can access your imported custom models on-demand and without the need to manage underlying infrastructure.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Lumigo raises $29M for its cloud-native application monitoring platform

TechCrunch

NOVEMBER 2, 2021

Lumigo , a cloud-native application monitoring and debugging platform, today announced that it has raised a $29 million Series A funding round led by Redline Capital. The company started with a focus on distributed tracing for serverless platforms like AWS’ API Gateway, DynamoDB, S3 and Lambda. Image Credits: Lumigo.

Applications

Applications Cloud Lambda Serverless

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

AWS Machine Learning - AI

NOVEMBER 22, 2024

Seamless integration of latest foundation models (FMs), Prompts, Agents, Knowledge Bases, Guardrails, and other AWS services. Reduced time and effort in testing and deploying AI workflows with SDK APIs and serverless infrastructure. They spend a lot of time and effort in troubleshooting issues in their application.

Generative AI

Generative AI Artificial Inteligence Knowledge Base AWS

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

With serverless components, there is no need to manage infrastructure, and the inbuilt tracing, logging, monitoring and debugging make it easy to run these workloads in production and maintain service levels. Legacy infrastructure. Our cloud strategy was to use a single cloud provider for our enterprise cloud platform AWS.

Serverless

Serverless Architecture Microservices Scalability

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

The challenge: Enabling self-service cloud governance at scale Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure. The CCoE implemented AWS Organizations across a substantial number of business units.

Generative AI

Generative AI Government Technical Review Innovation

Delivering better business outcomes for CIOs

CIO

NOVEMBER 4, 2024

Facing increasing demand and complexity CIOs manage a complex portfolio spanning data centers, enterprise applications, edge computing, and mobile solutions, resulting in a surge of apps generating data that requires analysis. Enterprise IT struggles to keep up with siloed technologies while ensuring security, compliance, and cost management.

Data Center

Data Center Recruiting Cloud Government

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. Through Bedrock Marketplace, organizations can use Nemotron’s advanced capabilities while benefiting from the scalable infrastructure of AWS and NVIDIA’s robust technologies. You can find him on LinkedIn.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

AWS Machine Learning - AI

APRIL 21, 2025

A prompt that works well in one scenario may underperform in another, necessitating extensive customization and fine-tuning for different applications. Therefore, developing a universally applicable prompt optimization method that generalizes well across diverse tasks remains a significant challenge.

Artificial Inteligence

Artificial Inteligence Groups Applications Innovation

Oracle launches AI Agent Studio for Fusion Cloud to retain customers

CIO

MARCH 20, 2025

Oracle has added a new AI Agent Studio to its Fusion Cloud business applications, at no additional cost, in an effort to retain its enterprise customers as rival software vendors ramp up their agent-based offerings with the aim of garnering more market share. billion in 2024, is expected to grow at a CAGR of 45.8%

Cloud

Cloud Artificial Inteligence Enterprise AWS

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Dzone - DevOps

APRIL 9, 2025

Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. By abstracting the complexities of infrastructure, AWS enables teams to focus on innovation. Why Combine AI, ML, and Serverless Computing?

Serverless

Serverless Artificial Inteligence Scalability AWS

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

These latency-sensitive applications enable real-time text and voice interactions, responding naturally to human conversations. Their applications span a variety of sectors, including customer service, healthcare, education, personal and business productivity, and many others. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

‘AWS for blockchain’ Alchemy boosts valuation to $3.5B with $250M raise

TechCrunch

OCTOBER 28, 2021

Put simply, Alchemy wants to do for blockchain and Web3 what AWS (Amazon Web Services) did for the internet. ?? The startup’s goal is to be the starting place for developers considering building a product on top of a blockchain or mainstream blockchain applications. ” Google chairman and Stanford University President John L.

Blockchain

Blockchain AWS Internet Comparison

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.

Scalability

Scalability Lambda Generative AI AWS

Marsh McLellan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Multi-LLM routing strategies for generative AI applications on AWS

Build and deploy a UI for your generative AI applications with AWS and Python

Webinars

Trending Sources

Building Resilient Public Networking on AWS: Part 4

Webinars

Cross-Stack RDS User Provisioning and Schema Migrations with AWS Lambda

Accelerate AWS Well-Architected reviews with Generative AI

Empower your generative AI application with a comprehensive custom observability solution

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Discover, Protect and Respond with AWS and Prisma Cloud

What is a cloud architect? A vital role for success in the cloud

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

How AWS sales uses Amazon Q Business for customer engagement

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Build a multi-tenant generative AI environment for your enterprise on AWS

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Oracle inks deal with AWS to offer database services

Choosing a cloud infrastructure provider: A beginner’s guide

9 IT skills where expertise pays the most

Alchemy raises $80M at a $505M valuation to be the ‘AWS for blockchain’

Add a generative AI experience to your website or web application with Amazon Q embedded

Boosting team innovation, productivity, and knowledge sharing with Amazon Q Business – Web experience

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

Getting started with computer use in Amazon Bedrock Agents

Marsh McLennan IT reorg lays foundation for gen AI

Elevate customer experience by using the Amazon Q Business custom plugin for New Relic AI

Mastering AWS Infrastructure as Code with Pulumi and Python

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Lumigo raises $29M for its cloud-native application monitoring platform

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

Can serverless fix fintech’s scaling problem?

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

Delivering better business outcomes for CIOs

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

Oracle launches AI Agent Studio for Fusion Cloud to retain customers

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Reduce conversational AI response time through inference at the edge with AWS Local Zones

‘AWS for blockchain’ Alchemy boosts valuation to $3.5B with $250M raise

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Marsh McLellan IT reorg lays foundation for gen AI

Stay Connected