Applications, AWS and Load Balancer

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. Choose the us-east-1 AWS Region from the top right corner. Choose Manage model access.

Generative AI

Generative AI AWS Artificial Intelligence Applications

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach: Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low-cost framework to run LLMs efficiently in a containerized environment. Adjust the following configuration to suit your needs, such as the Amazon EKS version, cluster name, and AWS Region.

AWS

AWS Load Balancer Software Review Artificial Intelligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

While organizations continue to discover the powerful applications of generative AI, adoption is often slowed down by team silos and bespoke workflows. It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. API Gateway also provides a WebSocket API.

Generative AI

Generative AI AWS Artificial Intelligence Enterprise

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Automate Application Load Balancers With AWS Load Balancer Controller and Ingress

Dzone - DevOps

JANUARY 17, 2024

Automating AWS Load Balancers is essential for managing cloud infrastructure efficiently. This article delves into the importance of automation using the AWS Load Balancer controller and Ingress template. A high-level illustration of AWS Application Load Balancer with Kubernetes cluster

Load Balancer

Load Balancer AWS Applications Infrastructure

VM-Series Virtual Firewalls Integrate With AWS Gateway Load Balancer

Palo Alto Networks

NOVEMBER 10, 2020

The just-announced general availability of the integration between VM-Series virtual firewalls and the new AWS Gateway Load Balancer (GWLB) introduces customers to massive security scaling and performance acceleration – while bypassing the awkward complexities traditionally associated with inserting virtual appliances in public cloud environments.

Load Balancer

Load Balancer Firewall AWS Virtualization

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. Post-authentication, users access the UI Layer, a gateway to the Red Teaming Playground built on AWS Amplify and React.

Generative AI

Generative AI Weak Development Team AWS Artificial Intelligence

Microservices on AWS [Video]

Dzone - DevOps

JUNE 2, 2021

In this tutorial, I will explain different CI/CD concepts and tools provided by AWS for continuous integration and continuous delivery. I will create a Spring Boot microservice and deploy it to AWS EC2 instances running behind an Application Load Balancer in an automated way using AWS CodePipeline.

Microservices

Microservices AWS Video Load Balancer

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and reduce operational inefficiencies. Its serverless architecture allowed the team to rapidly prototype and refine their application without the burden of managing complex hardware infrastructure.

Generative AI

Generative AI CTO Coach AWS Artificial Intelligence

Building Resilient Public Networking on AWS: Part 2

Xebia

JANUARY 18, 2024

Deploy Secure Public Web Endpoints: Welcome to Building Resilient Public Networking on AWS, our comprehensive blog series on advanced networking strategies tailored for regional evacuation, failover, and robust disaster recovery. We laid the groundwork for understanding the essentials that underpin the forthcoming discussions.

AWS

AWS Network Load Balancer Software Review

Cloud Load Balancing- Facilitating Performance & Efficiency of Cloud Resources

RapidValue

APRIL 20, 2020

Cloud load balancing is the process of distributing workloads and computing resources within a cloud environment. Cloud load balancing also involves hosting the distribution of workload traffic within the internet.

Load Balancer

Load Balancer Resources Cloud Performance

Deploy a Clojure web application to AWS using Terraform

CircleCI

JUNE 27, 2019

This is the third blog post in a three-part series about building, testing, and deploying a Clojure web application. If you don’t want to go through the laborious task of creating the web application described in the first two posts from scratch, you can get the source by forking this repository and checking out the part-2 branch.

AWS

AWS Film Load Balancer Applications

Securing S3 Downloads with ALB and Cognito Authentication

Xebia

FEBRUARY 25, 2025

AWS has a service called Cognito that allows you to manage a pool of users. I am using an Application Load Balancer to invoke a Lambda function. In this case, we can use the native Cognito integration of the application load balancer. First, we need to make sure that we know who the user is.

Authentication

Authentication Load Balancer Lambda AWS

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isn't just about moving data; it requires careful planning and execution. Need to hire skilled engineers?

AWS

AWS Cloud Weak Development Team DevOps

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Kaseya

MAY 13, 2021

As the name suggests, a cloud service provider is essentially a third-party company that offers a cloud-based platform for application, infrastructure or storage services. In this blog, we’ll compare the three leading public cloud providers, namely Amazon Web Services (AWS), Microsoft Azure and Google Cloud. Scalability and Elasticity.

Google Cloud

Google Cloud Azure AWS Cloud

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. However, the complexity of vLLM currently limits ease of implementing custom integrations for applications. vLLM also has limited quantization support.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Storage

How to Deploy Tomcat App using AWS ECS Fargate with Load Balancer

Perficient

FEBRUARY 7, 2023

Amazon Elastic Container Service (ECS): It is a highly scalable, high-performance container management service that supports Docker containers and lets you run applications easily on a managed cluster of Amazon EC2 instances. Before that, let’s create a load balancer by performing the following steps.

Load Balancer

Load Balancer AWS Serverless How To

Can VPC Lattice replace AWS Transit Gateway?

Xebia

AUGUST 29, 2023

VPC Lattice offers a new mechanism to connect microservices across AWS accounts and across VPCs in a developer-friendly way. Or if you have an existing landing zone with AWS Transit Gateway, do you already plan to replace it with VPC Lattice? You can also use AWS PrivateLink to inter-connect your VPCs across accounts.

AWS

AWS Load Balancer Microservices Lambda

AWS Disaster Recovery Strategies – PoC with Terraform

Xebia

DECEMBER 21, 2022

A regional failure is an uncommon event in AWS (and other Public Cloud providers), where all Availability Zones (AZs) within a region are affected by any condition that impedes the correct functioning of the provisioned Cloud infrastructure. For demonstration purposes, we are using HTTP instead of HTTPS. Pilot Light strategy diagram.

Disaster Recovery

Disaster Recovery AWS Strategy Backup

Mastering AWS Infrastructure as Code with Pulumi and Python

Perficient

MARCH 27, 2025

Unlike Terraform, which uses HCL, Pulumi enables you to define infrastructure using Python, making it easier for developers to integrate infrastructure with application code. Multi-Cloud and Multi-Language Support: Deploy across AWS, Azure, and Google Cloud with Python, TypeScript, Go, or .NET.

AWS

AWS Infrastructure Lambda Load Balancer

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

AWS Machine Learning - AI

JANUARY 24, 2024

Therefore, it’s important to understand and control the flow of your data through the generative AI application: Where is the model located? This post discusses how enterprises can build accurate, transparent, and secure generative AI applications while keeping full control over proprietary data. Where is the data processed?

Generative AI

Generative AI AWS Artificial Intelligence Enterprise

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

Additionally, SageMaker endpoints support automatic load balancing and autoscaling, enabling your LLM deployment to scale dynamically based on incoming requests. During non-peak hours, the endpoint can scale down to zero, optimizing resource usage and cost efficiency.

Artificial Intelligence

Artificial Intelligence AWS Machine Learning Load Balancer

Build a custom UI for Amazon Q Business

AWS Machine Learning - AI

JUNE 12, 2024

The workflow includes the following steps: The user accesses the chatbot application, which is hosted behind an Application Load Balancer. For more information about trusted token issuers and how token exchanges are performed, see Using applications with a trusted token issuer.

Load Balancer

Load Balancer AWS Authentication Applications

AWS Open Source Observability: Visualization and Security Auditing with CloudMapper (Part 1)

Xebia

FEBRUARY 23, 2023

For this reason, it is common for users to integrate third-party applications to fulfill their requirements. Visualization and AWS: There are many paid options to dynamically visualize your AWS environment as a complete diagram. After setting up CloudMapper, make sure you have configured your AWS CLI dependencies.

Open Source

Open Source AWS Load Balancer Disaster Recovery

Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

AWS Machine Learning - AI

SEPTEMBER 17, 2024

Generative artificial intelligence (AI) applications are commonly built using a technique called Retrieval Augmented Generation (RAG) that provides foundation models (FMs) access to additional data they didn’t have during training. The post is co-written with Michael Shaul and Sasha Korman from NetApp.

Generative AI

Generative AI AWS Applications Serverless

Build, test, and deploy a Go application to AWS ECS

CircleCI

SEPTEMBER 11, 2019

In this tutorial example, we will deploy a simple Go application to Amazon EC2 Container Service (ECS). Create and configure an Amazon Elastic Load Balancer (ELB) and target group that will associate with our cluster’s ECS service. Use the DNS name on our ELB to access the application (to test that it works).

AWS

AWS Load Balancer Applications Testing

VPC Service Controls – A step by step guide

Xebia

JANUARY 23, 2025

For ingress access to your application, services like Cloud Load Balancer should be preferred and for egress to the public internet a service like Cloud NAT. This can cause different problems for applications that in some ways depend on having internet access or even accessing Google services operations.

Policies

Policies Storage Google Cloud Cloud

How to Use Application Load Balancer and Amazon Cognito to Authenticate Users for Your Kubernetes Web Apps

DevOps.com

JULY 14, 2023

This post describes how to use Amazon Cognito to authenticate users for web apps running in an Amazon Elastic Kubernetes Service (Amazon EKS) cluster.

Load Balancer

Load Balancer Authentication How To Applications

Deploy Django apps to AWS Elastic Beanstalk

CircleCI

NOVEMBER 15, 2022

This tutorial covers: Setting up a Django application on AWS. Just as dev teams can now build APIs with JavaScript, they can also build web applications powered by Python. And more tooling providers are adding support for Python-based applications in their service offering. AWS account. Prerequisites.

AWS

AWS Software Review Authentication Load Balancer

9 Best Free Node.js Hosting 2023

The Crazy Programmer

SEPTEMBER 25, 2023

Constant deployment that will keep applications updated. Vercel: Earlier known as Zeit, Vercel acts as a layer on top of AWS Lambda, which makes running your applications easy. Even though Vercel mainly focuses on front-end applications, it has built-in support that will host serverless Node.js

Serverless

Serverless AWS Google Cloud Azure

A Guide to Automating AWS Infrastructure Deployment

Dzone - DevOps

FEBRUARY 4, 2025

When it comes to managing infrastructure in the cloud, AWS provides several powerful tools that help automate the creation and management of resources. One of the most effective ways to handle deployments is through AWS CloudFormation.

AWS

AWS Infrastructure Load Balancer Serverless

Why you must extend Zero Trust to public cloud workloads

CIO

NOVEMBER 8, 2023

Zscaler’s zero trust-based architecture secures workloads in the public cloud. With Zscaler Workload Communication, you can eliminate lateral movement: Zscaler zero trust architecture ensures least-privileged access for cloud workloads and applications.

Cloud

Cloud Spyware AWS Malware

Advanced RAG patterns on Amazon SageMaker

AWS Machine Learning - AI

MARCH 28, 2024

These generative AI applications are not only used to automate existing business processes, but also have the ability to transform the experience for customers using these applications. Mixtral-8x7B uses an MoE architecture.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Load Balancer

Getting started with cross-region inference in Amazon Bedrock

AWS Machine Learning - AI

AUGUST 27, 2024

Amazon Bedrock has emerged as the preferred choice for numerous customers seeking to innovate and launch generative AI applications, leading to an exponential surge in demand for model inference capabilities. As a result, customers can enhance their applications’ reliability, performance, and efficiency.

AWS

AWS Generative AI Load Balancer Applications

Netflix Information Security: Preventing Credential Compromise in AWS

Netflix Tech

NOVEMBER 28, 2018

By Will Bengtson. Previously, we wrote about a method for detecting credential compromise in your AWS environment. If an attacker has remote code execution (RCE) or local presence on the AWS server, the methods discussed will not prevent compromise. The originating IP address will be one from AWS and not reflect what is in your policy.

AWS

AWS Policies Software Review Network

Why AWS PrivateLink Is Not Recommended With Kafka or Cassandra

Instaclustr

JULY 28, 2021

AWS PrivateLink (also known as a VPC endpoint) is a technology that allows the user to securely access services using a private IP address. It is not recommended to configure an AWS PrivateLink connection with Apache Kafka or Apache Cassandra mainly due to a single entry point problem. Kafka Connection Without AWS PrivateLink.

AWS

AWS Load Balancer Technical Review Policies

Top 10 Most Popular Hands-On Labs

Linux Academy

NOVEMBER 19, 2019

Creating and configuring Secure AWS RDS Instances with a Reader and Backup Solution. In this live AWS environment, you will learn how to create an RDS database, then successfully implement a read replica and backups for that database. Elastic Compute Cloud (EC2) is AWS’s Infrastructure as a Service product.

Load Balancer

Load Balancer AWS Backup Linux

AWS Elastic Beanstalk: Simplifying Web Application Deployment

Perficient

JULY 31, 2024

In today’s fast-paced digital world, deploying and managing web applications efficiently is crucial for developers and businesses alike. AWS Elastic Beanstalk offers a powerful and user-friendly platform to streamline this process, allowing you to focus on writing code rather than managing infrastructure.

AWS

AWS Applications Load Balancer Software Review

AWS – The Silver Lining of the Cloud

RapidValue

MAY 15, 2019

What does AWS say to the other competing cloud computing services out there? AWS has five times as much deployed cloud infrastructure as its next 14 competitors have in aggregate. So how does AWS do it? However, that has not been the only advantage that AWS has had over the others. In the words of Arya Stark, “Not Today!”

AWS

AWS Cloud Load Balancer Infrastructure

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning - AI

SEPTEMBER 3, 2024

From deriving insights to powering generative artificial intelligence (AI)-driven applications, the ability to efficiently process and analyze large datasets is a vital capability. That’s where the new Amazon EMR Serverless application integration in Amazon SageMaker Studio can help.

Serverless

Serverless AWS Artificial Intelligence Big Data

Troubleshooting HTTP 502 Bad Gateway in AWS EBS

Dzone - DevOps

JUNE 20, 2022

The application that we are going to discuss in this post was running on the Elastic Beanstalk (EBS) service in Amazon Web Services (AWS). Intermittently, this application was throwing an HTTP 502 Bad Gateway error. AWS Elastic Beanstalk Architecture.

AWS

AWS Load Balancer Linux Architecture

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon

AWS Machine Learning - AI

SEPTEMBER 13, 2024

We are announcing the availability of sticky session routing on Amazon SageMaker Inference, which helps customers improve the performance and user experience of their generative AI applications by leveraging their previously processed information. This feature is available in all AWS Regions where SageMaker is available.

Generative AI

Generative AI Applications Artificial Intelligence AWS

Enhance conversational AI with advanced routing techniques with Amazon Bedrock

AWS Machine Learning - AI

APRIL 24, 2024

With AWS generative AI services like Amazon Bedrock, developers can create systems that expertly manage and respond to user requests. Agents for Amazon Bedrock approach: Agents for Amazon Bedrock allows you to build generative AI applications that can run multi-step tasks across a company’s systems and data sources.

Artificial Intelligence

Artificial Intelligence Lambda Knowledge Base IoT
