AWS, Load Balancer and Reference

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment. For more information on how to view and increase your quotas, refer to Amazon EC2 service quotas.

AWS

AWS Load Balancer Software Review Artificial Inteligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. Shared components refer to the functionality and features shared by all tenants. Load balancer – Another option is to use a load balancer that exposes an HTTPS endpoint and routes the request to the orchestrator.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services , including Amazon Bedrock , which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Why LoRAX for LoRA deployment on AWS? The surge in popularity of fine-tuning LLMs has given rise to multiple inference container methods for deploying LoRA adapters on AWS. Prerequisites For this guide, you need access to the following prerequisites: An AWS account Proper permissions to deploy EC2 G6 instances.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

It is designed to handle the demanding computational and latency requirements of state-of-the-art transformer models, including Llama, Falcon, Mistral, Mixtral, and GPT variants for a full list of TGI supported models refer to supported models. For a complete list of runtime configurations, please refer to text-generation-launcher arguments.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Building Resilient Public Networking on AWS: Part 2

Xebia

JANUARY 18, 2024

Deploy Secure Public Web Endpoints Welcome to Building Resilient Public Networking on AWS—our comprehensive blog series on advanced networking strategies tailored for regional evacuation, failover, and robust disaster recovery. We laid the groundwork for understanding the essentials that underpin the forthcoming discussions.

AWS

AWS Network Load Balancer Software Review

Can VPC Lattice replace AWS Transit Gateway?

Xebia

AUGUST 29, 2023

VPC Lattice offers a new mechanism to connect microservices across AWS accounts and across VPCs in a developer-friendly way. Or if you have an existing landing zone with AWS Transit Gateway, do you already plan to replace it with VPC Lattice? You can also use AWS PrivateLink to inter-connect your VPCs across accounts.

AWS

AWS Load Balancer Microservices Lambda

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Kaseya

MAY 13, 2021

In addition, you can also take advantage of the reliability of multiple cloud data centers as well as responsive and customizable load balancing that evolves with your changing demands. In this blog, we’ll compare the three leading public cloud providers, namely Amazon Web Services (AWS), Microsoft Azure and Google Cloud.

Google Cloud

Google Cloud Azure AWS Cloud

Build a custom UI for Amazon Q Business

AWS Machine Learning - AI

JUNE 12, 2024

The workflow includes the following steps: The user accesses the chatbot application, which is hosted behind an Application Load Balancer. For instructions, refer to How do I integrate IAM Identity Center with an Amazon Cognito user pool and the associated demo video. For more details, refer to Importing a certificate.

Load Balancer

Load Balancer AWS Authentication Applications

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isnt just about moving data; it requires careful planning and execution. Need to hire skilled engineers?

AWS

AWS Cloud Weak Development Team DevOps

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

AWS Machine Learning - AI

JANUARY 24, 2024

We demonstrate how to build an end-to-end RAG application using Cohere’s language models through Amazon Bedrock and a Weaviate vector database on AWS Marketplace. Additionally, you can securely integrate and easily deploy your generative AI applications using the AWS tools you are already familiar with.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Managing multiple environments in the AWS CDK using YAML configuration files

Xebia

MAY 31, 2023

The DTAP street refers to the progression of software through different stages, starting from development and testing to final deployment in the production environment. 16 Let’s see how we can parse the previous information per environment by diving into the details of how to implement this in the AWS CDK using Python and Go.

AWS

AWS Construction Infrastructure Load Balancer

Deploy a Clojure web application to AWS using Terraform

CircleCI

JUNE 27, 2019

Leiningen - Leiningen, usually referred to as lein (pronounced ‘line’) is the most commonly used Clojure build tool. AWS account - Amazon Web Services provides on-demand computing platforms. Note: The infrastructure we are going to build will involve a small cost in standing up the AWS services we require.

AWS

AWS Film Load Balancer Applications

Advanced RAG patterns on Amazon SageMaker

AWS Machine Learning - AI

MARCH 28, 2024

For more information on Mixtral-8x7B Instruct on AWS, refer to Mixtral-8x7B is now available in Amazon SageMaker JumpStart. For more detailed and step-by-step instructions, refer to the Advanced RAG Patterns with Mixtral on SageMaker Jumpstart GitHub repo. Before you get started with the solution, create an AWS account.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Load Balancer

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning - AI

SEPTEMBER 3, 2024

Reduced operational overhead – The EMR Serverless integration with AWS streamlines big data processing by managing the underlying infrastructure, freeing up your team’s time and resources. Runtime roles are AWS Identity and Access Management (IAM) roles that you can specify when submitting a job or query to an EMR Serverless application.

Serverless

Serverless AWS Artificial Inteligence Big Data

Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

AWS Machine Learning - AI

SEPTEMBER 17, 2024

In this post, we demonstrate a solution using Amazon FSx for NetApp ONTAP with Amazon Bedrock to provide a RAG experience for your generative AI applications on AWS by bringing company-specific, unstructured user file data to Amazon Bedrock in a straightforward, fast, and secure way. Install the AWS Command Line Interface (AWS CLI).

Generative AI

Generative AI AWS Applications Serverless

Build, test, and deploy a Go application to AWS ECS

CircleCI

SEPTEMBER 11, 2019

Create and configure an Amazon Elastic Load Balancer (ELB) and target group that will associate with our cluster’s ECS service. Configure CircleCI using the circleci/aws-ecr@6.2.0 Configure CircleCI using the circleci/aws-ecs@0.0.11 A service configuration references a task definition. A simple Go application.

AWS

AWS Load Balancer Applications Testing

Getting started with cross-region inference in Amazon Bedrock

AWS Machine Learning - AI

AUGUST 27, 2024

Currently, users might have to engineer their applications to handle scenarios involving traffic spikes that can use service quotas from multiple regions by implementing complex techniques such as client-side load balancing between AWS regions, where Amazon Bedrock service is supported.

AWS

AWS Generative AI Load Balancer Applications

Build generative AI chatbots using prompt engineering with Amazon Redshift and Amazon Bedrock

AWS Machine Learning - AI

FEBRUARY 14, 2024

In this solution, we demonstrate how we can generate a custom, personalized travel itinerary that users can reference, which will be generated based on their hobbies, interests, favorite foods, and more. Prerequisites Before you deploy this solution, make sure you have the following prerequisites set up: A valid AWS account.

Generative AI

Generative AI Engineering Artificial Inteligence Travel

5 Best Practices for Optimizing PeopleSoft Performance on AWS

Datavail

JANUARY 18, 2024

Optimizing the performance of PeopleSoft enterprise applications is crucial for empowering businesses to unlock the various benefits of Amazon Web Services (AWS) infrastructure effectively. Research indicates that AWS has approximately five times more deployed cloud infrastructure than their next 14 competitors.

AWS

AWS Performance Load Balancer Scalability

Unlock the power of structured data for enterprises using natural language with Amazon Q Business

AWS Machine Learning - AI

AUGUST 20, 2024

Steps 3 and 4 augment the AWS IAM Identity Center integration with Amazon Q Business for an authorization flow. The workflow includes the following steps: The user initiates the interaction with the Streamlit application, which is accessible through an Application Load Balancer, acting as the entry point.

Enterprise

Enterprise Data Artificial Inteligence Technical Review

Moving to the Cloud: Exploring the API Gateway to Success

Daniel Bryant

SEPTEMBER 16, 2022

When we talk about both technologies, we refer to the end user’s experience in achieving a successful API call within an environment. In Kubernetes, there are various choices for load balancing external traffic to pods, each with different tradeoffs. That is, “should I start with an API gateway or use a Service Mesh ?”

Load Balancer

Load Balancer Cloud Continuous Delivery Microservices

Announcing Complete Azure Observability for Kentik Cloud

Kentik

JUNE 27, 2023

Live traffic flow arrows demonstrate how Azure Express Routes, Firewalls, Load Balancers, Application Gateways, and VWANs connect in the Kentik Map, which updates dynamically as topology changes for effortless architecture reference. Why do you need complete network telemetry?

Azure

Azure Cloud Load Balancer Firewall

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

AWS Machine Learning - AI

AUGUST 8, 2024

Webex works with the world’s leading business and productivity apps—including AWS. The following diagram illustrates the WxAI architecture on AWS. The model details are as follows: Call driver extraction – This generative model summarizes the primary reason or intent (referred to as the call driver ) behind a customer’s call.

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Announcing Honeycomb support for event ingestion with OTLP

Honeycomb

DECEMBER 15, 2020

Today, AWS announced enhancements for AWS Distro for OpenTelemetry. We’re working with AWS to build in additional support from partners. Using Honeycomb’s OTLP event ingestion with AWS. You can refer to the AWS Distro OpenTelemetry docs for more information. It’s that simple!

AWS

AWS Load Balancer Software Review Infrastructure

AWS vs Google Cloud Pricing – A Comprehensive Look

ParkMyCloud

SEPTEMBER 30, 2019

Since ParkMyCloud provides cost control for Amazon Web Services (AWS) along with Google Cloud Platform (GCP) resources, we thought it might be useful to compare AWS vs Google Cloud pricing. There are other “services” involved, such as networking, storage and load balancing, when looking at your overall bill.

Google Cloud

Google Cloud AWS Cloud Sustainability

Vitech uses Amazon Bedrock to revolutionize information access with AI-powered chatbot

AWS Machine Learning - AI

MAY 30, 2024

With Bedrock’s serverless experience, one can get started quickly, privately customize FMs with their own data, and easily integrate and deploy them into applications using the AWS tools without having to manage any infrastructure. Vitech thereby selected Amazon Bedrock to host LLMs and integrate seamlessly with their existing infrastructure.

Artificial Inteligence

Artificial Inteligence Technical Review Development Team Review Software Review

SaaS Platfrom Development – How to Start

Existek

MARCH 24, 2025

These objectives can refer to increased market share, expansion to new segments, or higher user retention. Creating a product roadmap The roadmap balances your short-term needs and long-term goals with SaaS platform development. It must be tested under different conditions so it is prepared to perform well even in peak loads.

Development

Development How To Technical Review Quality Assurance

Apps in the Cloud: Deploying Your App Using Docker Containers and AWS

Gorilla Logic

SEPTEMBER 23, 2019

At the end of this post , you will have utilized Docker containers and AWS to create a good starting point and a tangible cloud foundation that will be agnostic but, at the same time, the canvas on which your application will draw its next iteration in the cloud deployment process. All AWS resources used here are free.

AWS

AWS Load Balancer Cloud Virtualization

AWS vs Azure vs Google Cloud: What’s the best cloud platform?

Openxcell

MAY 12, 2023

Through AWS, Azure, and GCP’s respective cloud platforms, customers have access to a variety of storage, computation, and networking options.Some of the features shared by all three systems include fast provisioning, self-service, autoscaling, identity management, security, and compliance. What is AWS Cloud Platform?:

Google Cloud

Google Cloud Azure AWS Cloud

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon

AWS Machine Learning - AI

SEPTEMBER 13, 2024

This feature is available in all AWS Regions where SageMaker is available. For more about this feature, refer to Stateful sessions with Amazon SageMaker models. SageMaker has implemented a robust solution that combines two key strategies: sticky session routing in SageMaker with load balancing, and stateful sessions in TorchServe.

Generative AI

Generative AI Applications Artificial Inteligence AWS

PeopleSoft on AWS: Understanding Design Methods and Scaling Functionality

Datavail

JANUARY 23, 2024

With the rapidly increasing adoption of cloud computing solutions, deploying PeopleSoft applications on Amazon Web Services (AWS) has become extremely popular for modern businesses trying to improve the flexibility and scalability of their business processes. Studies have shown that AWS currently has more than 1 million users.

AWS

AWS Disaster Recovery Scalability Resources

Quickly Turn ALB/ELB Status Codes into an Issue-Seeking Heatmap

Honeycomb

MAY 18, 2022

So you start digging through AWS logs to see what you can find, but it’s hard to reproduce. The example below uses an AWS account, ALB/ELB, S3, and a Lambda to send log data to Honeycomb. To get data into Honeycomb, begin by reviewing the following step-by-step AWS ALB documentation. What’s wrong? S3 Bucket Name.

Software Review

Software Review Lambda AWS Load Balancer

Implementing a Cost-aware Cloud Networking Infrastructure

Kentik

FEBRUARY 20, 2023

For example, a particular microservice might be hosted on AWS for better serverless performance but sends sampled data to a larger Azure data lake. Hybrid cloud networking Hybrid cloud networking refers specifically to the connectivity between two different types of cloud environments.

Network

Network Infrastructure Cloud Artificial Inteligence

What’s Free at Linux Academy — March 2019

Linux Academy

FEBRUARY 26, 2019

By the end of the course, you will have experienced configuring NGINX as a web server, reverse proxy, cache, and load balancer, while also having learned how to compile additional modules, tune for performance, and integrate with third-party tools like Let’s Encrypt. AWS Concepts — This course is for the absolute beginner.

Linux

Linux AWS Big Data Course

The History of Pets vs Cattle and How to Use the Analogy Properly

CloudScaling

SEPTEMBER 29, 2016

So this post aims to set the record straight and assure a canonical history that everyone can reference and use. Some time in 2011 or 2012 I was struggling with explaining to customers how AWS, cloud native apps, and cloud more generally was fundamentally different from what had gone before[1]. The History.

How To

How To Load Balancer Cloud Storage

Infinite Retention with OpenTelemetry and Honeycomb

Honeycomb

AUGUST 17, 2023

Screenshots How to achieve these amazing results for yourself The summary of steps: Create two AWS IAM roles, one for S3 writing, and one for Glue and Athena Create two AWS S3 Buckets, one for trace data, and one for Athena results Configure the awss3 exporter Create a Glue crawler Create an Athena view Search and enjoy!

AWS

AWS Software Review Compliance Data

Why enterprise CIOs need to plan for Microsoft gen AI

CIO

AUGUST 14, 2024

of the market according to IDC , Microsoft 2023 revenue from its AI platform services was more than double Google (5.3%) and AWS (5.1%) combined. Walker refers to “guided play sessions” and users were encouraged to share what worked with their peers. Although competitors have similar model gardens, at 13.8%

Enterprise

Enterprise Azure ChatGPT Open Source

How to Protect Your Azure App with a Web Application Firewall

Modus Create

JANUARY 20, 2021

Firewalls operate at the network layer 4 (transport layer – Reference: OSI Model ) and make processing decisions based on network addresses, ports, or protocols, which protect data transfer and network traffic, but not the application. The Difference Between a Firewall and a Web Application Firewall.

Firewall

Firewall Azure Applications Load Balancer

Tips for Better Cloud Expense Management

ParkMyCloud

JANUARY 21, 2020

Remember there are literally hundreds of IaaS and PaaS services offered in the public cloud — as of this blog writing AWS alone has 190+ cloud services. Infrastructure-as-a-service (IaaS) is a category that offers traditional IT services like compute, database, storage, network, load balancers, firewalls, etc.

Cloud

Cloud Disaster Recovery Load Balancer AWS

16 Tips to Manage Cloud Costs

ParkMyCloud

FEBRUARY 25, 2020

For example, on the issue of resource on/off scheduling, AWS, Azure, and Google Cloud each offer a tool. Another example is the AWS Compute Optimizer – a big name in promise, and certainly worth reviewing for AWS users. Here’s a guide to get a consolidated billing view in AWS. Use AWS’s New Savings Plans.

Cloud

Cloud Lambda AWS Azure

The Good and the Bad of Kubernetes Container Orchestration

Altexsoft

FEBRUARY 24, 2023

Kubernetes load balancer to optimize performance and improve app stability The goal of load balancing is to evenly distribute incoming traffic across machines, enabling an app to remain stable and easily handle a large number of client requests. But there are other pros worth mentioning.

Weak Development Team

Weak Development Team Load Balancer Technical Review Microservices

Are Cloud Serverless Functions Exposing Your Data?

Prisma Clud

JUNE 6, 2024

In this blog post, we'll examine the question of public access, focusing on the main offerings of the three leading cloud providers — AWS Lambda, Azure Functions and GCP Cloud Functions. Just need a quick reference? AWS Cheat Sheet: Is my Lambda exposed? Network in AWS The first aspect of public access is the network.

Serverless

Serverless Cloud Data Azure

Building Resilient Public Networking on AWS: Part 4

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

Webinars

Trending Sources

Build a multi-tenant generative AI environment for your enterprise on AWS

Webinars

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

Host concurrent LLMs with LoRAX

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Building Resilient Public Networking on AWS: Part 2

Can VPC Lattice replace AWS Transit Gateway?

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Build a custom UI for Amazon Q Business

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

Managing multiple environments in the AWS CDK using YAML configuration files

Deploy a Clojure web application to AWS using Terraform

Advanced RAG patterns on Amazon SageMaker

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

Build, test, and deploy a Go application to AWS ECS

Getting started with cross-region inference in Amazon Bedrock

Build generative AI chatbots using prompt engineering with Amazon Redshift and Amazon Bedrock

5 Best Practices for Optimizing PeopleSoft Performance on AWS

Unlock the power of structured data for enterprises using natural language with Amazon Q Business

Moving to the Cloud: Exploring the API Gateway to Success

Announcing Complete Azure Observability for Kentik Cloud

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

Announcing Honeycomb support for event ingestion with OTLP

AWS vs Google Cloud Pricing – A Comprehensive Look

Vitech uses Amazon Bedrock to revolutionize information access with AI-powered chatbot

SaaS Platfrom Development – How to Start

Apps in the Cloud: Deploying Your App Using Docker Containers and AWS

AWS vs Azure vs Google Cloud: What’s the best cloud platform?

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon

PeopleSoft on AWS: Understanding Design Methods and Scaling Functionality

Quickly Turn ALB/ELB Status Codes into an Issue-Seeking Heatmap

Implementing a Cost-aware Cloud Networking Infrastructure

What’s Free at Linux Academy — March 2019

The History of Pets vs Cattle and How to Use the Analogy Properly

Infinite Retention with OpenTelemetry and Honeycomb

Why enterprise CIOs need to plan for Microsoft gen AI

How to Protect Your Azure App with a Web Application Firewall

Tips for Better Cloud Expense Management

16 Tips to Manage Cloud Costs

The Good and the Bad of Kubernetes Container Orchestration

Are Cloud Serverless Functions Exposing Your Data?

Stay Connected