When you use AWS, you can interact with it through the console, SDK, or CLI. They all use the same set of APIs to perform the actions requested by the user. In the past, I used a simple Python script to perform these API calls, but that always took some time and energy to build. How are these security controls blocking me?
Region evacuation with a static anycast IP approach: Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.
Use AWS Identity and Access Management (AWS IAM). You can compare your AWS account's root credentials to the root account of a Linux system. Using AWS IAM instead gives you the ability to follow the principle of least privilege. Use the credentials that you created at deployment time.
AWS EC2 Autoscaling is frequently regarded as the ideal solution for managing fluctuating workloads. Nevertheless, depending exclusively on EC2 Autoscaling can result in inefficiencies, overspending, and performance issues. Although Autoscaling is an effective tool, it does not serve as a one-size-fits-all remedy.
In this eBook, find out about the benefits and complexities of migrating workloads to AWS, and dive into services that AWS offers for containers and serverless computing. Find out the key performance metrics for each service to track in order to ensure workloads are operating efficiently.
Among the myriad of BI tools available, AWS QuickSight stands out as a scalable and cost-effective solution that allows users to create visualizations, perform ad-hoc analysis, and generate business insights from their data. AWS does not provide a comprehensive list of supported dataset types.
AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.
We're excited to announce the open source release of AWS MCP Servers for code assistants, a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.
Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. In contrast, more complex questions might require the application to summarize a lengthy dissertation by performing deeper analysis, comparison, and evaluation of the research results.
Yet keeping all the moving parts of cloud running right – especially in a fast-moving, competitive market – can cause conflict between technical and business objectives.
Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock, an AWS managed service to build and scale generative AI applications with foundation models (FMs). Authentication is performed against the Amazon Cognito user pool.
Amazon Web Services (AWS) has extended the reach of its generative artificial intelligence (AI) platform for application development to include a set of plug-in extensions that make it possible to launch natural language queries against data residing in platforms from Datadog and Wiz.
To that end, Kristen Backeberg, Director of Global ISV Partner Marketing at AWS, and Val Henderson, President and CRO at Caylent, recently sat down to discuss perhaps the most important consideration around adoption: how to tailor your generative AI strategy around clear goals that can drive your organization forward. Why did we do this?
With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.
Join us for this exclusive webinar with expert innovator Paul Weald to learn more about: How artificial intelligence technology can complement employee performance and optimize business performance with intelligent insights and analytics. Embrace automation, collaborate with new technology, and watch how you thrive!
However, companies are discovering that performing full fine-tuning for these models with their data isn't cost-effective. In addition to cost, performing fine-tuning for LLMs at scale presents significant technical challenges. To learn more about Trainium chips and the Neuron SDK, see Welcome to AWS Neuron.
It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach.
To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.
AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment. Adjust the following configuration to suit your needs, such as the Amazon EKS version, cluster name, and AWS Region.
I heard multiple times that AWS scans public GitHub repositories for AWS credentials and informs its users of the leaked credentials. Curious to see this for myself, I decided to intentionally leak AWS credentials to a public GitHub repository. Below you will find detailed information about every event.
At its annual re:Invent conference in Las Vegas, Amazon's AWS cloud arm today announced a major update to its S3 object storage service: AWS S3 Express One Zone, a new high-performance, low-latency tier for S3. The company promises that Express One Zone offers a 10x performance improvement over the standard S3 service.
Amazon Titan FMs provide customers with a breadth of high-performing image, multimodal, and text model choices, through a fully managed API. Run similarity search: Perform a similarity search on the vector database to find product images that closely match the search query embedding. Replace with the name of your S3 bucket.
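The similarity-search step can be sketched in plain Python. This is a minimal illustration of ranking catalog embeddings by cosine similarity against a query embedding; in the article's setup the vectors would come from an Amazon Titan multimodal model and live in a vector database, whereas here they are toy vectors.

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query, catalog, k=2):
    # catalog: list of (product_id, embedding) pairs; returns the k closest ids.
    ranked = sorted(catalog, key=lambda item: cosine(query, item[1]), reverse=True)
    return [pid for pid, _ in ranked[:k]]
```

A vector database performs the same ranking, just with an index instead of a full scan.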
During re:Invent 2023, we launched AWS HealthScribe, a HIPAA-eligible service that empowers healthcare software vendors to build clinical applications that use speech recognition and generative AI to automatically create preliminary clinician documentation. Speaker role identification (clinician or patient).
To that end, we’re collaborating with Amazon Web Services (AWS) to deliver a high-performance, energy-efficient, and cost-effective solution by supporting many data services on AWS Graviton. And AWS is a crucial ally for Cloudera in enabling companies to scale AI operations responsibly.
Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases, including AWS-specific knowledge search: with Amazon Q Business, we've made internal data sources as well as public AWS content available in Field Advisor's index.
This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. The post uses the Haiku model to receive answers to an array of questions because it's a performant, fast, and cost-effective option.
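The fan-out pattern described above can be sketched as follows. `ask_model` is a stand-in for a real Bedrock `InvokeModel` call (an assumption for illustration); Step Functions would do this parallelization with a Map state, but a thread pool shows the same idea.

```python
from concurrent.futures import ThreadPoolExecutor

def ask_model(question: str) -> str:
    # Stand-in for a Bedrock InvokeModel call against the Haiku model.
    return f"answer to: {question}"

def answer_all(questions, max_workers=4):
    # Fan the questions out concurrently; map preserves input order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(ask_model, questions))
```

With a Map state, each question becomes one parallel branch, and the state machine collects the results into an array just as `answer_all` does.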
Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high-performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances: in these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta's newest Llama 3.2. You will use an inf2.xlarge instance.
AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance import and export: enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.
Cost-performance optimizations via a new chip: One of the major updates announced last week was Google's seventh-generation Tensor Processing Unit (TPU) chip, Ironwood, targeted at accelerating AI workloads, especially inferencing.
AWS has released an important new feature, Resource Control Policies (RCPs), that allows you to apply permission boundaries around resources at scale. RCPs are a new feature in AWS Organizations that lets you restrict the permissions granted to resources. What are RCPs?
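As an illustration of what an RCP can look like, the following sketch builds a data-perimeter policy document in Python. The organization ID `o-exampleorgid` is a placeholder, and this particular policy shape is an assumption based on standard IAM policy syntax, not taken from the article; in practice you would attach the document through AWS Organizations.

```python
import json

# Hypothetical RCP: deny S3 access to principals outside your organization.
rcp = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Deny",
            "Principal": "*",
            "Action": "s3:*",
            "Resource": "*",
            "Condition": {
                # o-exampleorgid is a placeholder organization ID.
                "StringNotEqualsIfExists": {"aws:PrincipalOrgID": "o-exampleorgid"}
            },
        }
    ],
}

policy_json = json.dumps(rcp, indent=2)
```

Unlike SCPs, which bound what your principals can do, an RCP of this kind bounds who can act on your resources.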
Over the past two years, the US government has tightened regulations that prevent top US AI chip designers, such as Nvidia and AMD, from selling their high-performance AI chips to China, aiming to curb their military’s technological advancements. A query seeking comments from the US Department of Commerce remains unanswered.
In this post, we explore advanced prompt engineering techniques that can enhance the performance of these models and facilitate the creation of compelling imagery through text-to-image transformations. This post provides practical tips and techniques to optimize performance and elevate the creative possibilities within Stable Diffusion 3.5.
AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.
SAP is expanding its AI ecosystem with a partnership with AWS. The cloud hyperscalers AWS, Google and Microsoft are also important platform partners to operate SAP’s cloud applications.
Amazon Bedrock's cross-Region inference capability provides organizations with the flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.
Organizations can now label all Amazon Bedrock models with AWS cost allocation tags, aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.
Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric, which is available across AWS’ US East (N.
Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone. Amazon Linux 2).
Refer to Supported Regions and models for batch inference for the currently supported AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.
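The DynamoDB-backed queueing pattern can be sketched without any AWS calls. Here a plain dict stands in for the DynamoDB table, the quota value is a made-up assumption, and a real implementation would have the Lambda function start a Bedrock batch inference job where the comment indicates.

```python
MAX_IN_FLIGHT = 2  # assumed concurrent-batch-job quota, for illustration only

def submit_ready_jobs(table):
    # table maps job_id -> {"status": "QUEUED" | "IN_PROGRESS" | "DONE"}.
    in_flight = sum(1 for rec in table.values() if rec["status"] == "IN_PROGRESS")
    for job_id in sorted(table):
        if in_flight >= MAX_IN_FLIGHT:
            break
        if table[job_id]["status"] == "QUEUED":
            # Real code would start the Bedrock batch job here, then record it.
            table[job_id]["status"] = "IN_PROGRESS"
            in_flight += 1
    return table
```

Each time a job finishes, the same function runs again and promotes the next queued job, keeping the service quota saturated without exceeding it.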
Historically, cloud migration usually meant moving on-premises workloads to a public cloud, like Amazon Web Services (AWS) or Microsoft Azure. And few guides to cloud migration offer best practices on how to perform a cloud-to-cloud migration. These are both managed NoSQL databases on Azure and AWS, respectively.
The agents also automatically call APIs to perform actions and access knowledge bases to provide additional information. Developer tools: the solution also uses AWS Powertools for Lambda, a suite of utilities for Lambda functions that generates OpenAPI schemas from your Lambda function code.
In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline. Fine-tuning is one such technique, which helps in injecting task-specific or domain-specific knowledge for improving model performance.
But once you have set it up, you can run experiments very easily. My landing zone: I used the Customizations for AWS Control Tower (CfCt) project. Service Catalog – used to host all my AWS Service Catalog products used within my landing zone.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies such as AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.