AWS, Development and Scalability

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for current supporting AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

However, as exciting as these advancements are, data scientists often face challenges when it comes to developing UIs and to prototyping and interacting with their business users. With Streamlit, you can quickly build and iterate on your application without the need for extensive frontend development experience.

Generative AI

Generative AI AWS Artificial Intelligence Applications

Introducing AWS MCP Servers for code assistants (Part 1)

AWS Machine Learning - AI

APRIL 1, 2025

We’re excited to announce the open source release of AWS MCP Servers for code assistants, a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.

AWS

AWS Software Review Knowledge Base Artificial Intelligence

Implementing a Version Control System for AWS QuickSight

Xebia

OCTOBER 24, 2024

Among the myriad BI tools available, AWS QuickSight stands out as a scalable and cost-effective solution that allows users to create visualizations, perform ad-hoc analysis, and generate business insights from their data. We have developed three separate modules: dashboard, dataset, and role_custom_permission.

AWS

AWS Systems Review System Azure

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This scalability allows for more frequent and comprehensive reviews.

Generative AI

Generative AI Technical Review Software Review Systems Review

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Adding a new task would necessitate the development of a new UI component in addition to the selection and integration of a new model. Semantic routing offers several advantages, such as efficiency gained through fast similarity search in vector databases, and scalability to accommodate a large number of task categories and downstream LLMs.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Applications

Discover, Protect and Respond with AWS and Prisma Cloud

Prisma Cloud

NOVEMBER 22, 2024

Organizations are increasingly turning to cloud providers, like Amazon Web Services (AWS), to address these challenges and power their digital transformation initiatives. However, the vastness of AWS environments and the ease of spinning up new resources and services can lead to cloud sprawl and ongoing security risks.

AWS

AWS Cloud Network Compliance

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal sought to develop natural language processing (NLP) and question-answering capabilities to accurately query and summarize this unstructured data at scale.

Generative AI

Generative AI AWS Groups Artificial Intelligence

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. Responsible AI components promote the safe and responsible development of AI across tenants. You can use AWS services such as Application Load Balancer to implement this approach.

Generative AI

Generative AI AWS Enterprise Artificial Intelligence

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning - AI

NOVEMBER 26, 2024

There is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost framework to run LLMs efficiently in a containerized environment.

AWS

AWS Load Balancer Software Review Artificial Intelligence

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

AWS App Studio is a generative AI-powered service that uses natural language to build business applications, empowering a new set of builders to create applications in minutes. Cross-instance Import and Export: Enabling straightforward and self-service migration of App Studio applications across AWS Regions and AWS accounts.

AWS

AWS Software Review Technical Review Generative AI

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

AWS Machine Learning - AI

NOVEMBER 22, 2024

This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions.

Generative AI

Generative AI AWS Technical Review Backup

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning - AI

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search. With Amazon Q Business, we’ve made internal data sources as well as public AWS content available in Field Advisor’s index.

AWS

AWS Generative AI Technical Review Artificial Intelligence

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

Add to this the escalating costs of maintaining legacy systems, which often act as bottlenecks for scalability. The latter option had emerged as a compelling solution, offering the promise of enhanced agility, reduced operational costs, and seamless scalability. Scalability. Developer productivity. Cost forecasting.

Serverless

Serverless Architecture Microservices Scalability

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

Cloud computing. Average salary: $124,796. Expertise premium: $15,051 (11%). Cloud computing has been a top priority for businesses in recent years, with organizations moving storage and other IT operations to cloud data storage platforms such as AWS. It’s designed to achieve complex results, with a low learning curve for beginners and new users.

Artificial Intelligence

Artificial Intelligence DevOps Virtualization Industry

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

In the competitive world of game development, staying ahead of technological advancements is crucial. This shift towards AI-assisted content creation in gaming promises to open up new realms of possibilities for both developers and players alike. Use the us-west-2 AWS Region to run this demo. Large (SD3.5

Generative AI

Generative AI Games Development AWS

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Without a scalable approach to controlling costs, organizations risk unbudgeted usage and cost overruns. Organizations can now label all Amazon Bedrock models with AWS cost allocation tags, aligning usage to specific organizational taxonomies such as cost centers, business units, and applications.

Generative AI

Generative AI AWS Artificial Intelligence Budget

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

Artificial Intelligence

Artificial Intelligence AWS Generative AI

The future of Cloud-native software development with Radius

Xebia

DECEMBER 16, 2024

The rise of platform engineering: Over the years, the process of software development has changed a lot. This approach made the development process straightforward initially, but as applications grew in complexity, maintaining and scaling them became increasingly challenging.

Software Development

Software Development Software Cloud Development

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. It provides developers and organizations access to an extensive catalog of over 100 popular, emerging, and specialized FMs, complementing the existing selection of industry-leading models in Amazon Bedrock.

Artificial Intelligence

Artificial Intelligence Microservices Generative AI AWS

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

CIO

APRIL 15, 2025

Solving the agentic DevOps problem with open frameworks: Last week also saw Google announcing new open frameworks, the Agent Development Kit (ADK) and the Agent2Agent (A2A) protocol, to help enterprises build, manage, and connect multiple agents, even across different ecosystems.

Cloud

Cloud Innovation Artificial Intelligence Google Cloud

Unlocking the Power of Serverless AI/ML on AWS: Expert Strategies for Scalable and Secure Applications

Dzone - DevOps

APRIL 9, 2025

Amazon Web Services (AWS) provides an expansive suite of tools to help developers build and manage serverless applications with ease. By abstracting the complexities of infrastructure, AWS enables teams to focus on innovation. Why Combine AI, ML, and Serverless Computing?

Serverless

Serverless Artificial Intelligence Scalability AWS

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

The challenge: Enabling self-service cloud governance at scale. Hearst undertook a comprehensive governance transformation for their Amazon Web Services (AWS) infrastructure. The CCoE implemented AWS Organizations across a substantial number of business units.

Generative AI

Generative AI Government Technical Review Innovation

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

In the rapidly evolving world of generative AI image modeling, prompt engineering has become a crucial skill for developers, designers, and content creators. He is passionate about creating accessible resources for people to learn and develop proficiency with AI.

Engineering

Engineering AWS 3D Generative AI

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

AWS Machine Learning - AI

APRIL 29, 2025

At Data Reply and AWS, we are committed to helping organizations embrace the transformative opportunities generative AI presents, while fostering the safe, responsible, and trustworthy development of AI systems. This practice helps develop AI systems that are functional, safe, and trustworthy.

Generative AI

Generative AI Weak Development Team AWS Artificial Intelligence

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXL’s recent virtual event, AI in Action: Driving the Shift to Scalable AI. And its modular architecture distributes tasks across multiple agents in parallel, increasing the speed and scalability of migrations.

Artificial Intelligence

Artificial Intelligence Enterprise Insurance

Marsh McLennan IT reorg lays foundation for gen AI

CIO

NOVEMBER 1, 2024

As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider. The biggest challenge is data.

Generative AI

Generative AI Technical Advisors Insurance Weak Development Team

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.

Generative AI

Generative AI Lambda Applications AWS

Getting started with computer use in Amazon Bedrock Agents

AWS Machine Learning - AI

MARCH 14, 2025

Traditional automation approaches require custom API integrations for each application, creating significant development overhead. Rather than build custom integrations for each system, developers can now create agents that perceive and interact with existing interfaces in a managed, secure way. AWS CDK CLI, follow instructions here.

AWS

AWS Generative AI Linux Groups

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In today’s fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. First, cloud provisioning through automation is better in AWS CloudFormation and Azure Resource Manager compared to the other cloud providers.

Cloud

Cloud Strategy Architecture Policies

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Intelligence

Vibe Coding: Shaping the Future of Software

Hacker Earth Developers Blog

APRIL 16, 2025

In the past, being able to produce functional code was a strong advantage for developers. This development does not only increase speed but also changes how we approach problem solving. It is important for us to rethink our role as developers and focus on architecture and system design rather than simply on typing code.

Software

Software Architecture System Design System Architecture

LambdaTest raises $45 million to build ‘AWS for testers’

TechCrunch

MARCH 29, 2022

Web developers have to run hundreds of tasks and they are able to do so on their own machines. But when a developer firm — at a scale — has to perform similar activities, they don’t often have — and in many cases, want to have — the required computing power at their disposal to run such tasks locally.

AWS

AWS Testing Web Development Operating System

Enhance customer support with Amazon Bedrock Agents by integrating enterprise data APIs

AWS Machine Learning - AI

NOVEMBER 7, 2024

Developer tools The solution also uses the following developer tools: AWS Powertools for Lambda – This is a suite of utilities for Lambda functions that generates OpenAPI schemas from your Lambda function code. After deployment, the AWS CDK CLI will output the web application URL. Python 3.9 or later Node.js

Lambda

Lambda Enterprise Automotive Knowledge Base

OpsLevel raises $15M to help developers manage their microservices

TechCrunch

MARCH 1, 2022

OpsLevel, a startup that helps development teams organize and track their microservices in a centralized developer portal, today announced that it has raised a $15 million Series A funding round. The company plans to use the new funding to expand its engineering team in order to speed up its product development efforts.

Microservices

Microservices Development DevOps AWS

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

As businesses and developers increasingly seek to optimize their language models for specific tasks, the decision between model customization and Retrieval Augmented Generation (RAG) becomes critical. Our study used Amazon Nova Micro and Amazon Nova Lite as baseline FMs and tested their performance across different configurations.

Case Study

Case Study Artificial Intelligence Study Generative AI

Best Practices for IaC using AWS CloudFormation

Perficient

MARCH 11, 2025

IaC enables developers to define infrastructure configurations using code, ensuring consistency, automation, and scalability. AWS CloudFormation, a key service in the AWS ecosystem, simplifies IaC by allowing users to easily model and set up AWS resources. Why Use AWS CloudFormation? Example: 3.

AWS

AWS Software Review Systems Review Policies

Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Development Support Program

AWS Machine Learning - AI

JULY 31, 2024

Amazon Web Services (AWS) is committed to supporting the development of cutting-edge generative artificial intelligence (AI) technologies by companies and organizations across the globe. Let’s dive in and explore how these organizations are transforming what’s possible with generative AI on AWS.

Artificial Intelligence

Artificial Intelligence AWS Programming Innovation

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning - AI

MARCH 27, 2025

Amazon Bedrock offers a cross-Region inference capability that gives organizations the flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

Artificial Intelligence

Artificial Intelligence AWS Technical Review Policies

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

As DPG Media grows, they need a more scalable way of capturing metadata that enhances the consumer experience on online video services and aids in understanding key content characteristics. Irina Radu is a Prototyping Engagement Manager, part of AWS EMEA Prototyping and Cloud Engineering.

Media

Media Video Artificial Intelligence Generative AI

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and reduce operational inefficiencies. Consequently, MaestroQA had to develop a solution capable of scaling to meet their clients’ extensive needs.

Generative AI

Generative AI CTO Coach AWS Artificial Intelligence

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

By providing high-quality, openly available models, the AI community fosters rapid iteration, knowledge sharing, and cost-effective solutions that benefit both developers and end-users. This serverless approach eliminates the need for infrastructure management while providing enterprise-grade security and scalability.

Generative AI

Generative AI Artificial Intelligence AWS Technical Review

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. This feature allows you to separate data into logical partitions, making it easier to analyze and process data later.

Generative AI

Generative AI Applications AWS Knowledge Base
