Architecture, Reference and Scalability

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Refer to Supported Regions and models for batch inference for the currently supported AWS Regions and models. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Select the created stack and choose Delete, as shown in the following screenshot.

Scalability

Scalability Lambda Generative AI AWS

The AI Future According to Google Cloud Next ’25: My Interesting Finds

Xebia

APRIL 17, 2025

Thinking refers to an internal reasoning process using the first output tokens, allowing it to solve more complex tasks. Native Multi-Agent Architecture: Build scalable applications by composing specialized agents in a hierarchy. Gemini 2.5 BigFrames 2.0 offers a scikit-learn-like API for ML.

Google Cloud

Google Cloud Artificial Inteligence Cloud Video

CIOs contend with gen AI growing pains

CIO

NOVEMBER 22, 2024

Unfortunately, despite hard-earned lessons around what works and what doesn’t, pressure-tested reference architectures for gen AI — what IT executives want most — remain few and far between, she said. “It’s time for them to actually relook at their existing enterprise architecture for data and AI,” Guan said. “A

Airlines

Airlines LAN Generative AI Travel


Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In today’s fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. Technology modernization strategy: Evaluate the overall IT landscape through the lens of enterprise architecture and assess IT applications through a 7R framework.

Cloud

Cloud Strategy Architecture Policies

12 AI predictions for 2025

CIO

DECEMBER 30, 2024

In these use cases, we have enough reference implementations to point to and say, ‘There’s value to be had here.’ We’ve seen so many reference implementations, and we’ve done so many reference implementations, that we’re going to see massive adoption. Now, it will evolve again, says Malhotra. Agents are the next phase, he says.

Fractional CTO

Fractional CTO Software Development CTO Coach Architecture

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. The solution incorporates the following key features: Using a Retrieval Augmented Generation (RAG) architecture, the system generates a context-aware detailed assessment.

Generative AI

Generative AI Technical Review Software Review Systems Review

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Software-as-a-service (SaaS) applications with tenant tiering: SaaS applications are often architected to provide different pricing and experiences to a spectrum of customer profiles, referred to as tiers. The user prompt is then routed to the LLM associated with the task category of the reference prompt that has the closest match.
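The routing step in this excerpt (match the user prompt to the closest reference prompt, then use the model tied to that prompt's task category) can be sketched roughly as follows. This is a minimal illustration, not the article's implementation: the reference prompts, category names, and model names are hypothetical, and a bag-of-words cosine similarity stands in for whatever embedding-based matching a real system would use.

```python
import math
from collections import Counter

# Hypothetical reference prompts, each labelled with a task category.
REFERENCE_PROMPTS = {
    "Summarize the following article in three sentences": "summarization",
    "Write a Python function that sorts a list": "code_generation",
    "Translate this sentence into French": "translation",
}

# Hypothetical mapping from task category to a model name.
CATEGORY_TO_MODEL = {
    "summarization": "small-fast-model",
    "code_generation": "code-tuned-model",
    "translation": "multilingual-model",
}


def vectorize(text):
    """Turn text into a word-count vector (assumed stand-in for an embedding)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def route(prompt):
    """Return (category, model) for the reference prompt closest to `prompt`."""
    query = vectorize(prompt)
    best_ref = max(
        REFERENCE_PROMPTS,
        key=lambda ref: cosine(query, vectorize(ref)),
    )
    category = REFERENCE_PROMPTS[best_ref]
    return category, CATEGORY_TO_MODEL[category]


if __name__ == "__main__":
    print(route("Please translate this sentence into German"))
```

Keeping the category-to-model mapping separate from the reference prompts means a tier or model can be swapped without touching the matching logic.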

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server. The following diagram illustrates this workflow.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Security Reference Architecture Summary for Cloudera Data Platform

Cloudera

JANUARY 21, 2022

This blog will summarise the security architecture of a CDP Private Cloud Base cluster. The architecture reflects the four pillars of security engineering best practice, Perimeter, Data, Access and Visibility. Security Architecture Improvements. Logical Architecture. Apache Atlas.

Architecture

Architecture Data Authentication Policies

A strategic approach to legacy platform modernization: Minimizing risk while maximizing value

CIO

MARCH 26, 2025

The meaning of legacy system modernization can be a bit challenging to pin down because IT leaders often use the term to refer to two fundamentally different processes. At Lemongrass, he is responsible for platform and enterprise architecture, product management capability and platform enablement of the delivery service team.

Backup

Backup Weak Development Team Applications System

How an architecture-led transformation puts the customer first

CIO

SEPTEMBER 14, 2023

With this in mind, we embarked on a digital transformation that enables us to better meet customer needs now and in the future by adopting a lightweight, microservices architecture. We found that being architecturally led elevates the customer and their needs so we can design the right solution for the right problem.

Architecture

Architecture Technical Review Microservices Engineering

Safeguard OT Environments with the Power of Precision AI

Palo Alto Networks

OCTOBER 21, 2024

Hollie Hennessy, Principal Analyst, Omdia Our remote access solution features a simple, browser-based architecture with an integrated jump server that reduces deployment complexity, making secure remote access management easier for both users and administrators. The PA-410R features a DIN-rail mount for easy installation in industrial setups.

Compliance

Compliance Virtualization Conference Generative AI

What is Private Cloud Architecture: Complete Overview

OTS Solutions

DECEMBER 4, 2023

Private cloud architecture is an increasingly popular approach to cloud computing that offers organizations greater control, security, and customization over their cloud infrastructure. What is Private Cloud Architecture? Why is Private Cloud Architecture important for Businesses?

Architecture

Architecture Cloud Disaster Recovery Scalability

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we evaluate different generative AI operating model architectures that could be adopted. Governance in the context of generative AI refers to the frameworks, policies, and processes that streamline the responsible development, deployment, and use of these technologies.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

AWS Machine Learning - AI

NOVEMBER 6, 2024

This solution can serve as a valuable reference for other organizations looking to scale their cloud governance and enable their CCoE teams to drive greater impact. Limited scalability – As the volume of requests increased, the CCoE team couldn’t disseminate updated directives quickly enough. About the Authors Steven Craig is a Sr.

Generative AI

Generative AI Government Technical Review Innovation

Analyzing the Limits of Connection Scalability in Postgres

The Citus Data

OCTOBER 8, 2020

team—where I work on open source Postgres—I have spent a lot of time analyzing and addressing some of the issues with connection scalability in Postgres. Followed by an analysis of the different limiting aspects to connection scalability in Postgres. Why connection scalability in Postgres is important. Memory usage.

Scalability

Scalability Software Review Systems Review Applications

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

By implementing this architectural pattern, organizations that use Google Workspace can empower their workforce to access groundbreaking AI solutions powered by Amazon Web Services (AWS) and make informed decisions without leaving their collaboration tool. In the following sections, we explain how to deploy this architecture.

Generative AI

Generative AI Lambda Applications AWS

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. 8B ) and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Mastering Multi-Cloud with Cloudera: Strategic Data & AI Deployments Across Clouds

Cloudera

JANUARY 7, 2025

While multi-cloud generally refers to the use of multiple cloud providers, hybrid encompasses both cloud and on-premises integrations, as well as multi-cloud setups. A leading meal kit provider migrated its data architecture to Cloudera on AWS, utilizing Cloudera’s Open Data Lakehouse capabilities.

Cloud

Cloud Data Scalability Compliance

The Importance of Assessing Interpersonal Skills in Recruitment

Hacker Earth Developers Blog

DECEMBER 4, 2024

Example: Ask a group of candidates to design an architecture for a scalable web application. Feedback and Reference checks Use references and peer feedback to validate interpersonal skills. Example questions for references: “Can you describe how they handled disagreements or conflicts within the team?” “How

Recruiting

Recruiting Technical Review Software Review Exercises

SAP publishes open source manifesto

CIO

JUNE 27, 2024

It arrives alongside the announcement of SAP’s Open Reference Architecture project as part of the EU’s IPCEI-CIS initiative. Organizations are choosing these platforms based on effective cost, performance, and scalability.”

Open Source

Open Source Architecture Linux Exercises

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

Shared components refer to the functionality and features shared by all tenants. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures. A centralized service that exposes APIs for common prompt-chaining architectures to your tenants can accelerate development.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Big Data Realtime Data Pipeline Architecture

Dzone - DevOps

JANUARY 23, 2024

It refers to the massive amount of structured and unstructured data that is too large to be handled by traditional database systems. To efficiently process and analyze this vast amount of data, organizations need a robust and scalable architecture. Big data has become increasingly important in today's data-driven world.

Big Data

Big Data Architecture Data Scalability

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Use Sidecar Pattern in AWS Lambda To Optimize Your Microservices Architecture

Dzone - DevOps

JUNE 21, 2023

Microservices architecture is becoming increasingly popular as it enables organizations to build complex, scalable applications by breaking them down into smaller, independent services. This approach offers several benefits, including improved modularity, scalability, and flexibility, as well as easier management and maintenance.

Microservices

Microservices Architecture Lambda AWS

Scalable training platform with Amazon SageMaker HyperPod for innovation: a video generation case study

AWS Machine Learning - AI

SEPTEMBER 26, 2024

To accelerate iteration and innovation in this field, sufficient computing resources and a scalable platform are essential. In this post, we share an ML infrastructure architecture that uses SageMaker HyperPod to support research team innovation in video generation.

Case Study

Case Study Video Training Scalability

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

This challenge is further compounded by concerns over scalability and cost-effectiveness. The following diagram is the solution architecture. For the full list of available kernels, refer to available Amazon SageMaker kernels. For more information, refer to Run container with base LLM.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in QuickSight

AWS Machine Learning - AI

MARCH 20, 2025

The Asure team was manually analyzing thousands of call transcripts to uncover themes and trends, a process that lacked scalability. Staying ahead in this competitive landscape demands agile, scalable, and intelligent solutions that can adapt to changing demands. Architecture: The following diagram illustrates the solution architecture.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock

AWS Machine Learning - AI

APRIL 23, 2024

As successful proof-of-concepts transition into production, organizations are increasingly in need of enterprise scalable solutions. However, to unlock the long-term success and viability of these AI-powered solutions, it is crucial to align them with well-established architectural principles.

Knowledge Base

Knowledge Base Scalability Applications Generative AI

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

Model Variants: The current DeepSeek model collection consists of the following models: DeepSeek-V3, an LLM that uses a Mixture-of-Experts (MoE) architecture. These models retain their existing architecture while gaining additional reasoning capabilities through a distillation process. deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock

AWS Machine Learning - AI

MARCH 26, 2025

Its improved architecture, based on the Multimodal Diffusion Transformer (MMDiT), combines multiple pre-trained text encoders for enhanced text understanding and uses QK-normalization to improve training stability. Finally, use the generated images as reference material for 3D artists to create fully realized game environments.

Generative AI

Generative AI Games Development AWS

Comparing API Architectural Styles: SOAP vs REST vs GraphQL vs RPC

Altexsoft

MAY 29, 2020

These specifications make up the API architecture. Over time, different API architectural styles have been released. A pool of choices raises endless debates as to which architectural style is best. RPC’s tight coupling makes scalability requirements and loosely coupled teams hard to achieve. Tedious message updating.

Architecture

Architecture Microservices Systems Review Weak Development Team

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

AWS Machine Learning - AI

APRIL 30, 2025

Similarly, when an incident occurs in IT, the responding team must provide a precise, documented history for future reference and troubleshooting. The following diagram illustrates the architecture using AWS services. In this post, we highlight how the AI-powered accounting transformation platform uses Amazon Bedrock.

Artificial Inteligence

Artificial Inteligence Technical Review Software Review Generative AI

Evolving django-multitenant to build scalable SaaS apps on Postgres & Citus

The Citus Data

MAY 9, 2023

In this blog post, we’ll dive deeper into the concept of multi-tenancy and explore how Django-multitenant can help you build scalable, secure, and maintainable multi-tenant applications on top of PostgreSQL and the Citus database extension. Distribute("Country", reference=True), tenant_migrations. What is multi-tenancy?

Scalability

Scalability Software Review Applications Open Source

The future of Cloud-native software development with Radius

Xebia

DECEMBER 16, 2024

Initially, our industry relied on monolithic architectures, where the entire application was a single, simple, cohesive unit. Ever increasing complexity: To overcome these limitations, we transitioned to Service-Oriented Architecture (SOA). Notice that the Application has a Connection that references the State Store.

Software Development

Software Development Software Cloud Development

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning - AI

MAY 2, 2025

The following diagram illustrates the solution architecture on AWS. Secure access using Route 53 and Amplify: The journey begins with the user accessing the WordFinder app through a domain managed by Amazon Route 53, a highly available and scalable cloud DNS web service.

Generative AI

Generative AI AWS Lambda Authentication

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Model customization refers to adapting a pre-trained language model to better fit specific tasks, domains, or datasets. The following diagram illustrates the solution architecture. For more information, refer to the following GitHub repo, which contains sample code.

Case Study

Case Study Artificial Inteligence Study Generative AI

Build scalable Low-Code backends with Booster

The Agile Monkey

DECEMBER 22, 2022

However, these tools may not be suitable for more complex data or situations requiring scalability and robust business logic. In short, Booster is a Low-Code TypeScript framework that allows you to quickly and easily create a backend application in the cloud that is highly efficient, scalable, and reliable. WTF is Booster?

Scalability

Scalability AWS Authentication Open Source

Monolithic vs. Microservice Architecture: Choosing the Right Path for Your Application

Perficient

AUGUST 10, 2023

Introduction: In the ever-evolving landscape of software development, choosing the right architectural approach is crucial for building robust and scalable applications. Two popular architectural styles that often come into consideration are Monolithic and Microservice.

Microservices

Microservices Architecture Applications Software Review

Astera Labs, a fabless chip startup, nabs $50M at a $950M valuation to remove bottlenecks in high-bandwidth cloud applications

TechCrunch

SEPTEMBER 27, 2021

As more enterprises migrate to cloud-based architectures, they are also taking on more applications (because they can) and, as a result of that, more complex workloads and storage needs. Machine learning and other artificial intelligence applications add even more complexity.

Artificial Inteligence

Artificial Inteligence Applications Cloud Artificial Intelligence

The half-life of code & the ship of Theseus

Erik Bernhardsson

DECEMBER 4, 2016

It refers to a philosophical paradox, where the pieces of a ship are replaced for hundreds of years. The drivers directory has by far the most number of files (22,091) followed by arch (17,967) which contains support for various architectures. Is the architecture basically not as “linear” and consistent? That’s a fair point.

Linux

Linux Open Source Scalability Architecture

Navigating the future of national tech independence with sovereign AI

CIO

MARCH 31, 2025

Sovereign AI refers to a national or regional effort to develop and control artificial intelligence (AI) systems, independent of the large non-EU foreign private tech platforms that currently dominate the field.

Technical Review

Technical Review Artificial Inteligence Compliance Open Source

McKinsey, eyeing the MLOps space, buys Tel Aviv-based Iguazio

TechCrunch

JANUARY 23, 2023

The consulting giant reportedly paid around $50 million for Iguazio, a Tel Aviv-based company offering an MLOps platform for large-scale businesses — “MLOps” referring to a set of tools to deploy and maintain machine learning models in production.

Artificial Inteligence

Artificial Inteligence Machine Learning Continuous Delivery ChatGPT

TOGAF: An Insight into the Enterprise Architecture Framework

Apiumhub

NOVEMBER 22, 2023

This is where TOGAF (the Open Group Architecture Framework) comes into play. It is an enterprise architecture framework that offers a systematic and comprehensive approach to achieving business transformation and sustainable success. It helps architects organize and document the architecture effectively. Benefits of TOGAF 1.

Architecture

Architecture Enterprise Case Study Agile
