Technology: The workloads a system supports when training models differ from those in the implementation phase. To succeed, Operational AI requires a modern data architecture. Ensuring effective and secure AI implementations demands continuous adaptation and investment in robust, scalable data infrastructures.
Jenga builder: Enterprise architects piece together both reusable and replaceable components and solutions, enabling responsive (adaptable, resilient) architectures that accelerate time-to-market without disrupting other components or the overall architecture (e.g., by compromising quality, structure, integrity, or goals).
You pull an open-source large language model (LLM) to train on your corporate data so that the marketing team can build better assets, and the customer service team can provide customer-facing chatbots. You export, move, and centralize your data for training purposes with all the associated time and capacity inefficiencies that entails.
This is where Delta Lakehouse architecture truly shines. Approach (Sid Dixit): Implementing lakehouse architecture is a three-phase journey, with each stage demanding dedicated focus and independent treatment. Step 2: Transformation (using ELT and Medallion Architecture). Bronze layer: Keep it raw.
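To make the Bronze step concrete, here is a minimal sketch of a Bronze-layer ingest in PySpark writing to Delta tables; the S3 paths and column names are illustrative assumptions, not taken from the article.

```python
# Minimal sketch of a Bronze-layer ingest step (paths and names are assumed).
# Bronze keeps the data raw: land it as-is and add only lineage metadata.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("bronze-ingest").getOrCreate()

raw = (
    spark.read.format("json")
    .load("s3://example-landing-zone/raw_events/")  # hypothetical source path
)

bronze = (
    raw.withColumn("_ingested_at", F.current_timestamp())
       .withColumn("_source_file", F.input_file_name())
)

# Delta format provides the ACID lakehouse tables the Medallion pattern relies on.
bronze.write.format("delta").mode("append").save("s3://example-lakehouse/bronze/events/")
```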
In today’s fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. Technology modernization strategy: Evaluate the overall IT landscape through the lens of enterprise architecture and assess IT applications through a 7R framework.
Unfortunately, despite hard-earned lessons around what works and what doesn’t, pressure-tested reference architectures for gen AI — what IT executives want most — remain few and far between, she said. “It’s time for them to take another look at their existing enterprise architecture for data and AI,” Guan said.
Global IT spending is expected to soar in 2025, gaining 9% according to recent estimates. This surge is driven by the rapid expansion of cloud computing and artificial intelligence, both of which are reshaping industries and enabling unprecedented scalability and innovation. The result was a compromised availability architecture.
Plus, they can be more easily trained on a company’s own data, so Upwork is starting to embrace this shift, training its own small language models on more than 20 years of interactions and behaviors on its platform. Agents can be more loosely coupled than services, making these architectures more flexible, resilient, and smart.
AI-powered threat detection systems will play a vital role in identifying and mitigating risks in real time, while zero-trust architectures will become the norm to ensure stringent access controls. Organizations will also prioritize workforce training and cybersecurity awareness to mitigate risks and build a resilient digital ecosystem.
And third, systems consolidation and modernization focuses on building a cloud-based, scalable infrastructure for integration speed, security, flexibility, and growth. To drive democratization, we follow ECTERS, which is educate, coach, train the trainer, empower, reinforce, and support, which helps nurture and embed internal AI talent.
To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. The solution incorporates the following key features: Using a Retrieval Augmented Generation (RAG) architecture, the system generates a context-aware detailed assessment.
Scalable infrastructure – Bedrock Marketplace offers configurable scalability through managed endpoints, allowing organizations to select their desired number of instances, choose appropriate instance types, define custom auto scaling policies that dynamically adjust to workload demands, and optimize costs while maintaining performance.
Trained on Amazon SageMaker HyperPod, Dream Machine excels in creating consistent characters, smooth motion, and dynamic camera movements. To accelerate iteration and innovation in this field, sufficient computing resources and a scalable platform are essential.
As part of MMTech’s unifying strategy, Beswick chose to retire the data centers and form an “enterprisewide architecture organization” with a set of standards and base layers to develop applications and workloads that would run on the cloud, with AWS as the firm’s primary cloud provider. The biggest challenge is data.
We will deep dive into the MCP architecture later in this post. For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server. The following diagram illustrates this workflow.
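For a sense of what hosting a tool server looks like, below is a minimal sketch of an MCP server built with the open-source MCP Python SDK's FastMCP helper; the server name and the tool it exposes are hypothetical, and the SDK API should be verified against the current release.

```python
# Minimal sketch of an MCP server exposing one tool, using the MCP Python SDK
# (FastMCP helper). Server name and tool are hypothetical placeholders.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("ticket-lookup")  # hypothetical server name

@mcp.tool()
def lookup_ticket(ticket_id: str) -> str:
    """Return the status of a support ticket (stubbed for illustration)."""
    # In a real deployment this would query an internal system of record.
    return f"Ticket {ticket_id}: open"

if __name__ == "__main__":
    # The LLM host connects to this process and invokes the tool on demand.
    mcp.run()
```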
This isn’t merely about hiring more salespeople; it’s about creating scalable systems that efficiently convert prospects into customers. This requires specific approaches to product development, architecture, and delivery processes. Explore strategies for scaling your digital product with continuous delivery.
There are two main considerations associated with the fundamentals of sovereign AI: 1) Control of the algorithms and the data on the basis of which the AI is trained and developed; and 2) the sovereignty of the infrastructure on which the AI resides and operates.
Tuning model architecture requires technical expertise, training and fine-tuning of parameters, and management of distributed training infrastructure, among other things. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.
This post discusses agentic AI-driven architecture and ways of implementing it. Agentic AI architecture is a shift in process automation: autonomous agents draw on the capabilities of AI to imitate cognitive abilities and enhance the actions of traditional autonomous agents.
In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. The distilled variants include DeepSeek-R1-Distill-Llama-8B and DeepSeek-R1-Distill-Llama-70B (from base model Llama-3.3-70B-Instruct).
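As a rough illustration of the import step, the sketch below starts a Custom Model Import job through boto3's Bedrock client. The bucket, role ARN, and names are placeholders, and the call signature should be verified against current AWS documentation before use.

```python
# Hedged sketch of kicking off a Bedrock Custom Model Import job with boto3.
# All names, ARNs, and bucket paths are illustrative placeholders.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

response = bedrock.create_model_import_job(
    jobName="deepseek-r1-distill-import",                          # hypothetical job name
    importedModelName="deepseek-r1-distill-llama-8b",              # hypothetical model name
    roleArn="arn:aws:iam::123456789012:role/BedrockImportRole",    # placeholder role
    modelDataSource={
        "s3DataSource": {"s3Uri": "s3://example-bucket/deepseek-r1-distill-llama-8b/"}
    },
)
print(response["jobArn"])
```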
Leveraging Cloudera’s hybrid architecture, the organization optimized operational efficiency for diverse workloads, providing secure and compliant operations across jurisdictions while improving response times for public health initiatives. Scalability: Choose platforms that can dynamically scale to meet fluctuating workload demands.
And data.world, a company that we are particularly interested in because of its knowledge graph architecture. By boosting productivity and fostering innovation, human-AI collaboration will reshape workplaces, making operations more efficient, scalable, and adaptable.
By taking EXL’s expertise in helping enterprises design both legacy and modern architectures and building it into these agents, the tool tackles every migration task with greater accuracy and efficiency: Business Analyst: Code explanation, documentation, pseudo code.
LLMs, or large language models, are deep learning models trained on vast amounts of linguistic data so they can understand and respond in natural language (human-like text). The inner transformer architecture comprises a stack of neural networks in the form of an encoder and a decoder.
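A minimal sketch of that encoder-decoder layout, using PyTorch's built-in Transformer module; the vocabulary size, dimensions, and token ids are illustrative, not drawn from the article.

```python
# Toy encoder-decoder Transformer: the encoder reads the source sequence and
# the decoder attends to it while producing the target sequence.
import torch
import torch.nn as nn

vocab_size, d_model = 32_000, 512
embed = nn.Embedding(vocab_size, d_model)
transformer = nn.Transformer(
    d_model=d_model, nhead=8,
    num_encoder_layers=6, num_decoder_layers=6,
    batch_first=True,
)
lm_head = nn.Linear(d_model, vocab_size)

src = torch.randint(0, vocab_size, (2, 16))   # source token ids (batch of 2)
tgt = torch.randint(0, vocab_size, (2, 16))   # target token ids

hidden = transformer(embed(src), embed(tgt))  # (batch, tgt_len, d_model)
logits = lm_head(hidden)                      # (batch, tgt_len, vocab_size)
print(logits.shape)
```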
The Education and Training Quality Authority (BQA) plays a critical role in improving the quality of education and training services in the Kingdom Bahrain. BQA oversees a comprehensive quality assurance process, which includes setting performance standards and conducting objective reviews of education and training institutions.
In this post, we evaluate different generative AI operating model architectures that could be adopted. Generative AI architecture components: Before diving deeper into the common operating model patterns, this section provides a brief overview of a few components and AWS services used in the featured architectures.
The Cloudera AI Inference service is a highly scalable, secure, and high-performance deployment environment for serving production AI models and related applications. Services like Hugging Face and the ONNX Model Zoo made it easy to access a wide range of pre-trained models. What is the Cloudera AI Inference service?
Demystifying RAG and model customization: RAG is a technique to enhance the capability of pre-trained models by allowing the model access to external domain-specific data sources. Unlike fine-tuning, in RAG the model doesn’t undergo any training and the model weights aren’t updated to learn the domain knowledge.
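A toy sketch of that RAG flow: retrieve relevant passages and prepend them to the prompt while the model stays frozen. The TF-IDF retriever, documents, and query are invented for illustration; production systems typically use embeddings and a vector database.

```python
# Toy RAG flow: retrieve context, build an augmented prompt, no weight updates.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Support hours are 9am to 5pm, Monday through Friday.",
]

vectorizer = TfidfVectorizer().fit(documents)
doc_vectors = vectorizer.transform(documents)

def retrieve(query: str, k: int = 1) -> list[str]:
    # Rank documents by cosine similarity to the query and return the top k.
    scores = cosine_similarity(vectorizer.transform([query]), doc_vectors)[0]
    return [documents[i] for i in scores.argsort()[::-1][:k]]

query = "How long do customers have to return an item?"
context = "\n".join(retrieve(query))

# Domain knowledge arrives via the prompt; the model's weights never change.
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
print(prompt)
```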
We walk through the key components and services needed to build the end-to-end architecture, offering example code snippets and explanations for each critical element that help achieve the core functionality. Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability.
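A brief sketch of reading and writing DynamoDB items with boto3; the table name and key schema are assumptions for illustration, not the referenced architecture's actual design.

```python
# Minimal DynamoDB read/write with boto3 (table and keys are hypothetical).
import boto3

dynamodb = boto3.resource("dynamodb", region_name="us-east-1")
table = dynamodb.Table("ConversationHistory")  # assumed table name

# Write one item; DynamoDB handles scaling without server management.
table.put_item(Item={"session_id": "abc-123", "turn": 1, "message": "Hello"})

# Read it back by primary key (assumes a composite key of session_id + turn).
resp = table.get_item(Key={"session_id": "abc-123", "turn": 1})
print(resp.get("Item"))
```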
The Pro tier, however, would require a highly customized LLM that has been trained on specific data and terminology, enabling it to assist with intricate tasks like drafting complex legal documents. This hybrid approach combines the scalability and flexibility of semantic search with the precision and context-awareness of classifier LLMs.
This challenge is further compounded by concerns over scalability and cost-effectiveness. LoRA is a technique for efficiently adapting large pre-trained language models to new tasks or domains by introducing small trainable weight matrices, called adapters, within each linear layer of the pre-trained model.
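A minimal sketch of the idea behind LoRA: freeze the pre-trained linear layer and learn only two small rank-r matrices whose product forms the task-specific update. Dimensions, rank, and scaling here are illustrative choices, not values from the article.

```python
# LoRA adapter sketch: frozen base weight W plus a low-rank trainable update B·A.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False            # freeze the pre-trained weights
        self.lora_a = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Only lora_a and lora_b receive gradients during adaptation.
        return self.base(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scaling

layer = LoRALinear(nn.Linear(768, 768))
out = layer(torch.randn(4, 768))
print(out.shape)  # torch.Size([4, 768])
```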
This advancement makes sophisticated agent architectures more accessible and economically viable across a broader range of applications and scales of deployment. Amazon Bedrock provides two primary methods for preparing your training data: uploading JSONL files to Amazon S3 or using historical invocation logs.
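As a rough sketch of the JSONL path, the snippet below writes prompt/completion pairs to a file and uploads it to Amazon S3 with boto3. The field names and bucket are assumptions; the exact schema depends on the model being customized, so check the relevant model documentation.

```python
# Hedged sketch: prepare a JSONL training file and upload it to S3.
import json
import boto3

examples = [
    {"prompt": "Summarize: the quarterly report shows ...", "completion": "Revenue grew ..."},
    {"prompt": "Classify the sentiment: great product!", "completion": "positive"},
]

with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")   # one JSON object per line

s3 = boto3.client("s3")
# Bucket and key are placeholders for illustration.
s3.upload_file("train.jsonl", "example-training-bucket", "datasets/train.jsonl")
```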
The flexible, scalable nature of AWS services makes it straightforward to continually refine the platform through improvements to the machine learning models and addition of new features. The first round of testers needed more training on fine-tuning the prompts to improve returned results.
Koletzki would use the move to upgrade the IT environment from a small data room to something more scalable. He knew that scalability was a big win for a company in aggressive growth mode, but he just needed to be persuaded that the platforms were more robust, and the financials made sense. I just subscribed to their service.
Furthermore, it was difficult to transfer innovations from one model to another, given that most are independently trained despite using common data sources. This scenario underscored the need for a new recommender system architecture where member preference learning is centralized, enhancing accessibility and utility across different models.
What used to be bespoke and complex enterprise data integration has evolved into a modern data architecture that orchestrates all the disparate data sources intelligently and securely, even in a self-service manner: a data fabric. Data fabrics are one of the more mature modern data architectures. Next steps.
Amazon Bedrock’s broad choice of FMs from leading AI companies, along with its scalability and security features, made it an ideal solution for MaestroQA. MaestroQA integrated Amazon Bedrock into their existing architecture using Amazon Elastic Container Service (Amazon ECS).
“[W]e’re training a neural network to use every software tool in the world, building on the vast amount of existing capabilities that people have already created.” Vaswani and Parmar helped to pioneer the Transformer, an AI architecture that has gained considerable attention within the last several years.
Those highly scalable platforms are typically designed to optimize developer productivity, leverage economies of scale to lower costs, improve reliability, and accelerate software delivery. They may also ensure consistency in terms of processes, architecture, security, and technical governance.
The Salesforce Sharing and Visibility Architect certification is designed for architects, analysts, and administrators with the knowledge and skills to design secure, scalable security models on Force.com. The certification emphasizes testing, governance, and integration with external systems within an organization’s infrastructure.
In this article, we'll walk through the process of creating and deploying a real-time AI-powered chatbot using serverless architecture. This approach not only streamlines development but also ensures scalability and cost-efficiency. Overview of the project: We'll be building a simple chatbot that interacts with users in real time.
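A minimal sketch of the serverless entry point for such a chatbot: an AWS Lambda handler (typically placed behind API Gateway) with the model call stubbed out. The event shape assumes a JSON body containing a "message" field, which is an assumption for illustration.

```python
# Serverless chatbot handler sketch; the "model" is stubbed for illustration.
import json

def lambda_handler(event, context):
    body = json.loads(event.get("body") or "{}")
    user_message = body.get("message", "")

    # Placeholder reply; swap in a real LLM inference call here.
    reply = f"You said: {user_message}"

    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"reply": reply}),
    }
```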
DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.
This post is co-written with Less Wright and Wei Feng from Meta. Pre-training large language models (LLMs) is the first step in developing powerful AI systems that can understand and generate human-like text. Introduction to torchtitan: torchtitan is a reference architecture for large-scale LLM training using native PyTorch.