Architecture and Artificial Inteligence

Architecture

Artificial Inteligence

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

We will deep dive into the MCP architecture later in this post. For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Agentic AI design: An architectural case study

CIO

NOVEMBER 19, 2024

From obscurity to ubiquity, the rise of large language models (LLMs) is a testament to rapid technological advancement. Just a few short years ago, models like GPT-1 (2018) and GPT-2 (2019) barely registered a blip on anyone’s tech radar. If the LLM didn’t create enough output, the agent would need to run again.

Case Study

Case Study Artificial Inteligence Study Architecture

Join 49,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

The key to operational AI: Modern data architecture

CIO

NOVEMBER 27, 2024

Recent research shows that 67% of enterprises are using generative AI to create new content and data based on learned patterns; 50% are using predictive AI, which employs machine learning (ML) algorithms to forecast future events; and 45% are using deep learning, a subset of ML that powers both generative and predictive models.

Architecture

Architecture Artificial Inteligence Data Development Team Review

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Have we reached the end of ‘too expensive’ for enterprise software?

CIO

JANUARY 9, 2025

Generative artificial intelligence ( genAI ) and in particular large language models ( LLMs ) are changing the way companies develop and deliver software. These autoregressive models can ultimately process anything that can be easily broken down into tokens: image, video, sound and even proteins.

Artificial Inteligence

Artificial Inteligence Software Review Software Enterprise

Embedding BI: Architectural Considerations and Technical Requirements

While data platforms, artificial intelligence (AI), machine learning (ML), and programming platforms have evolved to leverage big data and streaming data, the front-end user experience has not kept up. Traditional Business Intelligence (BI) aren’t built for modern data platforms and don’t work on modern architectures.

Architecture

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

What is data architecture? A framework to manage data

CIO

DECEMBER 20, 2024

Data architecture definition Data architecture describes the structure of an organizations logical and physical data assets, and data management resources, according to The Open Group Architecture Framework (TOGAF). An organizations data architecture is the purview of data architects. Ensure security and access controls.

Architecture

Architecture Data Fractional CTO Technical Review

How AI orchestration has become more important than the models themselves

CIO

DECEMBER 10, 2024

Large language models (LLMs) just keep getting better. In just about two years since OpenAI jolted the news cycle with the introduction of ChatGPT, weve already seen the launch and subsequent upgrades of dozens of competing models. From Llama3.1 to Gemini to Claude3.5 In fact, business spending on AI rose to $13.8

Artificial Inteligence

Artificial Inteligence Off-The-Shelf Insurance Analytics

Top 11 LLM Tools That Ensure Smooth LLM Operations

Openxcell

JANUARY 20, 2025

LLM or large language models are deep learning models trained on vast amounts of linguistic data so they understand and respond in natural language (human-like texts). The inner transformer architecture comprises a bunch of neural networks in the form of an encoder and a decoder.

Artificial Inteligence

Artificial Inteligence Tools Open Source Architecture

Are enterprises ready to adopt AI at scale?

CIO

OCTOBER 30, 2024

Whether it’s a financial services firm looking to build a personalized virtual assistant or an insurance company in need of ML models capable of identifying potential fraud, artificial intelligence (AI) is primed to transform nearly every industry.

Enterprise

Enterprise Artificial Inteligence Architecture Survey

From legacy to lakehouse: Centralizing insurance data with Delta Lake

CIO

APRIL 23, 2025

This is where Delta Lakehouse architecture truly shines. Approach Sid Dixit Implementing lakehouse architecture is a three-phase journey, with each stage demanding dedicated focus and independent treatment. Step 2: Transformation (using ELT and Medallion Architecture ) Bronze layer: Keep it raw.

Insurance

Insurance Artificial Inteligence Data Architecture

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Generative and agentic artificial intelligence (AI) are paving the way for this evolution. And its modular architecture distributes tasks across multiple agents in parallel, increasing the speed and scalability of migrations. The EXLerate.AI

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

All industries and modern applications are undergoing rapid transformation powered by advances in accelerated computing, deep learning, and artificial intelligence. The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data.

Artificial Inteligence

Artificial Inteligence Engineering Data Storage

The AI Future According to Google Cloud Next ’25: My Interesting Finds

Xebia

APRIL 17, 2025

It also supports the newly announced Agent 2 Agent (A2A) protocol which Google is positioning as an open, secure standard for agent-agent collaboration, driven by a large community of Technology, Platform and Service partners. Native Multi-Agent Architecture: Build scalable applications by composing specialized agents in a hierarchy.

Google Cloud

Google Cloud Artificial Inteligence Cloud Video

CAIOs are stepping out from the CIO’s shadow

CIO

MARCH 14, 2025

But the increase in use of intelligent tools in recent years since the arrival of generative AI has begun to cement the CAIO role as a key tech executive position across a wide range of sectors. The role of artificial intelligence is very closely tied to generating efficiencies on an ongoing basis, as well as implying continuous adoption.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Government Generative AI

Layoffs, AI demand create mismatched talent market for IT skills

CIO

OCTOBER 23, 2024

Just days later, Cisco Systems announced it planned to reduce its workforce by 7%, citing shifts to other priorities such as artificial intelligence and cybersecurity — after having already laid off over 4,000 employees in February.

Marketing

Marketing Artificial Inteligence Generative AI Artificial Intelligence

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

AWS Machine Learning - AI

APRIL 21, 2025

In this blog post, we discuss how Prompt Optimization improves the performance of large language models (LLMs) for intelligent text processing task in Yuewen Group. Evolution from Traditional NLP to LLM in Intelligent Text Processing Yuewen Group leverages AI for intelligent analysis of extensive web novel texts.

Artificial Inteligence

Artificial Inteligence Groups Applications Innovation

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

National Laboratory has implemented an AI-driven document processing platform that integrates named entity recognition (NER) and large language models (LLMs) on Amazon SageMaker AI. In this post, we discuss how you can build an AI-powered document processing platform with open source NER and LLMs on SageMaker.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Integrate foundation models into your code with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 6, 2024

The rise of large language models (LLMs) and foundation models (FMs) has revolutionized the field of natural language processing (NLP) and artificial intelligence (AI). He is passionate about cloud and machine learning.

Software Review

Software Review Artificial Inteligence Generative AI AWS

Revolutionizing data management: Trends driving security, scalability, and governance in 2025

CIO

JANUARY 30, 2025

Augmented data management with AI/ML Artificial Intelligence and Machine Learning transform traditional data management paradigms by automating labour-intensive processes and enabling smarter decision-making. With machine learning, these processes can be refined over time and anomalies can be predicted before they arise.

Scalability

Scalability Government Trends Artificial Inteligence

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

In this post, we explore the new Container Caching feature for SageMaker inference, addressing the challenges of deploying and scaling large language models (LLMs). You’ll learn about the key benefits of Container Caching, including faster scaling, improved resource utilization, and potential cost savings.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

About the NVIDIA Nemotron model family At the forefront of the NVIDIA Nemotron model family is Nemotron-4, as stated by NVIDIA, it is a powerful multilingual large language model (LLM) trained on an impressive 8 trillion text tokens, specifically optimized for English, multilingual, and coding tasks.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

United Airlines sets its flight plan for gen AI success

CIO

DECEMBER 20, 2024

With the core architectural backbone of the airlines gen AI roadmap in place, including United Data Hub and an AI and ML platform dubbed Mars, Birnbaum has released a handful of models into production use for employees and customers alike.

Airlines

Airlines Generative AI Artificial Inteligence Weak Development Team

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

Architecture The following figure shows the architecture of the solution. Through natural language processing algorithms and machine learning techniques, the large language model (LLM) analyzes the user’s queries in real time, extracting relevant context and intent to deliver tailored responses.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

The Struggle Between Data Dark Ages and LLM Accuracy

Cloudera

DECEMBER 6, 2024

Artificial Intelligence promises to transform lives and business as we know it. The AI Forecast: Data and AI in the Cloud Era , sponsored by Cloudera, aims to take an objective look at the impact of AI on business, industry, and the world at large. But what does that future look like? That’s context, that’s location.

Artificial Inteligence

Artificial Inteligence Data Artificial Intelligence Retail

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

The effectiveness of RAG heavily depends on the quality of context provided to the large language model (LLM), which is typically retrieved from vector stores based on user queries. The relevance of this context directly impacts the model’s ability to generate accurate and contextually appropriate responses.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

AI brings order to observability disorder

CIO

APRIL 16, 2025

Digital tools are the lifeblood of todays enterprises, but the complexity of hybrid cloud architectures, involving thousands of containers, microservices and applications, frustratesoperational leaders trying to optimize business outcomes. Artificial intelligence has contributed to complexity.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Analysis Banking

Securing AI Infrastructure for a More Resilient Future

Palo Alto Networks

OCTOBER 30, 2024

As policymakers across the globe approach regulating artificial intelligence (AI), there is an emerging and welcomed discussion around the importance of securing AI systems themselves. These models are increasingly being integrated into applications and networks across every sector of the economy.

Security

Security Artificial Inteligence Infrastructure Government

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. The Streamlit application will now display a button labeled Get LLM Response.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

How MCP can revolutionize the way DevOps teams use AI

CIO

APRIL 29, 2025

Imagine, for example, asking an LLM which Amazon S3 storage buckets or Azure storage accounts contain data that is publicly accessible, then change their access settings? Or having an LLM identify documents in an Amazon DynamoDB database that havent been updated in over a year and delete or archive them.

DevOps

DevOps Artificial Inteligence Technical Review Software Review

Improve Amazon Nova migration performance with data-aware prompt optimization

AWS Machine Learning - AI

APRIL 29, 2025

In the era of generative AI , new large language models (LLMs) are continually emerging, each with unique capabilities, architectures, and optimizations. Among these, Amazon Nova foundation models (FMs) deliver frontier intelligence and industry-leading cost-performance, available exclusively on Amazon Bedrock.

Artificial Inteligence

Artificial Inteligence Performance Data Generative AI

Unbundling the Graph in GraphRAG

O'Reilly Media - Ideas

NOVEMBER 19, 2024

Reasons for using RAG are clear: large language models (LLMs), which are effectively syntax engines, tend to “hallucinate” by inventing answers from pieces of their training data. Also, in place of expensive retraining or fine-tuning for an LLM, this approach allows for quick data updates at low cost.

Artificial Inteligence

Artificial Inteligence Construction Open Source Training

Goodbye digital transformation, hello AI-first business transformation

CIO

FEBRUARY 4, 2025

Instead of seeing digital as a new paradigm for our business, we over-indexed on digitizing legacy models and processes and modernizing our existing organization. The rise of artificial intelligence is giving us all a second chance. They were new products, interfaces, and architectures to do the same thing we always did.

Business Transformation

Business Transformation Artificial Inteligence Enterprise Artificial Intelligence

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Out-of-the-box models often lack the specific knowledge required for certain domains or organizational terminologies. To address this, businesses are turning to custom fine-tuned models, also known as domain-specific large language models (LLMs). The following diagram is the solution architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

INE Security Alert: Using AI-Driven Cybersecurity Training to Counter Emerging Threats

CIO

MARCH 14, 2025

As Artificial Intelligence (AI)-powered cyber threats surge, INE Security , a global leader in cybersecurity training and certification, is launching a new initiative to help organizations rethink cybersecurity training and workforce development.

Artificial Inteligence

Artificial Inteligence Training Generative AI Artificial Intelligence

Cybersecurity Snapshot: AI Security Roundup: Best Practices, Research and Insights

Tenable

NOVEMBER 29, 2024

The agencies recommend that organizations developing and deploying AI systems incorporate the following: Ensure a secure deployment environment : Confirm that the organization’s IT infrastructure is robust, with good governance, a solid architecture and secure configurations in place.

Artificial Inteligence

Artificial Inteligence Research Generative AI Technical Review

How today’s enterprise architect juggles strategy, tech and innovation

CIO

APRIL 16, 2025

Jenga builder: Enterprise architects piece together both reusable and replaceable components and solutions enabling responsive (adaptable, resilient) architectures that accelerate time-to-market without disrupting other components or the architecture overall (e.g. compromising quality, structure, integrity, goals).

Technical Review

Technical Review Enterprise Strategy Innovation

AIMMO bags $12M Series A to advance data labeling technology

TechCrunch

JANUARY 2, 2022

Most artificial intelligence models are trained through supervised learning, meaning that humans must label raw data. Data labeling is a critical part of automating artificial intelligence and machine learning model, but at the same time, it can be time-consuming and tedious work.

Artificial Inteligence

Artificial Inteligence Technology Artificial Intelligence Data

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represent a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Inteligence Study Generative AI

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. With activations being partitioned along the sequence dimension, we need to consider how our model’s computations are affected.

Training

Training Artificial Inteligence AWS Machine Learning

Not your father’s avatar: The real future of artificial intelligence

CIO

OCTOBER 24, 2024

Generative artificial intelligence (genAI) is the latest milestone in the “AAA” journey, which began with the automation of the mundane, lead to augmentation — mostly machine-driven but lately also expanding into human augmentation — and has built up to artificial intelligence. Artificial?

Artificial Intelligence

Artificial Intelligence Artificial Inteligence Technical Review Software Review

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The following were some initial challenges in automation: Language diversity – The services host both Dutch and English shows. Some local shows feature Flemish dialects, which can be difficult for some large language models (LLMs) to understand. The secondary LLM is used to evaluate the summaries on a large scale.

Media

Media Video Artificial Inteligence Generative AI

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy the AI/ML model efficiently in production environments. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Inteligence AWS Artificial Intelligence

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Agentic AI design: An architectural case study

Webinars

Trending Sources

The key to operational AI: Modern data architecture

Webinars

Have we reached the end of ‘too expensive’ for enterprise software?

Embedding BI: Architectural Considerations and Technical Requirements

Multi-LLM routing strategies for generative AI applications on AWS

What is data architecture? A framework to manage data

How AI orchestration has become more important than the models themselves

Top 11 LLM Tools That Ensure Smooth LLM Operations

Are enterprises ready to adopt AI at scale?

From legacy to lakehouse: Centralizing insurance data with Delta Lake

AI in action: Stories of how enterprises are transforming and modernizing

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

The AI Future According to Google Cloud Next ’25: My Interesting Finds

CAIOs are stepping out from the CIO’s shadow

Layoffs, AI demand create mismatched talent market for IT skills

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Integrate foundation models into your code with Amazon Bedrock

Revolutionizing data management: Trends driving security, scalability, and governance in 2025

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

United Airlines sets its flight plan for gen AI success

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

The Struggle Between Data Dark Ages and LLM Accuracy

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AI brings order to observability disorder

Securing AI Infrastructure for a More Resilient Future

Build and deploy a UI for your generative AI applications with AWS and Python

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

How MCP can revolutionize the way DevOps teams use AI

Improve Amazon Nova migration performance with data-aware prompt optimization

Unbundling the Graph in GraphRAG

Goodbye digital transformation, hello AI-first business transformation

Host concurrent LLMs with LoRAX

INE Security Alert: Using AI-Driven Cybersecurity Training to Counter Emerging Threats

Cybersecurity Snapshot: AI Security Roundup: Best Practices, Research and Insights

How today’s enterprise architect juggles strategy, tech and innovation

AIMMO bags $12M Series A to advance data labeling technology

Model customization, RAG, or both: A case study with Amazon Nova

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Not your father’s avatar: The real future of artificial intelligence

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Building a Scalable ML Pipeline and API in AWS

Stay Connected