Architecture, Artificial Inteligence and Storage

Architecture

Artificial Inteligence

Storage

What is data architecture? A framework to manage data

CIO

DECEMBER 20, 2024

Data architecture definition Data architecture describes the structure of an organizations logical and physical data assets, and data management resources, according to The Open Group Architecture Framework (TOGAF). An organizations data architecture is the purview of data architects. Cloud storage.

Architecture

Architecture Data Fractional CTO Technical Review

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Join 49,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Are enterprises ready to adopt AI at scale?

CIO

OCTOBER 30, 2024

Whether it’s a financial services firm looking to build a personalized virtual assistant or an insurance company in need of ML models capable of identifying potential fraud, artificial intelligence (AI) is primed to transform nearly every industry.

Enterprise

Enterprise Artificial Inteligence Architecture Artificial Intelligence

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

All industries and modern applications are undergoing rapid transformation powered by advances in accelerated computing, deep learning, and artificial intelligence. The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data.

Artificial Inteligence

Artificial Inteligence Engineering Data Storage

How MCP can revolutionize the way DevOps teams use AI

CIO

APRIL 29, 2025

Imagine, for example, asking an LLM which Amazon S3 storage buckets or Azure storage accounts contain data that is publicly accessible, then change their access settings? Or having an LLM identify documents in an Amazon DynamoDB database that havent been updated in over a year and delete or archive them.

DevOps

DevOps Artificial Inteligence Technical Review Software Review

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.

Artificial Inteligence

Artificial Inteligence AWS Lambda Metrics

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Out-of-the-box models often lack the specific knowledge required for certain domains or organizational terminologies. To address this, businesses are turning to custom fine-tuned models, also known as domain-specific large language models (LLMs). The following diagram is the solution architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

National Laboratory has implemented an AI-driven document processing platform that integrates named entity recognition (NER) and large language models (LLMs) on Amazon SageMaker AI. In this post, we discuss how you can build an AI-powered document processing platform with open source NER and LLMs on SageMaker.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

AI brings order to observability disorder

CIO

APRIL 16, 2025

Digital tools are the lifeblood of todays enterprises, but the complexity of hybrid cloud architectures, involving thousands of containers, microservices and applications, frustratesoperational leaders trying to optimize business outcomes. Artificial intelligence has contributed to complexity.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Analysis Banking

Stability AI backs effort to bring machine learning to biomed

TechCrunch

NOVEMBER 4, 2022

Called OpenBioML , the endeavor’s first projects will focus on machine learning-based approaches to DNA sequencing, protein folding and computational biochemistry. Stability AI’s ethically questionable decisions to date aside, machine learning in medicine is a minefield. ” Generating DNA sequences.

Artificial Inteligence

Artificial Inteligence Machine Learning Biotech Training

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

Architecture The following figure shows the architecture of the solution. Through natural language processing algorithms and machine learning techniques, the large language model (LLM) analyzes the user’s queries in real time, extracting relevant context and intent to deliver tailored responses.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represent a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Inteligence Study Generative AI

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. With activations being partitioned along the sequence dimension, we need to consider how our model’s computations are affected.

Training

Training Artificial Inteligence AWS Machine Learning

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

AWS Machine Learning - AI

APRIL 30, 2025

With advancement in AI technology, the time is right to address such complexities with large language models (LLMs). Amazon Bedrock has helped democratize access to LLMs, which have been challenging to host and manage. The following diagram illustrates the architecture using AWS services.

Artificial Inteligence

Artificial Inteligence Technical Review Software Review Generative AI

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. He helps support large enterprise customers at AWS and is part of the Machine Learning TFC.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Real-time Data, Machine Learning, and Results: The Evidence Mounts

CIO

OCTOBER 4, 2022

From delightful consumer experiences to attacking fuel costs and carbon emissions in the global supply chain, real-time data and machine learning (ML) work together to power apps that change industries. Data architecture coherence. more machine learning use casesacross the company.

Machine Learning

Machine Learning Artificial Inteligence Data Architecture

Revolutionizing clinical trials with the power of voice and AI

AWS Machine Learning - AI

MARCH 18, 2025

This is where the integration of cutting-edge technologies, such as audio-to-text translation and large language models (LLMs), holds the potential to revolutionize the way patients receive, process, and act on vital medical information. These insights can include: Potential adverse event detection and reporting.

Artificial Inteligence

Artificial Inteligence Technical Review Healthcare Systems Review

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. The solution incorporates the following key features: Using a Retrieval Augmented Generation (RAG) architecture, the system generates a context-aware detailed assessment.

Generative AI

Generative AI Technical Review Software Review Systems Review

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. The resulting distilled models, such as DeepSeek-R1-Distill-Llama-8B (from base model Llama-3.1-8B

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

This need for customization has become even more pronounced with the emergence of new models, such as those released by DeepSeek. However, customizing DeepSeek models effectively while managing computational resources remains a significant challenge. You can run these recipes using SageMaker HyperPod or as SageMaker training jobs.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. It doesn’t retain audio or output text, and users have control over data storage with encryption in transit and at rest. This can lead to more personalized and effective care.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Build your gen AI–based text-to-SQL application using RAG, powered by Amazon Bedrock (Claude 3 Sonnet and Amazon Titan for embedding)

AWS Machine Learning - AI

MARCH 18, 2025

This application allows users to ask questions in natural language and then generates a SQL query for the users request. Large language models (LLMs) are trained to generate accurate SQL queries for natural language instructions. However, off-the-shelf LLMs cant be used without some modification.

Artificial Inteligence

Artificial Inteligence Applications Generative AI Off-The-Shelf

IT leaders brace for the AI agent management challenge

CIO

MARCH 4, 2025

There are organizations who spend $1 million plus per year on LLM calls, Ricky wrote. Agent ops is a critical capability think Python SDKs for agent monitoring, LLM cost tracking, benchmarking, to gain visibility into API calls, real-time cost management, and reliability scores for agents in production.

Software Review

Software Review Artificial Inteligence Technical Review Government

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures. Prompt catalog – Crafting effective prompts is important for guiding large language models (LLMs) to generate the desired outputs. It’s serverless so you don’t have to manage the infrastructure.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Multiclass Text Classification Using LLM (MTC-LLM): A Comprehensive Guide

Perficient

NOVEMBER 20, 2024

Introduction to Multiclass Text Classification with LLMs Multiclass text classification (MTC) is a natural language processing (NLP) task where text is categorized into multiple predefined categories or classes. Traditional approaches rely on training machine learning models, requiring labeled data and iterative fine-tuning.

Artificial Inteligence

Artificial Inteligence Metrics Airlines Travel

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

It prevents vendor lock-in, gives a lever for strong negotiation, enables business flexibility in strategy execution owing to complicated architecture or regional limitations in terms of security and legal compliance if and when they rise and promotes portability from an application architecture perspective.

Cloud

Cloud Strategy Architecture Policies

Foundational data protection for enterprise LLM acceleration with Protopia AI

AWS Machine Learning - AI

DECEMBER 5, 2023

New and powerful large language models (LLMs) are changing businesses rapidly, improving efficiency and effectiveness for a variety of enterprise use cases. Speed is of the essence, and adoption of LLM technologies can make or break a business’s competitive advantage.

Artificial Inteligence

Artificial Inteligence Enterprise Data Generative AI

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 31, 2024

The solution integrates large language models (LLMs) with your organization’s data and provides an intelligent chat assistant that understands conversation context and provides relevant, interactive responses directly within the Google Chat interface. It can be a local machine or a cloud instance.

Generative AI

Generative AI Lambda Applications AWS

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

AWS Machine Learning - AI

JULY 24, 2024

Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt to your unique use case, improving its performance on your specific dataset or task.

Artificial Inteligence

Artificial Inteligence Generative AI Training AWS

Data distilleries: CIOs turn to new efficient enterprise data platforms

CIO

DECEMBER 5, 2024

Consolidating data and improving accessibility through tenanted access controls can typically deliver a 25-30% reduction in data storage expenses while driving more informed decisions. When evaluating options, prioritize platforms that facilitate data democratization through low-code or no-code architectures.

Enterprise

Enterprise Data Insurance Business Intelligence

Astera Labs, a fabless chip startup, nabs $50M at a $950M valuation to remove bottlenecks in high-bandwidth cloud applications

TechCrunch

SEPTEMBER 27, 2021

As more enterprises migrate to cloud-based architectures, they are also taking on more applications (because they can) and, as a result of that, more complex workloads and storage needs. Machine learning and other artificial intelligence applications add even more complexity.

Artificial Inteligence

Artificial Inteligence Applications Cloud Artificial Intelligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

Training large language models (LLMs) models has become a significant expense for businesses. For many use cases, companies are looking to use LLM foundation models (FM) with their domain-specific data. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Artificial Inteligence Generative AI Training

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

AWS Machine Learning - AI

MARCH 11, 2025

DeepSeek-R1 is a large language model (LLM) developed by DeepSeek AI that uses reinforcement learning to enhance reasoning capabilities through a multi-stage training process from a DeepSeek-V3-Base foundation. DeepSeek-R1 uses a Mixture of Experts (MoE) architecture and is 671 billion parameters in size.

Artificial Inteligence

Artificial Inteligence AWS Generative AI Metrics

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

Traditionally, transforming raw data into actionable intelligence has demanded significant engineering effort. It often requires managing multiple machine learning (ML) models, designing complex workflows, and integrating diverse data sources into production-ready formats.

Data

Data Generative AI Artificial Inteligence Compliance

Transforming workloads: Harnessing AI within VMware environments

CIO

APRIL 9, 2025

CEOs and boards of directors are tasking their CIOs to enable artificial intelligence (AI) within the organization as rapidly as possible. The networking, compute, and storage needs not to mention power and cooling are significant, and market pressures require the assembly to happen quickly.

Google Cloud

Google Cloud Load Balancer Virtualization Artificial Inteligence

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Organizations building and deploying AI applications, particularly those using large language models (LLMs) with Retrieval Augmented Generation (RAG) systems, face a significant challenge: how to evaluate AI outputs effectively throughout the application lifecycle.

Knowledge Base

Knowledge Base Applications Artificial Inteligence Generative AI

Data trends in 2025

Xebia

FEBRUARY 23, 2025

Data governance is rapidly rising on the priority lists of large companies that want to work with AI in a data-driven manner. In many companies, data is spread across different storage locations and platforms, thus, ensuring effective connections and governance is crucial. Poor data quality automatically results in poor decisions.

Trends

Trends Data Artificial Inteligence Weak Development Team

Build a gen AI–powered financial assistant with Amazon Bedrock multi-agent collaboration

AWS Machine Learning - AI

MAY 2, 2025

The use of a multi-agent system, rather than relying on a single large language model (LLM) to handle all tasks, enables more focused and in-depth analysis in specialized areas. Furthermore, the systems modular architecture facilitates seamless maintenance, updates, and scalability.

Real Estate

Real Estate Artificial Inteligence Knowledge Base Lambda

AI on the mainframe? IBM may be onto something

CIO

OCTOBER 3, 2024

Rather than pull away from big iron in the AI era, Big Blue is leaning into it, with plans in 2025 to release its next-generation Z mainframe , with a Telum II processor and Spyre AI Accelerator Card, positioned to run large language models (LLMs) and machine learning models for fraud detection and other use cases.

Artificial Inteligence

Artificial Inteligence Generative AI Machine Learning Enterprise

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

The flexible, scalable nature of AWS services makes it straightforward to continually refine the platform through improvements to the machine learning models and addition of new features. The following diagram illustrates the Principal generative AI chatbot architecture with AWS services.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

In this post, we describe the development journey of the generative AI companion for Mozart, the data, the architecture, and the evaluation of the pipeline. Solution overview The policy documents reside in Amazon Simple Storage Service (Amazon S3) storage. The following diagram illustrates the solution architecture.

Generative AI

Generative AI Technical Review Insurance Policies

Medical content creation in the age of generative AI

AWS Machine Learning - AI

JULY 3, 2024

Generative AI and transformer-based large language models (LLMs) have been in the top headlines recently. These models demonstrate impressive performance in question answering, text summarization, code, and text generation. Amazon Bedrock : to interact with supported LLMs and embedding models.

Artificial Inteligence

Artificial Inteligence Generative AI Lambda Healthcare

Scaling AI? First—get your data storage right

CIO

JUNE 23, 2023

Artificial intelligence (AI) is the analytics vehicle that extracts data’s tremendous value and translates it into actionable, usable insights. In my role at Dell Technologies, I strive to help organizations advance the use of data, especially unstructured data, by democratizing the at-scale deployment of artificial intelligence (AI).

Storage

Storage Data Artificial Inteligence Artificial Intelligence

What is data architecture? A framework to manage data

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Are enterprises ready to adopt AI at scale?

Webinars

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

How MCP can revolutionize the way DevOps teams use AI

Techniques and approaches for monitoring large language models on AWS

Host concurrent LLMs with LoRAX

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AI brings order to observability disorder

Stability AI backs effort to bring machine learning to biomed

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Model customization, RAG, or both: A case study with Amazon Nova

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

FloQast builds an AI-powered accounting transformation solution with Anthropic’s Claude 3 on Amazon Bedrock

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Real-time Data, Machine Learning, and Results: The Evidence Mounts

Revolutionizing clinical trials with the power of voice and AI

Accelerate AWS Well-Architected reviews with Generative AI

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Build your gen AI–based text-to-SQL application using RAG, powered by Amazon Bedrock (Claude 3 Sonnet and Amazon Titan for embedding)

IT leaders brace for the AI agent management challenge

Build a multi-tenant generative AI environment for your enterprise on AWS

Multiclass Text Classification Using LLM (MTC-LLM): A Comprehensive Guide

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Foundational data protection for enterprise LLM acceleration with Protopia AI

Create a generative AI–powered custom Google Chat application using Amazon Bedrock

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

Data distilleries: CIOs turn to new efficient enterprise data platforms

Astera Labs, a fabless chip startup, nabs $50M at a $950M valuation to remove bottlenecks in high-bandwidth cloud applications

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Transforming workloads: Harnessing AI within VMware environments

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

Data trends in 2025

Build a gen AI–powered financial assistant with Amazon Bedrock multi-agent collaboration

AI on the mainframe? IBM may be onto something

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

Medical content creation in the age of generative AI

Scaling AI? First—get your data storage right

Stay Connected