Data architecture definition: Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.
Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.
Whether it’s a financial services firm looking to build a personalized virtual assistant or an insurance company in need of ML models capable of identifying potential fraud, artificial intelligence (AI) is primed to transform nearly every industry.
All industries and modern applications are undergoing rapid transformation powered by advances in accelerated computing, deep learning, and artificial intelligence. The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data.
Large language models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.
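A basic form of such monitoring can be sketched as a thin wrapper that records latency and rough token counts per call. The function and field names below are illustrative, not from any particular monitoring SDK, and the model itself is a stub.

```python
# Minimal sketch of LLM call monitoring: wrap a model call to record
# latency and crude token counts. Production tools capture far more
# (cost, error rates, traces); this only shows the pattern.
import time

def monitored_call(model_fn, prompt: str, log: list) -> str:
    """Invoke model_fn on prompt, appending timing/size metrics to log."""
    start = time.perf_counter()
    output = model_fn(prompt)
    log.append({
        "latency_s": time.perf_counter() - start,
        "prompt_tokens": len(prompt.split()),   # crude whitespace tokenization
        "output_tokens": len(output.split()),
    })
    return output

def fake_model(p: str) -> str:
    """Stand-in for a real LLM call."""
    return "summary of input"

log: list = []
result = monitored_call(fake_model, "summarize this short text", log)
```

In a real deployment the log entries would be shipped to a metrics backend rather than kept in a Python list.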
Called OpenBioML, the endeavor's first projects will focus on machine learning-based approaches to DNA sequencing, protein folding, and computational biochemistry. Stability AI's ethically questionable decisions to date aside, machine learning in medicine is a minefield. Generating DNA sequences.
Architecture: The following figure shows the architecture of the solution. Through natural language processing algorithms and machine learning techniques, the large language model (LLM) analyzes the user's queries in real time, extracting relevant context and intent to deliver tailored responses.
From delightful consumer experiences to attacking fuel costs and carbon emissions in the global supply chain, real-time data and machine learning (ML) work together to power apps that change industries. Data architecture coherence. More machine learning use cases across the company.
The introduction of Amazon Nova models represents a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.
This is where the integration of cutting-edge technologies, such as audio-to-text translation and large language models (LLMs), holds the potential to revolutionize the way patients receive, process, and act on vital medical information. These insights can include: potential adverse event detection and reporting.
There are organizations that spend $1 million-plus per year on LLM calls, Ricky wrote. Agent ops is a critical capability: think Python SDKs for agent monitoring, LLM cost tracking, and benchmarking, to gain visibility into API calls, real-time cost management, and reliability scores for agents in production.
This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. He helps support large enterprise customers at AWS and is part of the Machine Learning TFC.
CEOs and boards of directors are tasking their CIOs to enable artificial intelligence (AI) within the organization as rapidly as possible. The networking, compute, and storage needs, not to mention power and cooling, are significant, and market pressures require the assembly to happen quickly.
Introduction to Multiclass Text Classification with LLMs Multiclass text classification (MTC) is a natural language processing (NLP) task where text is categorized into multiple predefined categories or classes. Traditional approaches rely on training machine learning models, requiring labeled data and iterative fine-tuning.
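With LLMs, MTC can instead be done zero-shot by listing the allowed classes in the prompt and constraining the model's answer. A minimal sketch, with invented helper names and the LLM call itself stubbed out:

```python
# Sketch of prompt-based multiclass text classification (MTC).
# CATEGORIES, build_prompt, and parse_label are illustrative names,
# not from any specific library; no real model is invoked here.

CATEGORIES = ["billing", "technical support", "sales", "other"]

def build_prompt(text: str, categories: list) -> str:
    """Compose a zero-shot classification prompt listing the allowed classes."""
    options = ", ".join(categories)
    return (
        f"Classify the following text into exactly one of these categories: "
        f"{options}.\n"
        f"Text: {text}\n"
        f"Answer with the category name only."
    )

def parse_label(model_output: str, categories: list) -> str:
    """Map a raw model response onto a known category (default: 'other')."""
    cleaned = model_output.strip().lower()
    for cat in categories:
        if cat in cleaned:
            return cat
    return "other"

prompt = build_prompt("My invoice shows a duplicate charge.", CATEGORIES)
label = parse_label("Billing", CATEGORIES)  # stand-in for an actual LLM response
```

The parsing step matters in practice: models often return extra words, so mapping free-form output back onto the closed label set keeps downstream code robust.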
And so we are thrilled to introduce our latest applied ML prototype (AMP) — a large language model (LLM) chatbot customized with website data using Meta's Llama 2 LLM and Pinecone's vector database. We invite you to explore the improved functionalities of this latest AMP.
Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. With activations being partitioned along the sequence dimension, we need to consider how our model's computations are affected.
DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.
It prevents vendor lock-in, provides leverage for strong negotiation, enables business flexibility in strategy execution when complicated architectures or regional security and legal-compliance limitations arise, and promotes portability from an application architecture perspective.
Consolidating data and improving accessibility through tenanted access controls can typically deliver a 25-30% reduction in data storage expenses while driving more informed decisions. When evaluating options, prioritize platforms that facilitate data democratization through low-code or no-code architectures.
This application allows users to ask questions in natural language and then generates a SQL query for the user's request. Large language models (LLMs) are trained to generate accurate SQL queries from natural language instructions. However, off-the-shelf LLMs can't be used without some modification.
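One common modification is grounding the prompt in the actual database schema, so the generated SQL only references real tables and columns. A minimal sketch, with an invented schema and helper names, and the model call itself omitted:

```python
# Sketch of schema-grounded text-to-SQL prompting. The tables and
# column names are made up for illustration; a real system would
# introspect the live database and then send this prompt to an LLM.

SCHEMA = {
    "orders": ["order_id", "customer_id", "total", "created_at"],
    "customers": ["customer_id", "name", "country"],
}

def schema_to_ddl(schema: dict) -> str:
    """Render the schema as CREATE TABLE statements for the prompt."""
    lines = []
    for table, cols in schema.items():
        lines.append(f"CREATE TABLE {table} ({', '.join(cols)});")
    return "\n".join(lines)

def build_sql_prompt(question: str, schema: dict) -> str:
    """Combine schema context with the user's natural-language question."""
    return (
        "Given this database schema:\n"
        f"{schema_to_ddl(schema)}\n"
        f"Write a single SQL query answering: {question}\n"
        "Return only SQL."
    )

prompt = build_sql_prompt("Total order value per country?", SCHEMA)
```

Validating the returned SQL against the same schema before execution is the usual follow-up step, since even schema-grounded models can hallucinate columns.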
Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. The resulting distilled models include DeepSeek-R1-Distill-Llama-8B (from base model Llama-3.1-8B).
As more enterprises migrate to cloud-based architectures, they are also taking on more applications (because they can) and, as a result of that, more complex workloads and storage needs. Machine learning and other artificial intelligence applications add even more complexity.
With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. It doesn’t retain audio or output text, and users have control over data storage with encryption in transit and at rest. This can lead to more personalized and effective care.
You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures. Prompt catalog – Crafting effective prompts is important for guiding large language models (LLMs) to generate the desired outputs. It’s serverless, so you don’t have to manage the infrastructure.
Traditionally, transforming raw data into actionable intelligence has demanded significant engineering effort. It often requires managing multiple machine learning (ML) models, designing complex workflows, and integrating diverse data sources into production-ready formats.
The solution integrates large language models (LLMs) with your organization’s data and provides an intelligent chat assistant that understands conversation context and provides relevant, interactive responses directly within the Google Chat interface. It can be a local machine or a cloud instance.
Data governance is rapidly rising on the priority lists of large companies that want to work with AI in a data-driven manner. In many companies, data is spread across different storage locations and platforms, so ensuring effective connections and governance is crucial. Poor data quality automatically results in poor decisions.
Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt it to your unique use case, improving its performance on your specific dataset or task.
To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. The solution incorporates the following key features: Using a Retrieval Augmented Generation (RAG) architecture, the system generates a context-aware detailed assessment.
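The retrieval step of such a RAG architecture can be illustrated with a toy example: score stored passages against the question and prepend the best match as context. Real systems use dense embeddings and a vector store; the bag-of-words scoring and sample passages below are purely illustrative.

```python
# Toy sketch of RAG retrieval: rank passages by cosine similarity of
# whitespace-tokenized bag-of-words vectors, then build an augmented
# prompt. Not a production retriever; embeddings would replace Counters.
import math
from collections import Counter

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(question: str, passages: list) -> str:
    """Return the passage most similar to the question."""
    q = Counter(question.lower().split())
    return max(passages, key=lambda p: cosine(q, Counter(p.lower().split())))

passages = [
    "The Well-Architected Framework has six pillars.",
    "RAG retrieves relevant documents before generation.",
]
context = retrieve("how many pillars in the well-architected framework", passages)
prompt = f"Context: {context}\nQuestion: how many pillars?"
```

The assembled prompt would then go to the LLM, which answers from the retrieved context instead of relying only on its parametric knowledge.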
Sovereign AI refers to a national or regional effort to develop and control artificial intelligence (AI) systems, independent of the large non-EU foreign private tech platforms that currently dominate the field. Talent shortages: AI development requires specialized knowledge in machine learning, data science, and engineering.
This need for customization has become even more pronounced with the emergence of new models, such as those released by DeepSeek. However, customizing DeepSeek models effectively while managing computational resources remains a significant challenge. You can run these recipes using SageMaker HyperPod or as SageMaker training jobs.
Rather than pull away from big iron in the AI era, Big Blue is leaning into it, with plans in 2025 to release its next-generation Z mainframe, with a Telum II processor and Spyre AI Accelerator Card, positioned to run large language models (LLMs) and machine learning models for fraud detection and other use cases.
Maintaining a competitive edge can feel like a constant struggle as IT leaders race to adopt artificial intelligence (AI) to solve their IT challenges and drive innovation. Unless you analyze it, all this useful information can get lost in storage, often leading to lost revenue opportunities or high operational costs.
New and powerful large language models (LLMs) are changing businesses rapidly, improving efficiency and effectiveness for a variety of enterprise use cases. Speed is of the essence, and adoption of LLM technologies can make or break a business’s competitive advantage.
The flexible, scalable nature of AWS services makes it straightforward to continually refine the platform through improvements to the machine learning models and the addition of new features. The following diagram illustrates the Principal generative AI chatbot architecture with AWS services.
With data existing in a variety of architectures and forms, it can be impossible to discern which resources are the best for fueling GenAI. The Right Foundation Having trustworthy, governed data starts with modern, effective data management and storage practices.
Artificial intelligence (AI) is the analytics vehicle that extracts data’s tremendous value and translates it into actionable, usable insights. In my role at Dell Technologies, I strive to help organizations advance the use of data, especially unstructured data, by democratizing the at-scale deployment of artificial intelligence (AI).
These assistants can be powered by various backend architectures, including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. To learn more about FMEval, see Evaluate large language models for quality and responsibility of LLMs.
Many companies have been experimenting with advanced analytics and artificial intelligence (AI) to fill this need.[2] Foundational considerations include compute power and memory architecture, as well as data processing, storage, and security. Now, they must turn their proof of concept into a return on investment.
DeepSeek-R1 is a large language model (LLM) developed by DeepSeek AI that uses reinforcement learning to enhance reasoning capabilities through a multi-stage training process from a DeepSeek-V3-Base foundation. DeepSeek-R1 uses a Mixture of Experts (MoE) architecture and has 671 billion parameters.
Training large language models (LLMs) has become a significant expense for businesses. For many use cases, companies are looking to use LLM foundation models (FMs) with their domain-specific data. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/
No single platform architecture can satisfy all the needs and use cases of large complex enterprises, so SAP partnered with a small handful of companies to enhance and enlarge the scope of their offering. Unified Data Storage: Combines the scalability and flexibility of a data lake with the structured capabilities of a data warehouse.
In generative AI, data is the fuel, storage is the fuel tank and compute is the engine. Organizations need massive amounts of data to build and train generative AI models. In turn, these models will also generate reams of data that elevate organizational insights and productivity.
Once completed within two years, the platform, OneTru, will give TransUnion and its customers access to TransUnion’s behemoth trove of consumer data to fuel next-generation analytical services, machine learning models, and generative AI applications, says Achanta, who is driving the effort and held similar posts at Neustar and Walmart.