Artificial Intelligence, Scalability and Storage

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Applications

AI dominates Gartner’s 2025 predictions

CIO

OCTOBER 22, 2024

Artificial intelligence continues to dominate this week’s Gartner IT Symposium/Xpo, as well as the research firm’s annual predictions list. “It is clear that no matter where we go, we cannot avoid the impact of AI,” Daryl Plummer, distinguished vice president analyst, chief of research and Gartner Fellow, told attendees.

Artificial Intelligence

Artificial Intelligence Energy Healthcare Technical Review

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

All industries and modern applications are undergoing rapid transformation powered by advances in accelerated computing, deep learning, and artificial intelligence. The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data.

Artificial Intelligence

Artificial Intelligence Engineering Data Storage

Gartner projects major IT spending increases for 2025

CIO

OCTOBER 24, 2024

TRECIG, a cybersecurity and IT consulting firm, will spend more on IT in 2025 as it invests more in advanced technologies such as artificial intelligence, machine learning, and cloud computing, says Roy Rucker Sr., CEO and president there. The company will still prioritize IT innovation, however.

Data Center

Data Center Artificial Intelligence Generative AI

What is data architecture? A framework to manage data

CIO

DECEMBER 20, 2024

It’s an offshoot of enterprise architecture that comprises the models, policies, rules, and standards that govern the collection, storage, arrangement, integration, and use of data in organizations. It includes data collection, refinement, storage, analysis, and delivery. Cloud storage. AI and machine learning models.

Architecture

Architecture Data Fractional CTO Technical Review

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning - AI

FEBRUARY 12, 2025

Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate humanlike text. Researchers developed Medusa, a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.

Artificial Intelligence

Artificial Intelligence Training AWS Machine Learning

AI brings order to observability disorder

CIO

APRIL 16, 2025

Artificial intelligence has contributed to complexity. Businesses now want to monitor large language models as well as applications to spot anomalies that may contribute to inaccuracies, bias, and slow performance. Support for a wide range of large language models in the cloud and on premises.

Artificial Intelligence

Artificial Intelligence Analysis Banking

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.

Artificial Intelligence

Artificial Intelligence AWS Lambda Metrics

Unlocking the full potential of enterprise AI

CIO

JANUARY 5, 2025

These narrow approaches also exacerbate data quality issues, as discrepancies in data format, consistency, and storage arise across disconnected teams, reducing the accuracy and reliability of AI outputs. Reliability and security are paramount. Without the necessary guardrails and governance, AI can be harmful.

Enterprise

Enterprise Generative AI Weak Development Team Technical Review

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services. An agent uses the power of an LLM to determine which function to execute, and output the result based on the prompt guide.

Artificial Intelligence

Artificial Intelligence Generative AI Travel AWS

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represents a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Intelligence Study Generative AI

Fixie wants to make it easier for companies to build on top of language models

TechCrunch

MARCH 30, 2023

Co-founder and CEO Matt Welsh describes it as the first enterprise-focused platform-as-a-service for building experiences with large language models (LLMs). “The core of Fixie is its LLM-powered agents that can be built by anyone and run anywhere.” Fixie agents can interact with databases, APIs (e.g.

Artificial Intelligence

Artificial Intelligence Company ChatGPT Generative AI

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Out-of-the-box models often lack the specific knowledge required for certain domains or organizational terminologies. To address this, businesses are turning to custom fine-tuned models, also known as domain-specific large language models (LLMs). You have the option to quantize the model.

Artificial Intelligence

Artificial Intelligence Generative AI AWS Storage

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Although batch inference offers numerous benefits, it’s limited to 10 batch inference jobs submitted per model per Region. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. This automatically deletes the deployed stack.

Scalability

Scalability Lambda Generative AI AWS
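
The batch inference entry above mentions a cap of 10 submitted jobs per model per Region and a queue-based workaround. As a rough illustration of that idea only, the sketch below keeps an in-memory queue and releases jobs as slots free up. It makes no AWS calls; the class `BatchJobScheduler`, its method names, and the model and Region strings are hypothetical and are not taken from the linked post, which describes a solution built on AWS Lambda and Amazon DynamoDB.

```python
from collections import defaultdict, deque
from dataclasses import dataclass, field

MAX_ACTIVE_JOBS = 10  # per model per Region, the limit quoted in the teaser


@dataclass
class BatchJobScheduler:
    """Hypothetical in-memory stand-in for a persistent job queue."""
    limit: int = MAX_ACTIVE_JOBS
    pending: dict = field(default_factory=lambda: defaultdict(deque))
    active: dict = field(default_factory=lambda: defaultdict(set))

    def enqueue(self, model_id, region, job_name):
        # Jobs wait here until a slot is free for this (model, Region) pair.
        self.pending[(model_id, region)].append(job_name)

    def submit_ready(self, model_id, region):
        # Start queued jobs in FIFO order until the limit is reached.
        key = (model_id, region)
        started = []
        while self.pending[key] and len(self.active[key]) < self.limit:
            job = self.pending[key].popleft()
            self.active[key].add(job)  # a real system would submit the job here
            started.append(job)
        return started

    def complete(self, model_id, region, job_name):
        # Free the slot, then immediately try to start the next queued job.
        self.active[(model_id, region)].discard(job_name)
        return self.submit_ready(model_id, region)


scheduler = BatchJobScheduler()
for i in range(12):
    scheduler.enqueue("example-model", "us-east-1", f"job-{i}")

first_wave = scheduler.submit_ready("example-model", "us-east-1")
next_wave = scheduler.complete("example-model", "us-east-1", "job-0")
print(len(first_wave), next_wave)
```

With twelve queued jobs, the first call starts ten and holds two back; completing one job releases the next in line. A production version would persist the queue and react to job-status events rather than direct method calls.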

Inferencing holds the clues to AI puzzles

CIO

APRIL 10, 2024

Inferencing has emerged as one of the most exciting aspects of generative AI large language models (LLMs). A quick explainer: In AI inferencing, organizations take an LLM that is pretrained to recognize relationships in large datasets and generate new content based on input, such as text or images.

Artificial Intelligence

Artificial Intelligence Generative AI Storage

Multiclass Text Classification Using LLM (MTC-LLM): A Comprehensive Guide

Perficient

NOVEMBER 20, 2024

Introduction to Multiclass Text Classification with LLMs: Multiclass text classification (MTC) is a natural language processing (NLP) task where text is categorized into multiple predefined categories or classes. Traditional approaches rely on training machine learning models, requiring labeled data and iterative fine-tuning.

Artificial Intelligence

Artificial Intelligence Metrics Airlines Travel

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability.

Generative AI

Generative AI Video Engineering Artificial Intelligence

Navigating the future of national tech independence with sovereign AI

CIO

MARCH 31, 2025

Sovereign AI refers to a national or regional effort to develop and control artificial intelligence (AI) systems, independent of the large non-EU foreign private tech platforms that currently dominate the field. Talent shortages: AI development requires specialized knowledge in machine learning, data science, and engineering.

Technical Review

Technical Review Artificial Intelligence Compliance Open Source

Data trends in 2025

Xebia

FEBRUARY 23, 2025

Data governance is rapidly rising on the priority lists of large companies that want to work with AI in a data-driven manner. In many companies, data is spread across different storage locations and platforms, so ensuring effective connections and governance is crucial. Poor data quality automatically results in poor decisions.

Trends

Trends Data Artificial Intelligence Weak Development Team

Storage: The unsung hero of AI deployments

CIO

JULY 11, 2024

As enterprises begin to deploy and use AI, many realize they’ll need access to massive computing power and fast networking capabilities, but storage needs may be overlooked. In that case, Duos needs super-fast storage that works alongside its AI computing units. “If you have a broken wheel, you want to know right now,” he says.

Storage

Storage Cloud Enterprise Training

The 10 Biggest Rounds Of October: OpenAI’s Massive Deal Dwarfs All Others

Crunchbase News

NOVEMBER 1, 2024

OpenAI, $6.6B, artificial intelligence: OpenAI announced its long-awaited raise of $6.6 billion. (tied) Poolside, $500M, artificial intelligence: Poolside closed a $500 million Series B led by Bain Capital Ventures. The startup builds artificial intelligence software for programmers.

Biotech

Biotech Energy Artificial Intelligence

Harness the Power of Pinecone with Cloudera’s New Applied Machine Learning Prototype

Cloudera

NOVEMBER 1, 2023

And so we are thrilled to introduce our latest applied ML prototype (AMP) — a large language model (LLM) chatbot customized with website data using Meta’s Llama2 LLM and Pinecone’s vector database. We invite you to explore the improved functionalities of this latest AMP.

Artificial Intelligence

Artificial Intelligence Machine Learning Knowledge Base Architecture

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Intelligence

Artificial Intelligence AWS Machine Learning Load Balancer

See clearly, spend wisely: The power of data platform observability

Xebia

DECEMBER 23, 2024

Scalability and Flexibility: The Double-Edged Sword of Pay-As-You-Go Models. Pay-as-you-go pricing models are a game-changer for businesses. In these scenarios, the very scalability that makes pay-as-you-go models attractive can undermine an organization’s return on investment.

Data

Data Storage Culture Resources

Scaling AI? First—get your data storage right

CIO

JUNE 23, 2023

Artificial intelligence (AI) is the analytics vehicle that extracts data’s tremendous value and translates it into actionable, usable insights. In my role at Dell Technologies, I strive to help organizations advance the use of data, especially unstructured data, by democratizing the at-scale deployment of artificial intelligence (AI).

Storage

Storage Data Artificial Intelligence

Astera Labs, a fabless chip startup, nabs $50M at a $950M valuation to remove bottlenecks in high-bandwidth cloud applications

TechCrunch

SEPTEMBER 27, 2021

As more enterprises migrate to cloud-based architectures, they are also taking on more applications (because they can) and, as a result of that, more complex workloads and storage needs. Machine learning and other artificial intelligence applications add even more complexity.

Artificial Intelligence

Artificial Intelligence Applications Cloud

Data distilleries: CIOs turn to new efficient enterprise data platforms

CIO

DECEMBER 5, 2024

Consolidating data and improving accessibility through tenanted access controls can typically deliver a 25-30% reduction in data storage expenses while driving more informed decisions. The ideal solution should be scalable and flexible, capable of evolving alongside your organization’s needs.

Enterprise

Enterprise Data Insurance Business Intelligence

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

AWS Machine Learning - AI

JULY 24, 2024

Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt to your unique use case, improving its performance on your specific dataset or task.

Artificial Intelligence

Artificial Intelligence Generative AI Training AWS

Integrating Key Vault Secrets with Azure Synapse Analytics

Apiumhub

DECEMBER 9, 2024

Azure Key Vault Secrets offers a centralized and secure storage alternative for API keys, passwords, certificates, and other sensitive data. Azure Key Vault is a cloud service that provides secure storage and access to confidential information such as passwords, API keys, and connection strings. What is Azure Key Vault Secret?

Azure

Azure Analytics Storage Machine Learning

Comparing production-grade NLP libraries: Accuracy, performance, and scalability

O'Reilly Media - Data

FEBRUARY 28, 2018

Training scalability. Scalability difference is significant. Naturally, this advantage becomes more substantial as the data size grows, or as the complexity of the pipeline (more natural language processing (NLP) stages, adding machine learning (ML) or deep learning (DL) stages) grows. Scalability.

Scalability

Scalability Performance Comparison Training

Unlock business growth with data-driven insights: 5 lessons from IT leaders

CIO

MARCH 26, 2025

Maintaining a competitive edge can feel like a constant struggle as IT leaders race to adopt artificial intelligence (AI) to solve their IT challenges and drive innovation. Unless you analyze it, all this useful information can get lost in storage, often leading to lost revenue opportunities or high operational costs.

Artificial Intelligence

Artificial Intelligence Data Generative AI Innovation

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. For more information, see Create a service role for model import.

Generative AI

Generative AI Artificial Intelligence AWS Serverless

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures. Prompt catalog – Crafting effective prompts is important for guiding large language models (LLMs) to generate the desired outputs. It’s serverless so you don’t have to manage the infrastructure.

Generative AI

Generative AI AWS Enterprise Artificial Intelligence

AI on the mainframe? IBM may be onto something

CIO

OCTOBER 3, 2024

Rather than pull away from big iron in the AI era, Big Blue is leaning into it, with plans in 2025 to release its next-generation Z mainframe, with a Telum II processor and Spyre AI Accelerator Card, positioned to run large language models (LLMs) and machine learning models for fraud detection and other use cases.

Artificial Intelligence

Artificial Intelligence Generative AI Machine Learning Enterprise

Accelerating generative AI requires the right storage

CIO

AUGUST 9, 2023

In generative AI, data is the fuel, storage is the fuel tank and compute is the engine. Organizations need massive amounts of data to build and train generative AI models. In turn, these models will also generate reams of data that elevate organizational insights and productivity.

Generative AI

Generative AI Storage Scalability Technical Review

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

In today’s fast-paced digital landscape, the cloud has emerged as a cornerstone of modern business infrastructure, offering unparalleled scalability, agility, and cost-efficiency. As organizations increasingly migrate to the cloud, however, CIOs face the daunting challenge of navigating a complex and rapidly evolving cloud ecosystem.

Cloud

Cloud Strategy Architecture Policies

Scaling Media Machine Learning at Netflix

Netflix Tech

FEBRUARY 13, 2023

We have been leveraging machine learning (ML) models to personalize artwork and to help our creatives create promotional content efficiently. Media Feature Storage: Amber Storage. Media feature computation tends to be expensive and time-consuming. Why should members care about any particular show that we recommend?

Machine Learning

Machine Learning Artificial Intelligence Media Video

The Reason Many AI and Analytics Projects Fail—and How to Make Sure Yours Doesn’t

CIO

JANUARY 20, 2023

Many companies have been experimenting with advanced analytics and artificial intelligence (AI) to fill this need. Foundational considerations include compute power, memory architecture, as well as data processing, storage, and security. Now, they must turn their proof of concept into a return on investment.

Analytics

Analytics Artificial Intelligence Hardware

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

AWS Machine Learning - AI

MARCH 11, 2025

DeepSeek-R1 is a large language model (LLM) developed by DeepSeek AI that uses reinforcement learning to enhance reasoning capabilities through a multi-stage training process from a DeepSeek-V3-Base foundation. See the following GitHub repo for more deployment examples using TGI, TensorRT-LLM, and Neuron.

Artificial Intelligence

Artificial Intelligence AWS Generative AI Metrics

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

We demonstrate how to harness the power of LLMs to build an intelligent, scalable system that analyzes architecture documents and generates insightful recommendations based on AWS Well-Architected best practices. This scalability allows for more frequent and comprehensive reviews.

Generative AI

Generative AI Technical Review Software Review Systems Review

Databricks acquires AI-centric data governance platform Okera

TechCrunch

MAY 3, 2023

That approach doesn’t work anymore in the age of large language models (LLMs) because the number of assets is growing too quickly (in part because so much of it is machine-generated) and because the overall AI landscape is changing so quickly, standard access controls aren’t able to capture these changes quickly enough.

Government

Government Artificial Intelligence Data CTO Coach

Making the shift from computation to cognition

CIO

JUNE 11, 2024

Once perceived as an abstract concept, Artificial Intelligence (AI) and generative AI (genAI) have become more normalized as organizations look at ways to implement them into their tech stack.

Artificial Intelligence

Artificial Intelligence Technical Review eBook

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

The flexible, scalable nature of AWS services makes it straightforward to continually refine the platform through improvements to the machine learning models and addition of new features. All AWS services are high-performing, secure, scalable, and purpose-built.

Generative AI

Generative AI AWS Groups Artificial Intelligence
