Artificial Inteligence, Performance and Scalability

Revolutionizing data management: Trends driving security, scalability, and governance in 2025

CIO

JANUARY 30, 2025

From data masking technologies that ensure unparalleled privacy to cloud-native innovations driving scalability, these trends highlight how enterprises can balance innovation with accountability. With machine learning, these processes can be refined over time and anomalies can be predicted before they arise.

Scalability

Scalability Government Trends Artificial Inteligence

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Generative and agentic artificial intelligence (AI) are paving the way for this evolution. AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI. The EXLerate.AI

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

AI dominates Gartner’s 2025 predictions

CIO

OCTOBER 22, 2024

Artificial Intelligence continues to dominate this week’s Gartner IT Symposium/Xpo, as well as the research firm’s annual predictions list. “It By 2028, 40% of large enterprises will deploy AI to manipulate and measure employee mood and behaviors, all in the name of profit. “AI AI is evolving as human use of AI evolves. “AI

Artificial Inteligence

Artificial Inteligence Energy Healthcare Technical Review

EXL’s Insurance LLM transforms claims and underwriting

CIO

FEBRUARY 5, 2025

As insurance companies embrace generative AI (genAI) to address longstanding operational inefficiencies, theyre discovering that general-purpose large language models (LLMs) often fall short in solving their unique challenges. Claims adjudication, for example, is an intensive manual process that bogs down insurers.

Artificial Inteligence

Artificial Inteligence Insurance Technical Review Generative AI

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

All industries and modern applications are undergoing rapid transformation powered by advances in accelerated computing, deep learning, and artificial intelligence. The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data. Performance enhancements.

Artificial Inteligence

Artificial Inteligence Engineering Data Storage

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

John Snow Labs

APRIL 2, 2025

The update enables domain experts, such as doctors or lawyers, to evaluate and improve custom-built large language models (LLMs) with precision and transparency. New capabilities include no-code features to streamline the process of auditing and tuning AI models.

Artificial Inteligence

Artificial Inteligence Software Review Generative AI Technical Review

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning - AI

FEBRUARY 12, 2025

Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate humanlike text. Researchers developed Medusa , a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.

Artificial Inteligence

Artificial Inteligence Training AWS Machine Learning

Scaling AI talent: An AI apprenticeship model that works

CIO

NOVEMBER 12, 2024

The hunch was that there were a lot of Singaporeans out there learning about data science, AI, machine learning and Python on their own. Because a lot of Singaporeans and locals have been learning AI, machine learning, and Python on their own. I needed the ratio to be the other way around! And why that role?

Artificial Inteligence

Artificial Inteligence Weak Development Team Training Artificial Intelligence

The Power of Small LLMs in Healthcare: A RAG Framework Alternative to Large Language Models

John Snow Labs

NOVEMBER 8, 2024

By examining JSL’s purpose-built models, including jsl_med_rag_v1 , jsl_meds_rag_q8_v1 , jsl_meds_q8_v3 , and jsl_medm_q8_v2 we demonstrate that even an 8-billion parameter model, when fine-tuned for clinical use, can deliver performance comparable to larger, general-purpose LLMs. The prompt is fed into the LLM.

Artificial Inteligence

Artificial Inteligence Healthcare Case Study Comparison

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

Organizations can use these models securely, and for models that are compatible with the Amazon Bedrock Converse API, you can use the robust toolkit of Amazon Bedrock, including Amazon Bedrock Agents , Amazon Bedrock Knowledge Bases , Amazon Bedrock Guardrails , and Amazon Bedrock Flows.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

For generative AI models requiring multiple instances to handle high-throughput inference requests, this added significant overhead to the total scaling time, potentially impacting application performance during traffic spikes. 70B model showed significant and consistent improvements in end-to-end (E2E) scaling times.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

“Collaborations between public and private organizations will be vital for the UAE to deliver on digital agenda”

CIO

OCTOBER 20, 2024

The partnership is set to trial cutting-edge AI and machine learning solutions while exploring confidential compute technology for cloud deployments. Core42 equips organizations across the UAE and beyond with the infrastructure they need to take advantage of exciting technologies like AI, Machine Learning, and predictive analytics.

Organization

Organization Artificial Inteligence Machine Learning Infrastructure

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represent a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Inteligence Study Generative AI

Eye On AI: As Big Money Rolls Into Data Centers, Startup Investment Gains

Crunchbase News

OCTOBER 17, 2024

The startup uses light to link chips together and to do calculations for the deep learning necessary for AI. The Columbus, Ohio-based company currently has two robotic welding products in the market, both leveraging vision systems, artificial intelligence and machine learning to autonomously weld steel parts.

Data Center

Data Center Artificial Inteligence Data Energy

Using John Snow Labs’ Medical Large Language Models on Azure Fabric

John Snow Labs

FEBRUARY 12, 2025

John Snow Labs’ Medical Language Models library is an excellent choice for leveraging the power of large language models (LLM) and natural language processing (NLP) in Azure Fabric due to its seamless integration, scalability, and state-of-the-art accuracy on medical tasks.

Artificial Inteligence

Artificial Inteligence Azure Healthcare Software Review

What is data architecture? A framework to manage data

CIO

DECEMBER 20, 2024

Invest in core functions that perform data curation such as modeling important relationships, cleansing raw data, and curating key dimensions and measures. AI and machine learning models. According to data platform Acceldata , there are three core principles of data architecture: Scalability.

Architecture

Architecture Data Fractional CTO Technical Review

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services. An agent uses the power of an LLM to determine which function to execute, and output the result based on the prompt guide.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

A blueprint for successfully executing business-aligned IT strategies

CIO

NOVEMBER 21, 2024

Structured frameworks such as the Stakeholder Value Model provide a method for evaluating how IT projects impact different stakeholders, while tools like the Business Model Canvas help map out how technology investments enhance value propositions, streamline operations, and improve financial performance.

Strategy

Strategy Technical Advisors Agile Culture

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

The use of large language models (LLMs) and generative AI has exploded over the last year. With the release of powerful publicly available foundation models, tools for training, fine tuning and hosting your own LLM have also become democratized. We will also talk about performance tuning the inference graph.

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

From automation to transformation: How AI is reshaping business

CIO

MARCH 27, 2025

Are you using artificial intelligence (AI) to do the same things youve always done, just more efficiently? EXL executives and AI practitioners discussed the technologys full potential during the companys recent virtual event, AI in Action: Driving the Shift to Scalable AI. If so, youre only scratching the surface. The EXLerate.AI

Artificial Inteligence

Artificial Inteligence Insurance Generative AI Artificial Intelligence

Cloud analytics migration: how to exceed expectations

CIO

NOVEMBER 19, 2024

A modern data and artificial intelligence (AI) platform running on scalable processors can handle diverse analytics workloads and speed data retrieval, delivering deeper insights to empower strategic decision-making. Intel’s cloud-optimized hardware accelerates AI workloads, while SAS provides scalable, AI-driven solutions.

Analytics

Analytics Cloud How To Scalability

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Although batch inference offers numerous benefits, it’s limited to 10 batch inference jobs submitted per model per Region. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. Access to your selected models hosted on Amazon Bedrock.

Scalability

Scalability Lambda Generative AI AWS

Multiclass Text Classification Using LLM (MTC-LLM): A Comprehensive Guide

Perficient

NOVEMBER 20, 2024

Introduction to Multiclass Text Classification with LLMs Multiclass text classification (MTC) is a natural language processing (NLP) task where text is categorized into multiple predefined categories or classes. Traditional approaches rely on training machine learning models, requiring labeled data and iterative fine-tuning.

Artificial Inteligence

Artificial Inteligence Metrics Airlines Travel

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

For some content, additional screening is performed to generate subtitles and captions. As DPG Media grows, they need a more scalable way of capturing metadata that enhances the consumer experience on online video services and aids in understanding key content characteristics.

Media

Media Video Artificial Inteligence Generative AI

AI Cloud Startup Vultr Raises $333M At $3.5B In First Outside Funding Round

Crunchbase News

DECEMBER 18, 2024

West Palm Beach, Florida-based Vultr says it plans to use the new capital to acquire more graphics processing units, or GPUs, which are in hot demand to power large language models. Along with rivals Nvidia and Intel , AMD and its venture arm have been active investors in startup funding deals this year for AI-related companies.

Artificial Inteligence

Artificial Inteligence Cloud Journal Fintech

Fixie wants to make it easier for companies to build on top of language models

TechCrunch

MARCH 30, 2023

Co-founder and CEO Matt Welsh describes it as the first enterprise-focused platform-as-a-service for building experiences with large language models (LLMs). “The core of Fixie is its LLM-powered agents that can be built by anyone and run anywhere.” Fixie agents can interact with databases, APIs (e.g.

Artificial Inteligence

Artificial Inteligence Company ChatGPT Generative AI

Foundation Model vs LLM: Choosing the Best AI Model

Openxcell

JANUARY 20, 2025

Have you ever imagined how artificial intelligence has changed our lives and the way businesses function? The rise of AI models, such as the foundation model and LLM, which offer massive automation and creativity, has made this possible. It ultimately increases the performance and versatility. What are LLMs?

Artificial Inteligence

Artificial Inteligence Generative AI Training Architecture

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. The Streamlit application will now display a button labeled Get LLM Response.

Generative AI

Generative AI AWS Artificial Inteligence Applications

EBSCOlearning scales assessment generation for their online learning content with generative AI

AWS Machine Learning - AI

DECEMBER 11, 2024

EBSCOlearning, a leader in the realm of online learning, recognized this need and embarked on an ambitious journey to transform their assessment creation process using cutting-edge generative AI technology. The evaluation process includes three phases: LLM-based guideline evaluation, rule-based checks, and a final evaluation.

Generative AI

Generative AI Artificial Inteligence Guidelines Education

Unlocking the full potential of enterprise AI

CIO

JANUARY 5, 2025

According to PwC, organizations can experience incremental value at scale through AI, with 20% to 30% gains in productivity, speed to market, and revenue, on top of big leaps such as new business models. [2] AI in action The benefits of this approach are clear to see.

Enterprise

Enterprise Generative AI Weak Development Team Technical Review

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Bodo.ai secures $14M, aims to make Python better at handling large-scale data

TechCrunch

AUGUST 25, 2021

Python is one of the top programming languages used among artificial intelligence and machine learning developers and data scientists, but as Behzad Nasre, co-founder and CEO of Bodo.ai, points out, it is challenging to use when handling large-scale data.

Artificial Inteligence

Artificial Inteligence Machine Learning Data Artificial Intelligence

Data trends in 2025

Xebia

FEBRUARY 23, 2025

The first agents to emerge are expected to perform small, structured internal tasks with some degree of fault-tolerance, such as helping to change passwords on IT systems or book vacation time on HR platforms. The real challenge in 2025 is using AI effectively and responsibly, which is where LLMOps (LLM Operations) comes in.

Trends

Trends Data Artificial Inteligence Weak Development Team

Fast-Tracking Custom LLMs Using vLLM

InnovationM

APRIL 15, 2025

At InnovationM, we are constantly searching for tools and technologies that can drive the performance and scalability of our AI-driven products. Recently, we made progress with vLLM, a high-performance model inference engine designed to deploy Large Language Models (LLMs) more efficiently.

Artificial Inteligence

Artificial Inteligence Performance Training Scalability

OpenAI’s new tool attempts to explain language models’ behaviors

TechCrunch

MAY 9, 2023

It’s often said that large language models (LLMs) along the lines of OpenAI’s ChatGPT are a black box, and certainly, there’s some truth to that. Even for data scientists, it’s difficult to know why, always, a model responds in the way it does, like inventing facts out of whole cloth.

Artificial Inteligence

Artificial Inteligence Tools Weak Development Team Open Source

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Foundation Model for Personalized Recommendation

Netflix Tech

MARCH 28, 2025

By Ko-Jen Hsiao , Yesu Feng and Sudarshan Lamkhede Motivation Netflixs personalized recommender system is a complex system, boasting a variety of specialized machine learned models each catering to distinct needs including Continue Watching and Todays Top Picks for You. Refer to our recent overview for more details).

Artificial Inteligence

Artificial Inteligence Systems Review Training Windows

Bridging the IT skills gap, Part 1: Assessing current strategies and introducing GenAI as a unified solution

CIO

JANUARY 14, 2025

The gap between emerging technological capabilities and workforce skills is widening, and traditional approaches such as hiring specialized professionals or offering occasional training are no longer sufficient as they often lack the scalability and adaptability needed for long-term success.

Technical Advisors

Technical Advisors Strategy Training Research

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

Technology leaders in the financial services sector constantly struggle with the daily challenges of balancing cost, performance, and security the constant demand for high availability means that even a minor system outage could lead to significant financial and reputational losses. Scalability. Scalability. Cost forecasting.

Serverless

Serverless Architecture Microservices Scalability

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

CIO

APRIL 15, 2025

During his one hour forty minute-keynote, Thomas Kurian, CEO of Google Cloud showcased updates around most of the companys offerings, including new large language models (LLMs) , a new AI accelerator chip, new open source frameworks around agents, and updates to its data analytics, databases, and productivity tools and services among others.

Cloud

Cloud Innovation Artificial Inteligence Google Cloud

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

This visibility is essential for setting accurate pricing for generative AI offerings, implementing chargebacks, and establishing usage-based billing models. Without a scalable approach to controlling costs, organizations risk unbudgeted usage and cost overruns. However, there are considerations to keep in mind.

Generative AI

Generative AI AWS Artificial Inteligence Budget

EXL Code Harbor streamlines platform migration, data governance, and workflow assessment

CIO

FEBRUARY 18, 2025

But in many cases, the prospect of migrating to modern cloud native, open source languages 1 seems even worse. Artificial intelligence (AI) tools have emerged to help, but many businesses fear they will expose their intellectual property, hallucinate errors or fail on large codebases because of their prompt limits.

Software Review

Software Review Artificial Inteligence Government Data

Navigating the future of national tech independence with sovereign AI

CIO

MARCH 31, 2025

Sovereign AI refers to a national or regional effort to develop and control artificial intelligence (AI) systems, independent of the large non-EU foreign private tech platforms that currently dominate the field. high-performance computing GPU), data centers, and energy.

Technical Review

Technical Review Artificial Inteligence Compliance Open Source

Revolutionizing data management: Trends driving security, scalability, and governance in 2025

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

AI in action: Stories of how enterprises are transforming and modernizing

Webinars

AI dominates Gartner’s 2025 predictions

EXL’s Insurance LLM transforms claims and underwriting

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Scaling AI talent: An AI apprenticeship model that works

The Power of Small LLMs in Healthcare: A RAG Framework Alternative to Large Language Models

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

“Collaborations between public and private organizations will be vital for the UAE to deliver on digital agenda”

Model customization, RAG, or both: A case study with Amazon Nova

Eye On AI: As Big Money Rolls Into Data Centers, Startup Investment Gains

Using John Snow Labs’ Medical Large Language Models on Azure Fabric

What is data architecture? A framework to manage data

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

A blueprint for successfully executing business-aligned IT strategies

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

From automation to transformation: How AI is reshaping business

Cloud analytics migration: how to exceed expectations

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

Multiclass Text Classification Using LLM (MTC-LLM): A Comprehensive Guide

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AI Cloud Startup Vultr Raises $333M At $3.5B In First Outside Funding Round

Fixie wants to make it easier for companies to build on top of language models

Foundation Model vs LLM: Choosing the Best AI Model

Build and deploy a UI for your generative AI applications with AWS and Python

EBSCOlearning scales assessment generation for their online learning content with generative AI

Unlocking the full potential of enterprise AI

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Bodo.ai secures $14M, aims to make Python better at handling large-scale data

Data trends in 2025

Fast-Tracking Custom LLMs Using vLLM

OpenAI’s new tool attempts to explain language models’ behaviors

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Foundation Model for Personalized Recommendation

Bridging the IT skills gap, Part 1: Assessing current strategies and introducing GenAI as a unified solution

Can serverless fix fintech’s scaling problem?

Google’s AI innovations at Cloud Next 2025: What CIOs need to know

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

EXL Code Harbor streamlines platform migration, data governance, and workflow assessment

Navigating the future of national tech independence with sovereign AI

Stay Connected