Artificial Inteligence and Scalability

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Revolutionizing data management: Trends driving security, scalability, and governance in 2025

CIO

JANUARY 30, 2025

From data masking technologies that ensure unparalleled privacy to cloud-native innovations driving scalability, these trends highlight how enterprises can balance innovation with accountability. With machine learning, these processes can be refined over time and anomalies can be predicted before they arise.

Scalability

Scalability Government Trends Artificial Inteligence

AI in action: How enterprises are scaling AI for real business impact

CIO

MARCH 11, 2025

To capitalize on the enormous potential of artificial intelligence (AI) enterprises need systems purpose-built for industry-specific workflows. Enterprise technology leaders discussed these issues and more while sharing real-world examples during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI.

Artificial Inteligence

Artificial Inteligence Enterprise Artificial Intelligence Insurance

Webinars

Going Beyond Chatbots: Connecting AI to Your Tools, Systems, & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

MORE WEBINARS

EXL’s Insurance LLM transforms claims and underwriting

CIO

FEBRUARY 5, 2025

As insurance companies embrace generative AI (genAI) to address longstanding operational inefficiencies, theyre discovering that general-purpose large language models (LLMs) often fall short in solving their unique challenges. Claims adjudication, for example, is an intensive manual process that bogs down insurers.

Artificial Inteligence

Artificial Inteligence Insurance Technical Review Generative AI

MLOps 101: The Foundation for Your AI Strategy

Advertiser: Data Robot

Many organizations are dipping their toes into machine learning and artificial intelligence (AI). Download this comprehensive guide to learn: What is MLOps? How can MLOps tools deliver trusted, scalable, and secure infrastructure for machine learning projects?

Artificial Inteligence

AI in action: Stories of how enterprises are transforming and modernizing

CIO

MARCH 20, 2025

Generative and agentic artificial intelligence (AI) are paving the way for this evolution. AI practitioners and industry leaders discussed these trends, shared best practices, and provided real-world use cases during EXLs recent virtual event, AI in Action: Driving the Shift to Scalable AI. The EXLerate.AI

Artificial Inteligence

Artificial Inteligence Enterprise Insurance Artificial Intelligence

AI dominates Gartner’s 2025 predictions

CIO

OCTOBER 22, 2024

Artificial Intelligence continues to dominate this week’s Gartner IT Symposium/Xpo, as well as the research firm’s annual predictions list. “It It is clear that no matter where we go, we cannot avoid the impact of AI,” Daryl Plummer, distinguished vice president analyst, chief of research and Gartner Fellow told attendees. “AI

Artificial Inteligence

Artificial Inteligence Energy Healthcare Technical Review

Top 11 LLM Tools That Ensure Smooth LLM Operations

Openxcell

JANUARY 20, 2025

LLM or large language models are deep learning models trained on vast amounts of linguistic data so they understand and respond in natural language (human-like texts). These encoders and decoders help the LLM model contextualize the input data and, based on that, generate appropriate responses.

Artificial Inteligence

Artificial Inteligence Tools Open Source Architecture

The key to operational AI: Modern data architecture

CIO

NOVEMBER 27, 2024

Recent research shows that 67% of enterprises are using generative AI to create new content and data based on learned patterns; 50% are using predictive AI, which employs machine learning (ML) algorithms to forecast future events; and 45% are using deep learning, a subset of ML that powers both generative and predictive models.

Architecture

Architecture Artificial Inteligence Data Development Team Review

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence.

Generative AI

Building a Scalable ML Pipeline and API in AWS

Dzone - DevOps

MARCH 28, 2025

With rapid progress in the fields of machine learning (ML) and artificial intelligence (AI), it is important to deploy the AI/ML model efficiently in production environments. The architecture downstream ensures scalability, cost efficiency, and real-time access to applications.

Scalability

Scalability Artificial Inteligence AWS Artificial Intelligence

Faster, Better, Cheaper: How to Measure the Business Impact of LLMs

Xebia

APRIL 16, 2025

Understanding the Value Proposition of LLMs Large Language Models (LLMs) have quickly become a powerful tool for businesses, but their true impact depends on how they are implemented. The key is determining where LLMs provide value without sacrificing business-critical quality.

Artificial Inteligence

Artificial Inteligence Systems Review How To eCommerce

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

All industries and modern applications are undergoing rapid transformation powered by advances in accelerated computing, deep learning, and artificial intelligence. The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data. Performance enhancements.

Artificial Inteligence

Artificial Inteligence Engineering Data Storage

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

John Snow Labs

APRIL 2, 2025

The update enables domain experts, such as doctors or lawyers, to evaluate and improve custom-built large language models (LLMs) with precision and transparency. New capabilities include no-code features to streamline the process of auditing and tuning AI models.

Artificial Inteligence

Artificial Inteligence Software Review Generative AI Technical Review

From legacy to lakehouse: Centralizing insurance data with Delta Lake

CIO

APRIL 23, 2025

Modern AI models, particularly large language models, frequently require real-time data processing capabilities. The machine learning models would target and solve for one use case, but Gen AI has the capability to learn and address multiple use cases at scale.

Insurance

Insurance Artificial Inteligence Data Architecture

CAIOs are stepping out from the CIO’s shadow

CIO

MARCH 14, 2025

But the increase in use of intelligent tools in recent years since the arrival of generative AI has begun to cement the CAIO role as a key tech executive position across a wide range of sectors. The role of artificial intelligence is very closely tied to generating efficiencies on an ongoing basis, as well as implying continuous adoption.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Government Generative AI

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning - AI

FEBRUARY 12, 2025

Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate humanlike text. Researchers developed Medusa , a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.

Artificial Inteligence

Artificial Inteligence Training AWS Machine Learning

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning - AI

DECEMBER 4, 2024

Organizations can use these models securely, and for models that are compatible with the Amazon Bedrock Converse API, you can use the robust toolkit of Amazon Bedrock, including Amazon Bedrock Agents , Amazon Bedrock Knowledge Bases , Amazon Bedrock Guardrails , and Amazon Bedrock Flows. You can find him on LinkedIn.

Artificial Inteligence

Artificial Inteligence Microservices Generative AI AWS

Gartner projects major IT spending increases for 2025

CIO

OCTOBER 24, 2024

TRECIG, a cybersecurity and IT consulting firm, will spend more on IT in 2025 as it invests more in advanced technologies such as artificial intelligence, machine learning, and cloud computing, says Roy Rucker Sr., CEO and president there. The company will still prioritize IT innovation, however.

Data Center

Data Center Artificial Inteligence Generative AI Artificial Intelligence

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

In this post, we explore the new Container Caching feature for SageMaker inference, addressing the challenges of deploying and scaling large language models (LLMs). You’ll learn about the key benefits of Container Caching, including faster scaling, improved resource utilization, and potential cost savings.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

The AI Future According to Google Cloud Next ’25: My Interesting Finds

Xebia

APRIL 17, 2025

It also supports the newly announced Agent 2 Agent (A2A) protocol which Google is positioning as an open, secure standard for agent-agent collaboration, driven by a large community of Technology, Platform and Service partners. Native Multi-Agent Architecture: Build scalable applications by composing specialized agents in a hierarchy.

Google Cloud

Google Cloud Artificial Inteligence Cloud Video

The Power of Small LLMs in Healthcare: A RAG Framework Alternative to Large Language Models

John Snow Labs

NOVEMBER 8, 2024

Our results indicate that, for specialized healthcare tasks like answering clinical questions or summarizing medical research, these smaller models offer both efficiency and high relevance, positioning them as an effective alternative to larger counterparts within a RAG setup. The prompt is fed into the LLM.

Artificial Inteligence

Artificial Inteligence Healthcare Case Study Comparison

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

The introduction of Amazon Nova models represent a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.

Case Study

Case Study Artificial Inteligence Study Generative AI

Scaling AI talent: An AI apprenticeship model that works

CIO

NOVEMBER 12, 2024

The hunch was that there were a lot of Singaporeans out there learning about data science, AI, machine learning and Python on their own. Because a lot of Singaporeans and locals have been learning AI, machine learning, and Python on their own. I needed the ratio to be the other way around! And why that role?

Artificial Inteligence

Artificial Inteligence Weak Development Team Training Artificial Intelligence

AI brings order to observability disorder

CIO

APRIL 16, 2025

Artificial intelligence has contributed to complexity. Businesses now want to monitor large language models as well as applications to spot anomalies that may contribute to inaccuracies,bias, and slow performance. Support for a wide range of large language models in the cloud and on premises.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Analysis Banking

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

The use of large language models (LLMs) and generative AI has exploded over the last year. With the release of powerful publicly available foundation models, tools for training, fine tuning and hosting your own LLM have also become democratized. model. , "temperature":0, "max_tokens": 128}' | jq '.choices[0].text'

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

APRIL 23, 2025

National Laboratory has implemented an AI-driven document processing platform that integrates named entity recognition (NER) and large language models (LLMs) on Amazon SageMaker AI. In this post, we discuss how you can build an AI-powered document processing platform with open source NER and LLMs on SageMaker.

Artificial Inteligence

Artificial Inteligence Open Source AWS Serverless

Reimagine application modernisation with the power of generative AI

CIO

JANUARY 15, 2025

2] The myriad potential of GenAI enables enterprises to simplify coding and facilitate more intelligent and automated system operations. By leveraging large language models and platforms like Azure Open AI, for example, organisations can transform outdated code into modern, customised frameworks that support advanced features.

Generative AI

Generative AI Applications Artificial Inteligence Azure

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services. An agent uses the power of an LLM to determine which function to execute, and output the result based on the prompt guide.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

AWS Machine Learning - AI

APRIL 21, 2025

In this blog post, we discuss how Prompt Optimization improves the performance of large language models (LLMs) for intelligent text processing task in Yuewen Group. Evolution from Traditional NLP to LLM in Intelligent Text Processing Yuewen Group leverages AI for intelligent analysis of extensive web novel texts.

Artificial Inteligence

Artificial Inteligence Groups Applications Innovation

EBSCOlearning scales assessment generation for their online learning content with generative AI

AWS Machine Learning - AI

DECEMBER 11, 2024

This pipeline is illustrated in the following figure and consists of several key components: QA generation, multifaceted evaluation, and intelligent revision. The evaluation process includes three phases: LLM-based guideline evaluation, rule-based checks, and a final evaluation. Sonnet in Amazon Bedrock.

Generative AI

Generative AI Artificial Inteligence Guidelines Education

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AWS Machine Learning - AI

OCTOBER 29, 2024

Although batch inference offers numerous benefits, it’s limited to 10 batch inference jobs submitted per model per Region. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. This automatically deletes the deployed stack.

Scalability

Scalability Lambda Generative AI AWS

AI market evolution: Data and infrastructure transformation through AI

CIO

NOVEMBER 4, 2024

Artificial Intelligence (AI), a term once relegated to science fiction, is now driving an unprecedented revolution in business technology. AI applications rely heavily on secure data, models, and infrastructure. From nimble start-ups to global powerhouses, businesses are hailing AI as the next frontier of digital transformation.

Infrastructure

Infrastructure Marketing Data Artificial Inteligence

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning - AI

NOVEMBER 6, 2024

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those with expertise primarily in data science and machine learning. The Streamlit application will now display a button labeled Get LLM Response.

Generative AI

Generative AI AWS Artificial Inteligence Applications

Eye On AI: As Big Money Rolls Into Data Centers, Startup Investment Gains

Crunchbase News

OCTOBER 17, 2024

The startup uses light to link chips together and to do calculations for the deep learning necessary for AI. The Columbus, Ohio-based company currently has two robotic welding products in the market, both leveraging vision systems, artificial intelligence and machine learning to autonomously weld steel parts.

Data Center

Data Center Artificial Inteligence Data Energy

CIO hiring on the rise: How to land a top tech exec role in 2025

CIO

FEBRUARY 25, 2025

CIOs who bring real credibility to the conversation understand that AI is an output of a well architected, well managed, scalable set of data platforms, an operating model, and a governance model. CIOs have shared that in every meeting, people are enamored with AI and gen AI.

Technical Review

Technical Review Artificial Inteligence How To Recruiting

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

As DPG Media grows, they need a more scalable way of capturing metadata that enhances the consumer experience on online video services and aids in understanding key content characteristics. The following were some initial challenges in automation: Language diversity – The services host both Dutch and English shows.

Media

Media Video Artificial Inteligence Generative AI

From LLM Mess to LLM Mesh: Building Scalable AI Applications

Dataiku

JANUARY 7, 2025

At Dataiku Everyday AI events in Dallas, Toronto, London, Berlin, and Dubai this past fall, we talked about an architecture paradigm for LLM-powered applications: an LLM Mesh. What actually is an LLM Mesh? How does it help organizations scale up the development and delivery of LLM-powered applications?

Artificial Inteligence

Artificial Inteligence Applications Scalability Architecture

From automation to transformation: How AI is reshaping business

CIO

MARCH 27, 2025

Are you using artificial intelligence (AI) to do the same things youve always done, just more efficiently? EXL executives and AI practitioners discussed the technologys full potential during the companys recent virtual event, AI in Action: Driving the Shift to Scalable AI. If so, youre only scratching the surface. The EXLerate.AI

Artificial Inteligence

Artificial Inteligence Insurance Generative AI Artificial Intelligence

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

Out-of-the-box models often lack the specific knowledge required for certain domains or organizational terminologies. To address this, businesses are turning to custom fine-tuned models, also known as domain-specific large language models (LLMs). You have the option to quantize the model.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Fixie wants to make it easier for companies to build on top of language models

TechCrunch

MARCH 30, 2023

Co-founder and CEO Matt Welsh describes it as the first enterprise-focused platform-as-a-service for building experiences with large language models (LLMs). “The core of Fixie is its LLM-powered agents that can be built by anyone and run anywhere.” Fixie agents can interact with databases, APIs (e.g.

Artificial Inteligence

Artificial Inteligence Company ChatGPT Generative AI

Dubai and the UAE partner with Google to reshape the digital future

CIO

NOVEMBER 10, 2024

Sheikh Hamdan highlighted that partnerships with global leaders like Google are integral to this goal, enabling the city to set new standards in technology and develop scalable solutions that serve international markets.

Tourism

Tourism Artificial Intelligence Artificial Inteligence Policies

“Collaborations between public and private organizations will be vital for the UAE to deliver on digital agenda”

CIO

OCTOBER 20, 2024

The partnership is set to trial cutting-edge AI and machine learning solutions while exploring confidential compute technology for cloud deployments. Core42 equips organizations across the UAE and beyond with the infrastructure they need to take advantage of exciting technologies like AI, Machine Learning, and predictive analytics.

Organization

Organization Machine Learning Artificial Inteligence Infrastructure

Multi-LLM routing strategies for generative AI applications on AWS

Revolutionizing data management: Trends driving security, scalability, and governance in 2025

Webinars

Trending Sources

AI in action: How enterprises are scaling AI for real business impact

Webinars

EXL’s Insurance LLM transforms claims and underwriting

MLOps 101: The Foundation for Your AI Strategy

AI in action: Stories of how enterprises are transforming and modernizing

AI dominates Gartner’s 2025 predictions

Top 11 LLM Tools That Ensure Smooth LLM Operations

The key to operational AI: Modern data architecture

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Building a Scalable ML Pipeline and API in AWS

Faster, Better, Cheaper: How to Measure the Business Impact of LLMs

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

John Snow Labs Releases Generative AI Lab 7.0 to Help Domain Experts Evaluate and Improve LLM Applications and Conduct HCC Coding Reviews

From legacy to lakehouse: Centralizing insurance data with Delta Lake

CAIOs are stepping out from the CIO’s shadow

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Gartner projects major IT spending increases for 2025

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

The AI Future According to Google Cloud Next ’25: My Interesting Finds

The Power of Small LLMs in Healthcare: A RAG Framework Alternative to Large Language Models

Model customization, RAG, or both: A case study with Amazon Nova

Scaling AI talent: An AI apprenticeship model that works

AI brings order to observability disorder

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Reimagine application modernisation with the power of generative AI

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group

EBSCOlearning scales assessment generation for their online learning content with generative AI

Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

AI market evolution: Data and infrastructure transformation through AI

Build and deploy a UI for your generative AI applications with AWS and Python

Eye On AI: As Big Money Rolls Into Data Centers, Startup Investment Gains

CIO hiring on the rise: How to land a top tech exec role in 2025

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

From LLM Mess to LLM Mesh: Building Scalable AI Applications

From automation to transformation: How AI is reshaping business

Host concurrent LLMs with LoRAX

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Fixie wants to make it easier for companies to build on top of language models

Dubai and the UAE partner with Google to reshape the digital future

“Collaborations between public and private organizations will be vital for the UAE to deliver on digital agenda”

Stay Connected