How do companies decide which large language model (LLM) is right for them? Beneath the glossy surface of advertising promises lurks a crucial question: which of these technologies really delivers what it promises, and which are more likely to cause AI projects to falter?
By making tool integration simpler and standardized, customers building agents can now focus on which tools to use and how to use them, rather than spending cycles building custom integration code. Amazon SageMaker AI provides the ability to host LLMs without worrying about scaling or managing the undifferentiated heavy lifting.
Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.
Generative artificial intelligence (gen AI), and in particular large language models (LLMs), are changing the way companies develop and deliver software. These autoregressive models can ultimately process anything that can be easily broken down into tokens: images, video, sound, and even proteins.
Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage
Executive leaders and board members are pushing their teams to adopt generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. Save your seat and register today! 📆 June 4th 2024 at 11:00am PDT, 2:00pm EDT, 7:00pm BST
The emergence of generative AI has ushered in a new era of possibilities, enabling the creation of human-like text, images, code, and more. Solution overview: For this solution, you deploy a demo application that provides a clean and intuitive UI for interacting with a generative AI model, as illustrated in the following screenshot.
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. A 70B model showed significant and consistent improvements in end-to-end (E2E) scaling times.
Artificial intelligence (AI), and particularly large language models (LLMs), have significantly transformed the search engine as we’ve known it. With generative AI and LLMs, new avenues for improving operational efficiency and user satisfaction are emerging every day.
While organizations continue to discover the powerful applications of generative AI, adoption is often slowed down by team silos and bespoke workflows. To move faster, enterprises need robust operating models and a holistic approach that simplifies the generative AI lifecycle.
Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage
In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.
In this blog post, we demonstrate prompt engineering techniques to generate accurate and relevant analysis of tabular data using industry-specific language. This is done by providing large language models (LLMs) with in-context sample data, containing features and labels, in the prompt.
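As a rough sketch of that idea, labeled sample rows can be embedded directly in the prompt so the LLM can infer the pattern; the client call is omitted, and the feature names, values, and labels below are invented for illustration:

```python
# Hypothetical labeled sample rows embedded in-context (invented data).
SAMPLE_ROWS = [
    {"revenue_growth": "12%", "churn_rate": "2.1%", "label": "healthy"},
    {"revenue_growth": "-4%", "churn_rate": "9.8%", "label": "at-risk"},
]

def build_prompt(row):
    """Build a few-shot prompt: labeled examples first, then the unlabeled row."""
    lines = ["Classify each account using the labeled examples below.", ""]
    for ex in SAMPLE_ROWS:
        lines.append(
            f"revenue_growth={ex['revenue_growth']} "
            f"churn_rate={ex['churn_rate']} -> {ex['label']}"
        )
    lines.append("")
    # The trailing arrow invites the model to complete the label.
    lines.append(
        f"revenue_growth={row['revenue_growth']} "
        f"churn_rate={row['churn_rate']} ->"
    )
    return "\n".join(lines)

prompt = build_prompt({"revenue_growth": "3%", "churn_rate": "7.5%"})
```

The resulting string would then be sent to the LLM of your choice; only the prompt construction is shown here.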
As enterprises increasingly embrace generative AI, they face challenges in managing the associated costs. With demand for generative AI applications surging across projects and multiple lines of business, accurately allocating and tracking spend becomes more complex.
In this post, we explore a generative AI solution leveraging Amazon Bedrock to streamline the WAFR process. We demonstrate how to harness the power of LLMs to build an intelligent, scalable system that analyzes architecture documents and generates insightful recommendations based on AWS Well-Architected best practices.
Furthermore, these notes are usually personal and not stored in a central location, which is a lost opportunity for businesses to learn what does and doesn’t work, as well as how to improve their sales, purchasing, and communication processes.
Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage
When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale.
Recently, we’ve been witnessing the rapid development and evolution of generative AI applications, with observability and evaluation emerging as critical aspects for developers, data scientists, and stakeholders. In the context of Amazon Bedrock, observability and evaluation become even more crucial.
AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. In the following sections, we explain how to deploy this architecture.
The rise of large language models (LLMs) and foundation models (FMs) has revolutionized the field of natural language processing (NLP) and artificial intelligence (AI). You can find instructions on how to do this in the AWS documentation for your chosen SDK.
Have you ever stumbled upon a breathtaking travel photo and instantly wondered where it was and how to get there? Each one of these millions of travelers needs to plan where they’ll stay, what they’ll see, and how they’ll get from place to place. It’s like having your own personal travel agent whenever you need it.
Technology professionals developing generative AI applications are finding that there are big leaps from POCs and MVPs to production-ready applications. However, during development – and even more so once deployed to production – best practices for operating and improving generative AI applications are less understood.
Like many innovative companies, Camelot looked to artificial intelligence for a solution. The result is Myrddin, an AI-based cyber wizard that provides answers and guidance to IT teams undergoing CMMC assessments. To address compliance fatigue, Camelot began work on its AI wizard in 2023.
Principal wanted to use existing internal FAQs, documentation, and unstructured data to build an intelligent chatbot that could provide quick access to the right information for different roles. Adherence to responsible and ethical AI practices was a priority for Principal.
However, as the reach of live streams expands globally, language barriers and accessibility challenges have emerged, limiting the ability of viewers to fully comprehend and participate in these immersive experiences. To learn more about how to build and scale generative AI applications, refer to Transform your business with generative AI.
As generative AI revolutionizes industries, organizations are eager to harness its potential. This post explores key insights and lessons learned from AWS customers in Europe, the Middle East, and Africa (EMEA) who have successfully navigated this transition, providing a roadmap for others looking to follow suit.
Speaker: Tony Karrer, Ryan Barker, Grant Wiles, Zach Asman, & Mark Pace
Join our exclusive webinar with top industry visionaries, where we'll explore the latest innovations in artificial intelligence and the incredible potential of LLMs. We'll walk through two compelling case studies that showcase how AI is reimagining industries and revolutionizing the way we interact with technology.
Generative AI is poised to disrupt nearly every industry, and IT professionals with highly sought-after gen AI skills are in high demand, as companies seek to harness the technology for various digital and operational initiatives.
The introduction of Amazon Nova models represents a significant advancement in the field of AI, offering new opportunities for large language model (LLM) optimization. In this post, we demonstrate how to effectively perform model customization and RAG with Amazon Nova models as a baseline.
Hi, I am a professor of cognitive science and design at UC San Diego, and I recently wrote posts on Radar about my experiences coding with and speaking to generative AI tools like ChatGPT. In particular, they're great at generating and explaining small pieces of self-contained code (e.g.,
The NVIDIA Nemotron family, available as NVIDIA NIM microservices, offers a cutting-edge suite of language models now available through Amazon Bedrock Marketplace, marking a significant milestone in AI model accessibility and deployment. About the authors: James Park is a Solutions Architect at Amazon Web Services.
Speaker: Shyvee Shi - Product Lead and Learning Instructor at LinkedIn
In the rapidly evolving landscape of artificial intelligence, generative AI products stand at the cutting edge. This presentation unveils a comprehensive 7-step framework designed to navigate the complexities of developing, launching, and scaling generative AI products.
Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. Building a generative AI application: SageMaker Unified Studio offers tools to discover and build with generative AI.
While most provisions of the EU AI Act come into effect at the end of a two-year transition period ending in August 2026, some of them enter force as early as February 2, 2025. Inform, educate, and simplify are the key words, and that's what the AI Pact is for.
Companies across all industries are harnessing the power of generative AI to address various use cases. Cloud providers have recognized the need to offer model inference through an API call, significantly streamlining the implementation of AI within applications.
Retrieval Augmented Generation (RAG) has become a crucial technique for improving the accuracy and relevance of AI-generated responses. The effectiveness of RAG heavily depends on the quality of context provided to the large language model (LLM), which is typically retrieved from vector stores based on user queries.
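The retrieval step can be sketched minimally as nearest-neighbor search over embeddings. This toy version uses a hand-written in-memory store with 3-dimensional vectors; a real system would use an embedding model and a vector database:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_vec, store, k=2):
    """Return the texts of the k documents most similar to the query vector."""
    ranked = sorted(
        store, key=lambda d: cosine(query_vec, d["embedding"]), reverse=True
    )
    return [d["text"] for d in ranked[:k]]

# Invented documents with hand-written embeddings (stand-ins for a real store).
store = [
    {"text": "Reset your password from the account page.", "embedding": [0.9, 0.1, 0.0]},
    {"text": "Our offices are closed on public holidays.", "embedding": [0.0, 0.2, 0.9]},
    {"text": "Password rules require 12 characters.", "embedding": [0.8, 0.3, 0.1]},
]

# A query vector close to the password-related documents.
context = retrieve([1.0, 0.2, 0.0], store, k=2)
```

The retrieved `context` strings would then be prepended to the LLM prompt; the quality of that context is exactly what the excerpt above says RAG depends on.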
Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate human-like text. Researchers developed Medusa, a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.
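The accept-or-reject idea behind such speedups can be illustrated with a toy draft-and-verify loop. This is a conceptual sketch of speculative decoding in general, not Medusa's actual head architecture, and both "models" here are deterministic stand-in functions:

```python
def target_next(context):
    """Stand-in for the base LLM's greedy next token (toy, deterministic)."""
    vocab = ["the", "cat", "sat", "down"]
    return vocab[len(context) % len(vocab)]

def draft_tokens(context, n=3):
    """Stand-in for cheap draft heads guessing n tokens ahead.

    The third guess is deliberately wrong to show rejection."""
    guesses = []
    ctx = list(context)
    for i in range(n):
        tok = target_next(ctx) if i < 2 else "dog"
        guesses.append(tok)
        ctx.append(tok)
    return guesses

def speculative_step(context):
    """Accept drafted tokens only while the target model agrees with them."""
    drafted = draft_tokens(context)
    accepted = []
    ctx = list(context)
    for tok in drafted:
        if target_next(ctx) == tok:
            accepted.append(tok)
            ctx.append(tok)
        else:
            break  # first mismatch: stop accepting drafted tokens
    accepted.append(target_next(ctx))  # target always contributes one token
    return accepted
```

Because the target model verifies several drafted tokens in one pass, each step can emit more than one token; here the first two drafts are accepted, the wrong third is rejected, and the target supplies its own token instead.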
“We’re doing two things,” he says. One is going through the big areas where we have operational services and looking at every process to be optimized using artificial intelligence and large language models. And the second is deploying what we call LLM Suite to almost every employee.
At the forefront of using generative AI in the insurance industry, Verisk's generative AI-powered solutions, like Mozart, remain rooted in ethical and responsible AI use. Security and governance: Generative AI is a very new technology and brings with it new challenges related to security and compliance.
The use of large language models (LLMs) and generative AI has exploded over the last year. With the release of powerful publicly available foundation models, tools for training, fine-tuning, and hosting your own LLM have also become democratized.
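One of the knobs exposed when hosting your own LLM is the `top_p` sampling parameter. A minimal sketch of the nucleus (top-p) filtering it refers to, applied to a toy probability table:

```python
def top_p_filter(probs, top_p=0.95):
    """Keep the smallest set of tokens whose cumulative probability reaches
    top_p, then renormalize; sampling is restricted to this 'nucleus'."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept, total = [], 0.0
    for tok, p in ranked:
        kept.append((tok, p))
        total += p
        if total >= top_p:
            break
    return {tok: p / total for tok, p in kept}

# Toy next-token distribution (invented); with top_p=0.9 the tail token "d"
# is excluded and the remaining mass is renormalized.
nucleus = top_p_filter({"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}, top_p=0.9)
```

A sampler would then draw the next token from `nucleus` instead of the full vocabulary, which trims low-probability tails while preserving diversity among likely tokens.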
I got to deliver a session on a topic I’m very passionate about: using different forms of generative AI to generate self-guided meditation sessions. Well, here’s the first paragraph of the abstract: In an era where technology and mindfulness intersect, the power of AI is reshaping how we approach app development.
ICYMI the first time around, check out this roundup of data points, tips and trends about secure AI deployment; shadow AI; AI threat detection; AI risks; AI governance; AI cybersecurity uses — and more. ICYMI, here are six things that’ll help you better understand AI security.
The road ahead for IT leaders in turning the promise of generative AI into business value remains steep and daunting, but the key components of the gen AI roadmap — data, platform, and skills — are evolving and becoming better defined. MIT event, moderated by Lan Guan, CAIO at Accenture.
Read on to find out how such expertise can make you stand out in any industry. Artificial Intelligence: Average salary: $130,277; Expertise premium: $23,525 (15%). AI tops the list as the skill that can earn you the highest pay bump, earning tech professionals nearly an 18% premium over other tech skills.
The following were some initial challenges in automation: Language diversity – The services host both Dutch and English shows. Some local shows feature Flemish dialects, which can be difficult for some large language models (LLMs) to understand. About the Authors: Lucas Desard is a GenAI Engineer at DPG Media.