Artificial Inteligence, Serverless and System Design

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

AWS Machine Learning - AI

MAY 1, 2025

For MCP implementation, you need a scalable infrastructure to host these servers and an infrastructure to host the large language model (LLM), which will perform actions with the tools implemented by the MCP server. You ask the agent to Book a 5-day trip to Europe in January and we like warm weather.

Artificial Inteligence

Artificial Inteligence AWS Architecture Generative AI

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

AWS Machine Learning - AI

APRIL 3, 2024

In this post, we show how to build a contextual text and image search engine for product recommendations using the Amazon Titan Multimodal Embeddings model , available in Amazon Bedrock , with Amazon OpenSearch Serverless. Amazon SageMaker Studio – It is an integrated development environment (IDE) for machine learning (ML).

Serverless

Serverless Artificial Inteligence Engineering Generative AI

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. To learn more about FMEval, see Evaluate large language models for quality and responsibility of LLMs.

Generative AI

Generative AI Systems Review Software Review Artificial Inteligence

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

During the solution design process, Verisk also considered using Amazon Bedrock Knowledge Bases because its purpose built for creating and storing embeddings within Amazon OpenSearch Serverless. In the future, Verisk intends to use the Amazon Titan Embeddings V2 model.

Generative AI

Generative AI Technical Review Insurance Policies

Import a fine-tuned Meta Llama 3 model for SQL query generation on Amazon Bedrock

AWS Machine Learning - AI

AUGUST 1, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading artificial intelligence (AI) companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API. INST] Assistant: The following animation shows the results.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Artificial Intelligence

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock is a fully managed service that makes foundation models (FMs) from leading AI startups and Amazon Web Services available through an API, so you can choose from a wide range of FMs to find the model that is best suited for your use case.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Create a generative AI-based application builder assistant using Amazon Bedrock Agents

AWS Machine Learning - AI

OCTOBER 24, 2024

Agentic workflows are a fresh new perspective in building dynamic and complex business use- case based workflows with the help of large language models (LLM) as their reasoning engine or brain. In this case, use prompt engineering techniques to call the default agent LLM and generate the email validation code.

Generative AI

Generative AI Artificial Inteligence Applications Knowledge Base

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and AWS CloudFormation

AWS Machine Learning - AI

AUGUST 5, 2024

Prerequisites To implement the solution provided in this post, you should have the following: An active AWS account and familiarity with FMs, Amazon Bedrock, and OpenSearch Serverless. The Amazon Titan Embeddings G1-Text model enabled in Amazon Bedrock. He specializes in generative AI, machine learning, and system design.

Knowledge Base

Knowledge Base AWS Generative AI Artificial Inteligence

How Mixbook used generative AI to offer personalized photo book experiences

AWS Machine Learning - AI

JULY 15, 2024

This pivotal decision has been instrumental in propelling them towards fulfilling their mission, ensuring their system operations are characterized by reliability, superior performance, and operational efficiency. Vlad enjoys learning about both contemporary and ancient cultures, their histories, and languages.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Reinvent personalization with generative AI on Amazon Bedrock using task decomposition for agentic workflows

AWS Machine Learning - AI

SEPTEMBER 18, 2024

Generative AI and large language models (LLMs) offer new possibilities, although some businesses might hesitate due to concerns about consistency and adherence to company guidelines. In this solution, the LLM is asked to use the sentence without changes because it’s a testimonial.

Generative AI

Generative AI Artificial Inteligence Fractional CTO Guidelines

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS CDK

AWS Machine Learning - AI

AUGUST 28, 2024

By using the AWS CDK, the solution sets up the necessary resources, including an AWS Identity and Access Management (IAM) role, Amazon OpenSearch Serverless collection and index, and knowledge base with its associated data source. He specializes in generative AI, machine learning, and system design.

Knowledge Base

Knowledge Base AWS Generative AI Machine Learning

Core technologies and tools for AI, big data, and cloud computing

O'Reilly Media - Ideas

FEBRUARY 11, 2019

Highlights and use cases from companies that are building the technologies needed to sustain their use of analytics and machine learning. In a forthcoming survey, “Evolving Data Infrastructure,” we found strong interest in machine learning (ML) among respondents across geographic regions. Deep Learning.

Big Data

Big Data Technology Tools Cloud

Import a question answering fine-tuned model into Amazon Bedrock as a custom model

AWS Machine Learning - AI

SEPTEMBER 30, 2024

As an Information Technology Leader, Jay specializes in artificial intelligence, generative AI, data integration, business intelligence, and user interface domains. He currently focuses on serving of models and MLOps on Amazon SageMaker. Rupinder Grewal is a Senior AI/ML Specialist Solutions Architect with AWS.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Software Review

New live online training courses

O'Reilly Media - Ideas

JUNE 4, 2019

Get hands-on training in Docker, microservices, cloud native, Python, machine learning, and many other topics. Learn new topics and refine your skills with more than 219 new live online training courses we opened up for June and July on the O'Reilly online learning platform. AI and machine learning.

Course

Course Training Artificial Inteligence Software Review

160+ live online training courses opened for May and June

O'Reilly Media - Ideas

MAY 1, 2019

Get hands-on training in machine learning, blockchain, cloud native, PySpark, Kubernetes, and many other topics. Learn new topics and refine your skills with more than 160 new live online training courses we opened up for May and June on the O'Reilly online learning platform. AI and machine learning.

Course

Course Training Artificial Inteligence Machine Learning

219+ live online training courses opened for June and July

O'Reilly Media - Ideas

JUNE 5, 2019

Get hands-on training in Docker, microservices, cloud native, Python, machine learning, and many other topics. Learn new topics and refine your skills with more than 219 new live online training courses we opened up for June and July on the O'Reilly online learning platform. AI and machine learning.

Course

Course Training Artificial Inteligence Software Review

High-performance computing on AWS

Xebia

AUGUST 29, 2023

It’s built on serverless services (API Gateway / Lambda) and provides the same functionality as the CLI tool pcluster. It uses OS-bypass capabilities and enhances the performance of inter-instance communication that is critical for scaling HPC and machine learning applications.

AWS

AWS Performance Storage Linux

Apiumhub among top IT industry leaders in Code Europe event

Apiumhub

AUGUST 12, 2021

Gema Parreño Piqueras – Lead Data Science @Apiumhub Gema Parreno is currently a Lead Data Scientist at Apiumhub, passionate about machine learning and video games, with three years of experience at BBVA and later at Google in ML Prototype. Twitter: [link] Linkedin: [link]. She started her own startup (Cubicus) in 2013.

Industry

Industry Technical Advisors CTO Coach Azure

The Cloud Cost-Conscious Conundrum

taos

JULY 18, 2017

Data storage, logic hosting and monitoring tools exist and provide quick integration into existing system designs. Why run a server if you could be serverless? And why build your own system monitoring or log aggregation solution when a service can be consumed? Other non-infrastructure services also exist.

Cloud

Cloud Part-Time VPE Data Center Technical Review

Technology Trends for 2022

O'Reilly Media - Ideas

JANUARY 25, 2022

Finally, last year we observed that serverless appeared to be keeping pace with microservices. While microservices shows healthy growth, serverless is one of the few topics in this group to see a decline—and a large one at that (41%). Programming Languages. That’s no longer true. AI, ML, and Data.

Trends

Trends Technical Review Technology Artificial Inteligence

Journey to Event Driven – Part 2: Programming Models for the Event-Driven Architecture

Confluent

FEBRUARY 13, 2019

Rather, we apply different event planes to provide orthogonal aspects of system design such as core functionality, operations and instrumentation. This is how we think about system design and architecture. Another benefit of the event streaming systems is that we can continuously extend the functionality.

Architecture

Architecture Programming Microservices Serverless

Where Programming, Ops, AI, and the Cloud are Headed in 2021

O'Reilly Media - Ideas

JANUARY 25, 2021

We’re not pretending the frameworks themselves are comparable—Spring is primarily for backend and middleware development (though it includes a web framework); React and Angular are for frontend development; and scikit-learn and PyTorch are machine learning libraries. serverless, a.k.a. AI, Machine Learning, and Data.

Programming

Programming Cloud Artificial Inteligence Machine Learning

8 AI trends that will define product development in 2025 & beyond

Modus Create

FEBRUARY 12, 2025

Every year, new trends, frameworks, and practices capture the industrys imaginationwhether it was no-code in 2024, Web3 in 2023, or serverless architecture in 2022. Machine learning models can now detect many potential failures before they arise , minimizing defects and accelerating time-to-market.

Weak Development Team

Weak Development Team Trends Development Technical Review

CTO Universe

Multi-LLM routing strategies for generative AI applications on AWS

Extend large language models powered by Amazon SageMaker AI using Model Context Protocol

Webinars

Trending Sources

Build a contextual text and image search engine for product recommendations using Amazon Bedrock and Amazon OpenSearch Serverless

Webinars

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

Import a fine-tuned Meta Llama 3 model for SQL query generation on Amazon Bedrock

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Create a generative AI-based application builder assistant using Amazon Bedrock Agents

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and AWS CloudFormation

How Mixbook used generative AI to offer personalized photo book experiences

Reinvent personalization with generative AI on Amazon Bedrock using task decomposition for agentic workflows

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS CDK

Core technologies and tools for AI, big data, and cloud computing

Import a question answering fine-tuned model into Amazon Bedrock as a custom model

New live online training courses

160+ live online training courses opened for May and June

219+ live online training courses opened for June and July

High-performance computing on AWS

Apiumhub among top IT industry leaders in Code Europe event

The Cloud Cost-Conscious Conundrum

Technology Trends for 2022

Journey to Event Driven – Part 2: Programming Models for the Event-Driven Architecture

Where Programming, Ops, AI, and the Cloud are Headed in 2021

8 AI trends that will define product development in 2025 & beyond

Stay Connected