Python is a programming language used in several fields, including data analysis, web development, software programming, scientific computing, and building AI and machine learning models. Amazon Web Services (AWS) is the most widely used cloud platform today.
Large language models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.
With the advent of generative AI and machine learning, new opportunities for enhancement became available across industries and processes. AWS HealthScribe combines speech recognition and generative AI trained specifically for healthcare documentation to accelerate clinical documentation and enhance the consultation experience.
Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate human-like text. Researchers developed Medusa, a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously.
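To make the idea concrete, here is a minimal PyTorch-style sketch (not the actual Medusa implementation) of extra decoding heads that each propose one future token from the same last hidden state; the hidden size, vocabulary size, and head count are illustrative.

```python
# Illustrative sketch only; sizes and names are hypothetical, not Medusa's code.
import torch
import torch.nn as nn

class MedusaStyleHeads(nn.Module):
    """Extra lightweight heads that each guess one future token from the
    base LLM's last hidden state, so several candidate tokens can be
    proposed in a single forward pass and verified by the base model."""
    def __init__(self, hidden_size: int, vocab_size: int, num_heads: int = 4):
        super().__init__()
        self.heads = nn.ModuleList(
            [nn.Linear(hidden_size, vocab_size) for _ in range(num_heads)]
        )

    def forward(self, last_hidden: torch.Tensor) -> torch.Tensor:
        # last_hidden: (batch, hidden_size)
        # returns: (batch, num_heads, vocab_size), one logit vector per
        # speculated future position
        return torch.stack([head(last_hidden) for head in self.heads], dim=1)

heads = MedusaStyleHeads(hidden_size=4096, vocab_size=32000)
logits = heads(torch.randn(1, 4096))
print(logits.shape)  # torch.Size([1, 4, 32000])
```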
It also uses a number of other AWS services such as Amazon API Gateway, AWS Lambda, and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. It consists of one or more components depending on the number of FM providers and number and types of custom models used.
Training large language models (LLMs) has become a significant expense for businesses. For many use cases, companies are looking to use LLM foundation models (FMs) with their domain-specific data. To learn more about Trainium chips and the Neuron SDK, see Welcome to AWS Neuron.
This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. With the significant developments in the field of generative AI, intelligent applications powered by foundation models (FMs) can help users map out an itinerary through an intuitive natural conversation interface.
To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.
Principal wanted to use existing internal FAQs, documentation, and unstructured data and build an intelligent chatbot that could provide quick access to the right information for different roles. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.
However, as the reach of live streams expands globally, language barriers and accessibility challenges have emerged, limiting the ability of viewers to fully comprehend and participate in these immersive experiences. The extension delivers a web application implemented using the AWS SDK for JavaScript and the AWS Amplify JavaScript library.
Amazon Bedrock's single API access, regardless of the model you choose, gives you the flexibility to use different FMs and upgrade to the latest model versions with minimal code changes. Amazon Titan FMs provide customers with a breadth of high-performing image, multimodal, and text model choices, through a fully managed API.
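As a sketch of what that single API looks like in practice, the boto3 Converse call below stays the same across supported models; only the model ID string changes. Credentials and Bedrock model access are assumed, and the model ID shown is just one example.

```python
# Minimal Bedrock Converse sketch; assumes boto3 credentials and model access.
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock.converse(
    modelId="amazon.titan-text-express-v1",  # example model ID; swap freely
    messages=[
        {"role": "user", "content": [{"text": "Summarize what Amazon Bedrock does."}]}
    ],
)
print(response["output"]["message"]["content"][0]["text"])
```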
This post provides guidance on how you can create a video insights and summarization engine using AWS AI/ML services. This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call.
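A hedged sketch of the transcript-and-sentiment portion of such an engine might look like the following; the bucket, file, and job names are hypothetical, and the transcript text passed to sentiment detection is a stand-in for the real job output.

```python
# Sketch of transcript extraction plus sentiment; names/URIs are placeholders.
import boto3

transcribe = boto3.client("transcribe")
comprehend = boto3.client("comprehend")

transcribe.start_transcription_job(
    TranscriptionJobName="call-insights-demo",           # hypothetical job name
    Media={"MediaFileUri": "s3://example-bucket/call.mp3"},
    MediaFormat="mp3",
    LanguageCode="en-US",
)

# Once the job completes and the transcript text has been fetched:
sentiment = comprehend.detect_sentiment(
    Text="Thanks, that resolved my issue quickly.",  # stand-in transcript snippet
    LanguageCode="en",
)
print(sentiment["Sentiment"])  # e.g. POSITIVE
```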
Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases; the first is AWS-specific knowledge search: with Amazon Q Business, we've made internal data sources as well as public AWS content available in Field Advisor's index.
This post was co-written with Vishal Singh, Data Engineering Leader on the Data & Analytics team at GoDaddy. Generative AI solutions have the potential to transform businesses by boosting productivity and improving customer experiences, and using large language models (LLMs) in these solutions has become increasingly popular.
DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.
AWS offers powerful generative AI services, including Amazon Bedrock, which allows organizations to create tailored use cases such as AI chat-based assistants that give answers based on knowledge contained in the customers’ documents, and much more. The following figure illustrates the high-level design of the solution.
Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. The training data, securely stored in Amazon Simple Storage Service (Amazon S3), is copied to the cluster.
This post discusses how to use AWS Step Functions to efficiently coordinate multi-step generative AI workflows, such as parallelizing API calls to Amazon Bedrock to quickly gather answers to lists of submitted questions. It will be marked for deletion and will be deleted when all executions are stopped.
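One plausible shape for such a workflow, sketched below under stated assumptions: a Map state fans out over a list of questions and calls Bedrock through the arn:aws:states:::bedrock:invokeModel optimized integration. The state machine name, role ARN, and model ID are placeholders, and the Titan-style request body is one example format.

```python
# Sketch of a Step Functions state machine that parallelizes Bedrock calls.
import json
import boto3

definition = {
    "StartAt": "AnswerQuestions",
    "States": {
        "AnswerQuestions": {
            "Type": "Map",                 # one iteration per submitted question
            "ItemsPath": "$.questions",
            "MaxConcurrency": 10,          # parallel Bedrock invocations
            "Iterator": {
                "StartAt": "InvokeModel",
                "States": {
                    "InvokeModel": {
                        "Type": "Task",
                        "Resource": "arn:aws:states:::bedrock:invokeModel",
                        "Parameters": {
                            "ModelId": "amazon.titan-text-express-v1",  # example
                            "Body": {"inputText.$": "$"},
                        },
                        "End": True,
                    }
                },
            },
            "End": True,
        }
    },
}

sfn = boto3.client("stepfunctions")
sfn.create_state_machine(
    name="bedrock-question-fanout",                        # hypothetical name
    definition=json.dumps(definition),
    roleArn="arn:aws:iam::123456789012:role/sfn-bedrock",  # placeholder role
)
```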
This is where the integration of cutting-edge technologies, such as audio-to-text translation and large language models (LLMs), holds the potential to revolutionize the way patients receive, process, and act on vital medical information. These insights can include: potential adverse event detection and reporting.
Although machine learning (ML) can produce fantastic results, using it in practice is complex. MLflow offers a powerful way to simplify and scale up ML development throughout an organization by making it easy to track, reproduce, manage, and deploy models. Machine learning workflow challenges. MLflow Models.
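As a minimal illustration of that tracking workflow (not code from the post itself), the sketch below logs a parameter, a metric, and a model artifact for one run; the toy dataset and run name are arbitrary, and mlflow plus scikit-learn are assumed to be installed.

```python
# Minimal MLflow tracking sketch: one reproducible, comparable training run.
import mlflow
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, random_state=0)  # toy data

with mlflow.start_run(run_name="demo-run"):
    model = LogisticRegression(C=0.5, max_iter=200).fit(X, y)
    mlflow.log_param("C", 0.5)                              # hyperparameter
    mlflow.log_metric("train_accuracy", model.score(X, y))  # result
    mlflow.sklearn.log_model(model, "model")                # deployable artifact
```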
Traditionally, transforming raw data into actionable intelligence has demanded significant engineering effort. It often requires managing multiple machine learning (ML) models, designing complex workflows, and integrating diverse data sources into production-ready formats.
In continuation of its efforts to help enterprises migrate to the cloud, Oracle said it is partnering with Amazon Web Services (AWS) to offer database services on the latter’s infrastructure. Oracle Database@AWS is expected to be available in preview later in the year with broader availability expected in 2025.
Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. For more information, see Create a service role for model import.
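For illustration only, a Bedrock Custom Model Import job along those lines might be submitted as below; the role ARN, S3 URI, and model names are placeholders, and the service role must be created first as the linked page describes.

```python
# Hedged sketch of importing custom weights into Amazon Bedrock; all
# identifiers below are placeholders, not values from the post.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

job = bedrock.create_model_import_job(
    jobName="deepseek-r1-distill-import",              # hypothetical
    importedModelName="deepseek-r1-distill-llama-8b",  # hypothetical
    roleArn="arn:aws:iam::123456789012:role/BedrockModelImportRole",
    modelDataSource={"s3DataSource": {"s3Uri": "s3://example-bucket/model-weights/"}},
)
print(job["jobArn"])
```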
We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and reduce operational inefficiencies. The customer interaction transcripts are stored in an Amazon Simple Storage Service (Amazon S3) bucket.
Organizations building and deploying AI applications, particularly those using large language models (LLMs) with Retrieval Augmented Generation (RAG) systems, face a significant challenge: how to evaluate AI outputs effectively throughout the application lifecycle.
Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric, which is available across AWS’ US East (N.
Hybrid architecture with AWS Local Zones: To minimize the impact of network latency on time to first token (TTFT) for users regardless of their location, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.
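The subnet step could look like the following boto3 sketch, assuming the Local Zone has already been opted in; the zone name us-west-2-lax-1a is one example, and the VPC ID and CIDR block are placeholders.

```python
# Sketch of creating a subnet in a Local Zone; IDs/CIDRs are placeholders.
import boto3

ec2 = boto3.client("ec2", region_name="us-west-2")

subnet = ec2.create_subnet(
    VpcId="vpc-0123456789abcdef0",        # placeholder VPC extended to the zone
    CidrBlock="10.0.128.0/24",
    AvailabilityZone="us-west-2-lax-1a",  # a Local Zone name, not a standard AZ
)
print(subnet["Subnet"]["SubnetId"])
```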
Introduction to Multiclass Text Classification with LLMs: Multiclass text classification (MTC) is a natural language processing (NLP) task where text is categorized into multiple predefined categories or classes. Traditional approaches rely on training machine learning models, requiring labeled data and iterative fine-tuning.
DeepSeek-R1 is a large language model (LLM) developed by DeepSeek AI that uses reinforcement learning to enhance reasoning capabilities through a multi-stage training process from a DeepSeek-V3-Base foundation. See the following GitHub repo for more deployment examples using TGI, TensorRT-LLM, and Neuron.
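A minimal sketch of the prompt-based alternative follows, assuming Amazon Bedrock access; the label set, model ID, and prompt wording are illustrative and not taken from the post.

```python
# Hedged sketch: multiclass classification via an instruction prompt instead
# of a trained classifier. Labels and model ID are examples.
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")
LABELS = ["billing", "technical_support", "sales", "other"]  # example classes

def classify(text: str) -> str:
    prompt = (
        "Classify the following message into exactly one of these categories: "
        f"{', '.join(LABELS)}.\nMessage: {text}\n"
        "Answer with the category name only."
    )
    resp = bedrock.converse(
        modelId="amazon.titan-text-express-v1",  # example model
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    return resp["output"]["message"]["content"][0]["text"].strip()

print(classify("My last invoice was charged twice."))  # expected: billing
```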
The storage layer uses Amazon Simple Storage Service (Amazon S3) to hold the invoices that business users upload. Prerequisites: To perform this solution, create and activate an AWS account, and make sure your AWS credentials are configured correctly on your local machine.
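The upload into that storage layer could be as simple as the following sketch; the bucket and file names are placeholders, and credentials are assumed to be configured (for example via `aws configure` or an attached IAM role).

```python
# Sketch of uploading an invoice into the S3 storage layer; names are placeholders.
import boto3

s3 = boto3.client("s3")
s3.upload_file(
    Filename="invoice-2024-001.pdf",   # local file, hypothetical
    Bucket="example-invoice-bucket",   # placeholder bucket
    Key="uploads/invoice-2024-001.pdf",
)
```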
These assistants can be powered by various backend architectures, including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. To learn more about FMEval, see Evaluate large language models for quality and responsibility of LLMs.
Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt it to your unique use case, improving its performance on your specific dataset or task.
This solution uses decorators in your application code to capture and log metadata such as input prompts, output results, and run time, along with custom metadata, offering enhanced security, ease of use, flexibility, and integration with native AWS services. Additionally, you can choose what gets logged.
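A minimal sketch of that decorator pattern (using only the standard library, not the solution's actual code) might look like this; the function and metadata names are illustrative, and the wrapped function stands in for a real LLM call.

```python
# Illustrative decorator that records prompt, output, run time, and custom
# metadata for each model invocation; names are hypothetical.
import functools
import json
import logging
import time

logging.basicConfig(level=logging.INFO)

def log_invocation(**custom_metadata):
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(prompt: str, *args, **kwargs):
            start = time.perf_counter()
            result = fn(prompt, *args, **kwargs)
            logging.info(json.dumps({
                "function": fn.__name__,
                "input_prompt": prompt,
                "output": result,
                "run_time_s": round(time.perf_counter() - start, 3),
                **custom_metadata,        # caller-chosen extra fields
            }))
            return result
        return wrapper
    return decorator

@log_invocation(team="demo", use_case="summarization")  # custom metadata
def invoke_model(prompt: str) -> str:
    return f"(model output for: {prompt})"  # stand-in for a real LLM call

invoke_model("Summarize our Q3 results.")
```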
SageMaker HyperPod recipes help data scientists and developers of all skill levels get started training and fine-tuning popular publicly available generative AI models in minutes, with state-of-the-art training performance. The architecture uses Amazon Elastic Container Registry (Amazon ECR) for container image management.
New and powerful large language models (LLMs) are changing businesses rapidly, improving efficiency and effectiveness for a variety of enterprise use cases. Speed is of the essence, and adoption of LLM technologies can make or break a business's competitive advantage.
AI agents, powered by large language models (LLMs), can analyze complex customer inquiries, access multiple data sources, and deliver relevant, detailed responses. The workflow includes the following steps: documents (owner manuals) are uploaded to an Amazon Simple Storage Service (Amazon S3) bucket.
Artificial intelligence has become ubiquitous in clinical diagnosis. "We see ourselves building the foundational layer of artificial intelligence in healthcare." Healthtech startup RedBrick AI has raised $4.6 million. But researchers spend much of their initial time preparing data for training AI systems.
This application allows users to ask questions in natural language and then generates a SQL query for the user's request. Large language models (LLMs) are trained to generate accurate SQL queries from natural language instructions. However, off-the-shelf LLMs can't be used without some modification.
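One common pattern, sketched here under assumptions (the schema, model ID, and prompt wording are illustrative), is to include the table schema in the prompt so the generated SQL is grounded in real table and column names.

```python
# Hedged text-to-SQL sketch: ground the model with the actual schema.
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

SCHEMA = """CREATE TABLE orders (
  order_id INT, customer_id INT, total DECIMAL(10,2), created_at DATE
);"""  # example schema

def nl_to_sql(question: str) -> str:
    prompt = (
        f"Given this schema:\n{SCHEMA}\n"
        f"Write a single SQL query answering: {question}\n"
        "Return only the SQL."
    )
    resp = bedrock.converse(
        modelId="amazon.titan-text-express-v1",  # example model
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    return resp["output"]["message"]["content"][0]["text"]

print(nl_to_sql("What was the total order value in March 2024?"))
```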
While Microsoft, AWS, Google Cloud, and IBM have already released their generative AI offerings, rival Oracle has so far been largely quiet about its own strategy. Although not confirmed yet, Batta said new foundation models for industry sectors such as health and public safety could be added to the service in the future.
At AWS, we are committed to developing AI responsibly, taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. Model evaluation is used to compare different models' outputs and select the most appropriate model for your use case.
Yes, the AWS re:Invent season is upon us, and as always, the place to be is Las Vegas! Now all you need is some guidance on generative AI and machine learning (ML) sessions to attend at this twelfth edition of re:Invent. And last but not least (and always fun!) are the sessions dedicated to AWS DeepRacer!
You can deploy this solution with just a few clicks using Amazon SageMaker JumpStart , a fully managed platform that offers state-of-the-art foundation models for various use cases such as content writing, code generation, question answering, copywriting, summarization, classification, and information retrieval.
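The same deployment can also be done programmatically; a sketch with the SageMaker Python SDK follows, where the model ID is one example, SageMaker permissions are assumed, and the endpoint incurs cost until deleted.

```python
# Sketch of a programmatic JumpStart deployment; model ID is an example.
from sagemaker.jumpstart.model import JumpStartModel

model = JumpStartModel(model_id="huggingface-llm-falcon-7b-instruct-bf16")
predictor = model.deploy(accept_eula=True)  # provisions a real-time endpoint

response = predictor.predict(
    {"inputs": "Write a product description for a hiking backpack."}
)
print(response)

predictor.delete_endpoint()  # clean up to stop charges
```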
This is where intelligent document processing (IDP), coupled with the power of generative AI, emerges as a game-changing solution. Enhancing the capabilities of IDP is the integration of generative AI, which harnesses large language models (LLMs) and generative techniques to understand and generate human-like text.
Refer to Supported Regions and models for batch inference for currently supported AWS Regions and models. Although batch inference offers numerous benefits, it's limited to 10 batch inference jobs submitted per model per Region. Amazon S3 invokes the {stack_name}-create-batch-queue-{AWS-Region} Lambda function.
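A batch job submission might look like the following sketch; the input must be a JSONL file of model requests in S3, the names, ARNs, and URIs below are placeholders, and the per-model-per-Region job quota noted above still applies.

```python
# Hedged sketch of submitting a Bedrock batch inference job; identifiers
# are placeholders, not values from the post.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

job = bedrock.create_model_invocation_job(
    jobName="batch-summaries-001",               # hypothetical job name
    modelId="amazon.titan-text-express-v1",      # example model
    roleArn="arn:aws:iam::123456789012:role/BedrockBatchRole",
    inputDataConfig={"s3InputDataConfig": {"s3Uri": "s3://example-bucket/input.jsonl"}},
    outputDataConfig={"s3OutputDataConfig": {"s3Uri": "s3://example-bucket/output/"}},
)
print(job["jobArn"])
```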