AWS, Storage and Training

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning - AI

DECEMBER 24, 2024

Training large language models (LLMs) models has become a significant expense for businesses. PEFT is a set of techniques designed to adapt pre-trained LLMs to specific tasks while minimizing the number of parameters that need to be updated.

AWS

AWS Artificial Inteligence Generative AI Training

Multi-LLM routing strategies for generative AI applications on AWS

AWS Machine Learning - AI

APRIL 9, 2025

The Pro tier, however, would require a highly customized LLM that has been trained on specific data and terminology, enabling it to assist with intricate tasks like drafting complex legal documents. Before migrating any of the provided solutions to production, we recommend following the AWS Well-Architected Framework.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Applications

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

The following diagram illustrates the solution architecture: The steps of the solution include: Upload data to Amazon S3 : Store the product images in Amazon Simple Storage Service (Amazon S3). The AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3.

AWS

AWS Engineering Serverless eCommerce

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

AWS Machine Learning - AI

OCTOBER 17, 2024

During re:Invent 2023, we launched AWS HealthScribe , a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation. AWS HealthScribe will then output two files which are also stored on Amazon S3.

AWS

AWS Artificial Inteligence Generative AI Machine Learning

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Across diverse industries—including healthcare, finance, and marketing—organizations are now engaged in pre-training and fine-tuning these increasingly larger LLMs, which often boast billions of parameters and larger input sequence length. This approach reduces memory pressure and enables efficient training of large models.

Training

Training Artificial Inteligence AWS Machine Learning

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Accelerate AWS Well-Architected reviews with Generative AI

AWS Machine Learning - AI

MARCH 4, 2025

To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. This systematic approach leads to more reliable and standardized evaluations.

Generative AI

Generative AI Technical Review Software Review Systems Review

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. On AWS, you can use the fully managed Amazon Bedrock Agents or tools of your choice such as LangChain agents or LlamaIndex agents.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Demystifying RAG and model customization RAG is a technique to enhance the capability of pre-trained models by allowing the model access to external domain-specific data sources. Unlike fine-tuning, in RAG, the model doesnt undergo any training and the model weights arent updated to learn the domain knowledge.

Case Study

Case Study Artificial Inteligence Study Generative AI

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

All the major cloud providers from North America AWS, Google, Microsoft Azure, Oracle Cloud are on par with each other, with most of their services and capabilities are primed to address the needs of any enterprise. The AWS Cloud Adoption Framework (CAF) is an effective tool that helps to evaluate cloud readiness.

Cloud

Cloud Strategy Architecture Policies

Elevate customer experience by using the Amazon Q Business custom plugin for New Relic AI

AWS Machine Learning - AI

DECEMBER 3, 2024

This collaboration between AWS and New Relic opens up possibilities for building more robust digital infrastructures, advancing innovation in customer-facing technologies, and setting new benchmarks in proactive IT problem-solving. To get started on training, enroll for free Amazon Q training from AWS Training and Certification.

Technical Review

Technical Review AWS eCommerce Systems Review

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone. Amazon Linux 2).

AWS

AWS Artificial Inteligence Technical Review Systems Review

AWS to invest $5.3 to build data centers in Saudi Arabia to bolster tech in the region

CIO

MARCH 5, 2024

Amazon Web Services (AWS) is the latest high-tech giant to announce a major stake in Saudi Arabia’s burgeoning technology industry, unveiling a plan this week to invest more than $5.3 Technology and training The new AWS Region in Saudi Arabia will comprise three Availability Zones at launch, with plans to establish more zones in the future.

Data Center

Data Center Technical Review AWS Data

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning - AI

NOVEMBER 15, 2024

At AWS, we are committed to developing AI responsibly , taking a people-centric approach that prioritizes education, science, and our customers, integrating responsible AI across the end-to-end AI lifecycle. For human-in-the-loop evaluation, which can be done by either AWS managed or customer managed teams, you must bring your own dataset.

Applications

Applications Generative AI AWS Artificial Inteligence

AWS launches second region in India with a $4.4 billion commitment

CIO

NOVEMBER 22, 2022

Amazon Web Services (AWS) on Tuesday launched its second region in India and said it was committing $4.4 The new region, which will be based in Hyderabad (designated ap-south-2), will add three availability zones to AWS’ existing infrastructure in the country. billion (Rs 36,300 crore) to scale it till the end of 2030.

AWS

AWS Data Center Infrastructure Construction

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. The following diagram illustrates the end-to-end flow. for the month.

Generative AI

Generative AI Artificial Inteligence AWS Serverless

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isnt just about moving data; it requires careful planning and execution. Need to hire skilled engineers?

AWS

AWS Cloud Weak Development Team DevOps

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning - AI

MAY 1, 2024

Llama2 by Meta is an example of an LLM offered by AWS. It comes in a range of parameter sizes—7 billion, 13 billion, and 70 billion—as well as pre-trained and fine-tuned variations. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart.

AWS

AWS Artificial Inteligence Training Generative AI

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The Education and Training Quality Authority (BQA) plays a critical role in improving the quality of education and training services in the Kingdom Bahrain. BQA oversees a comprehensive quality assurance process, which includes setting performance standards and conducting objective reviews of education and training institutions.

Education

Education Report Technical Review Generative AI

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. Amazon Bedrock Data Automation optimizes for available AWS Regional capacity by automatically routing across regions within the same geographic area to maximize throughput at no additional cost.

Data

Data Generative AI Artificial Inteligence Compliance

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. Its a familiar NeMo-style launcher with which you can choose a recipe and run it on your infrastructure of choice (SageMaker HyperPod or training). recipes=recipe-name.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Kaseya

MAY 13, 2021

As the name suggests, a cloud service provider is essentially a third-party company that offers a cloud-based platform for application, infrastructure or storage services. In a public cloud, all of the hardware, software, networking and storage infrastructure is owned and managed by the cloud service provider. What Is a Public Cloud?

Google Cloud

Google Cloud Azure AWS Cloud

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

AWS Machine Learning - AI

NOVEMBER 21, 2024

As large language models (LLMs) increasingly integrate more multimedia capabilities, human feedback becomes even more critical in training them to generate rich, multi-modal content that aligns with human quality standards. The path to creating effective AI models for audio and video generation presents several distinct challenges.

Video

Video Lambda AWS Generative AI

Revolutionizing clinical trials with the power of voice and AI

AWS Machine Learning - AI

MARCH 18, 2025

LLM analysis The integrated dataset is fed into an LLM specifically trained on medical and clinical trial data. Text processing and contextualization The transcribed text is then fed into an LLM trained on various healthcare datasets, including medical literature, clinical guidelines, and deidentified patient records. An AWS account.

Artificial Inteligence

Artificial Inteligence Technical Review Healthcare Systems Review

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

AWS Machine Learning - AI

NOVEMBER 19, 2024

Amazon Bedrock offers a serverless experience so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using AWS tools without having to manage infrastructure. Deploy the AWS CDK project to provision the required resources in your AWS account.

Knowledge Base

Knowledge Base Generative AI Technical Review Lambda

Taming the cost of AI: Is FinOps the answer?

CIO

APRIL 1, 2025

AI services require high resources like CPU/GPU and memory and hence cloud providers like Amazon AWS, Microsoft Azure and Google Cloud provide many AI services including features for genAI. For AI services, this implies breaking down costs associated with data processing, model training and inferencing.

Technical Review

Technical Review Azure Budget Artificial Intelligence

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning - AI

FEBRUARY 12, 2025

Trained on broad, generic datasets spanning a wide range of topics and domains, LLMs use their parametric knowledge to perform increasingly complex and versatile tasks across multiple business use cases. For details, refer to Creating an AWS account. Be sure to set up your AWS Command Line Interface (AWS CLI) credentials correctly.

Artificial Inteligence

Artificial Inteligence Training AWS Machine Learning

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. The customer interaction transcripts are stored in an Amazon Simple Storage Service (Amazon S3) bucket.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Sequoia India’s Surge backs healthtech startup RedBrick AI in $4.6M funding

TechCrunch

NOVEMBER 22, 2022

But researchers need much of their initial time preparing data for training AI systems. The training process also requires hundreds of annotated medical images and thousands of hours of annotation by clinicians. Healthtech startup RedBrick AI has raised $4.6 Artificial intelligence has become ubiquitous in clinical diagnosis.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Biotech 3D

The road to Software 2.0

O'Reilly Media - Data

DECEMBER 10, 2019

We can collect many examples of what we want the program to do and what not to do (examples of correct and incorrect behavior), label them appropriately, and train a model to perform correctly on new inputs. We’re seeing continued adoption of tools like AWS’ Sagemaker and Google’s AutoML. Instead, we can program by example.

Software

Software Artificial Inteligence Machine Learning Conference

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

AWS Machine Learning - AI

APRIL 18, 2024

Tools like Terraform and AWS CloudFormation are pivotal for such transitions, offering infrastructure as code (IaC) capabilities that define and manage complex cloud environments with precision. AWS Landing Zone addresses this need by offering a standardized approach to deploying AWS resources.

AWS

AWS Applications Lambda Knowledge Base

Making Sense of IoT Platforms: AWS vs Azure vs Google vs IBM vs Cisco

Altexsoft

MAY 20, 2020

These hardware components cache and preprocess real-time data, reducing the burden on central storages and main processors. The list of top five fully-fledged solutions in alphabetical order is as follows : Amazon Web Service (AWS) IoT platform , Cisco IoT , Google Cloud IoT , IBM Watson IoT platform , and. AWS IoT infrastructure.

IoT

IoT Azure AWS Transportation

Welcome to a New Era of Building in the Cloud with Generative AI on AWS

AWS Machine Learning - AI

NOVEMBER 30, 2023

The number of companies launching generative AI applications on AWS is substantial and building quickly, including adidas, Booking.com, Bridgewater Associates, Clariant, Cox Automotive, GoDaddy, and LexisNexis Legal & Professional, to name just a few. Innovative startups like Perplexity AI are going all in on AWS for generative AI.

Generative AI

Generative AI AWS Artificial Inteligence Software Review

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning - AI

NOVEMBER 22, 2023

Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! are the sessions dedicated to AWS DeepRacer ! Generative AI is at the heart of the AWS Village this year. You marked your calendars, you booked your hotel, and you even purchased the airfare. And last but not least (and always fun!)

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Dynamic video content moderation and policy evaluation using AWS generative AI services

AWS Machine Learning - AI

MAY 30, 2024

In this post, we introduce the Media Analysis and Policy Evaluation solution, which uses AWS AI and generative AI services to provide a framework to streamline video extraction and evaluation processes. This solution, powered by AWS AI and generative AI services, meets these needs.

Generative AI

Generative AI Policies Video AWS

IT staff shortages damage the bottom line: IDC report

CIO

MAY 15, 2024

Training can be an answer — if done right The report shed light on two parallel causes: the IT talent shortage itself, and resistance from employees in being retrained for different tasks. But it does require specialized training. Some enterprises “are begging AWS to start certification programs,” Smith said.

Report

Report Software Review Training Survey

Integrating Key Vault Secrets with Azure Synapse Analytics

Apiumhub

DECEMBER 9, 2024

Azure Key Vault Secrets offers a centralized and secure storage alternative for API keys, passwords, certificates, and other sensitive statistics. Azure Key Vault is a cloud service that provides secure storage and access to confidential information such as passwords, API keys, and connection strings. What is Azure Key Vault Secret?

Azure

Azure Analytics Storage Artificial Inteligence

Boost team productivity with Amazon Q Business Insights

AWS Machine Learning - AI

APRIL 9, 2025

Your data is not used for training purposes, and the answers provided by Amazon Q Business are based solely on the data users have access to. By analyzing trends in Total queries , Total conversations , and user-specific metrics, administrators can gauge adoption rates and identify potential areas for user training or system improvements.

Weak Development Team

Weak Development Team Metrics AWS Systems Review

Efficient continual pre-training LLMs for financial domains

AWS Machine Learning - AI

MARCH 28, 2024

Large language models (LLMs) are generally trained on large publicly available datasets that are domain agnostic. For example, Meta’s Llama models are trained on datasets such as CommonCrawl , C4 , Wikipedia, and ArXiv. The resulting LLM outperforms LLMs trained on non-domain-specific datasets when tested on finance-specific tasks.

Artificial Inteligence

Artificial Inteligence Training Generative AI Machine Learning

Empowering everyone with GenAI to rapidly build, customize, and deploy apps securely: Highlights from the AWS New York Summit

AWS Machine Learning - AI

JULY 10, 2024

Today at the AWS New York Summit, we announced a wide range of capabilities for customers to tailor generative AI to their needs and realize the benefits of generative AI faster. Each application can be immediately scaled to thousands of users and is secure and fully managed by AWS, eliminating the need for any operational expertise.

Artificial Inteligence

Artificial Inteligence AWS Generative AI Knowledge Base

The Ultimate, Non-Technical Guide to AWS

Linux Academy

NOVEMBER 7, 2019

One of the most common questions people ask us is, “Do I need to have an IT background to start using AWS?” Knowing how networking works, and being familiar with the client-server model will help you understand the ins and outs of AWS, but what if you need something even more basic than that? AWS in non-technical terms.

AWS

AWS Linux Storage Data Center

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

Asure anticipated that generative AI could aid contact center leaders to understand their teams support performance, identify gaps and pain points in their products, and recognize the most effective strategies for training customer support representatives using call transcripts. John Canada, VP of Engineering at Asure.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Multi-LLM routing strategies for generative AI applications on AWS

Webinars

Trending Sources

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Webinars

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Accelerate AWS Well-Architected reviews with Generative AI

Build a multi-tenant generative AI environment for your enterprise on AWS

Model customization, RAG, or both: A case study with Amazon Nova

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Elevate customer experience by using the Amazon Q Business custom plugin for New Relic AI

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS to invest $5.3 to build data centers in Saudi Arabia to bolster tech in the region

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS launches second region in India with a $4.4 billion commitment

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

How BQA streamlines education quality reporting using Amazon Bedrock

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS vs. Azure vs. Google Cloud: Comparing Cloud Platforms

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Revolutionizing clinical trials with the power of voice and AI

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

Taming the cost of AI: Is FinOps the answer?

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Top 5 Data and Security Topics on AWS

Sequoia India’s Surge backs healthtech startup RedBrick AI in $4.6M funding

The road to Software 2.0

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

Making Sense of IoT Platforms: AWS vs Azure vs Google vs IBM vs Cisco

Welcome to a New Era of Building in the Cloud with Generative AI on AWS

Your guide to generative AI and ML at AWS re:Invent 2023

Dynamic video content moderation and policy evaluation using AWS generative AI services

IT staff shortages damage the bottom line: IDC report

Integrating Key Vault Secrets with Azure Synapse Analytics

Boost team productivity with Amazon Q Business Insights

Efficient continual pre-training LLMs for financial domains

Empowering everyone with GenAI to rapidly build, customize, and deploy apps securely: Highlights from the AWS New York Summit

The Ultimate, Non-Technical Guide to AWS

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Stay Connected