For all the excitement about machine learning (ML), there are serious impediments to its widespread adoption, not least the broadening realization that ML models can fail. In addition to newer innovations, the practice borrows from model risk management, traditional model diagnostics, and software testing.
Across diverse industries—including healthcare, finance, and marketing—organizations are now engaged in pre-training and fine-tuning these increasingly large LLMs, which often have billions of parameters and ever-longer input sequence lengths. This approach reduces memory pressure and enables efficient training of large models.
Fine-tuning involves another round of training for a specific model to help guide the output of LLMs to meet the specific standards of an organization. Given some example data, LLMs can quickly learn new content that wasn’t available during the initial training of the base model. Build and test training and inference prompts.
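As a rough illustration of what that extra round of training looks like in practice, here is a minimal supervised fine-tuning sketch using the Hugging Face transformers and datasets libraries. The model name, the two toy prompt/response examples, and the hyperparameters are all placeholders, not anything prescribed by the article.

```python
# Minimal supervised fine-tuning sketch; model name, data, and hyperparameters are illustrative.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "distilgpt2"  # assumption: any small causal LM works for the sketch
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Example data written in the organization's preferred style.
examples = [
    {"text": "Q: What is our refund window?\nA: 30 days from delivery."},
    {"text": "Q: How do I reset my password?\nA: Use the self-service portal."},
]
dataset = Dataset.from_list(examples).map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128), batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="ft-out", num_train_epochs=1,
                           per_device_train_batch_size=2, logging_steps=1),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```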
Technology, machine learning, hardware, software — and yes, lasers! Founded by a team whose backgrounds include physics, stem cell biology, and machine learning, Cellino operates in the regenerative medicine industry and could eventually democratize access to cell therapies.
I’ve spent more than 25 years working with machine learning and automation technology, and agentic AI is clearly a difficult problem to solve. One of the best is a penetration test that checks for ways someone could access a network. Could it work through complex, dynamic branch points, make autonomous decisions, and act on them?
After months of crunching data, plotting distributions, and testing out various machine learning algorithms, you have finally proven to your stakeholders that your model can deliver business value. For the sake of argument, we will assume the machine learning model is periodically trained on a finite set of historical data.
This is a revolutionary new capability within Amazon Bedrock that serves as a centralized hub for discovering, testing, and implementing foundation models (FMs). Nemotron-4 15B, with its impressive 15-billion-parameter architecture trained on 8 trillion text tokens, brings powerful multilingual and coding capabilities to Amazon Bedrock.
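For readers who want to poke at a marketplace model from code rather than the console, a minimal sketch with the boto3 Bedrock runtime Converse API might look like the following; the model identifier is a placeholder you would replace with the exact ID or endpoint ARN shown in the Bedrock console.

```python
# Sketch: send one test prompt through the Bedrock runtime Converse API.
# The modelId is a placeholder -- copy the real ID or endpoint ARN from the console.
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")
response = bedrock.converse(
    modelId="<model-id-or-endpoint-arn>",
    messages=[{"role": "user", "content": [{"text": "Write a one-line summary of RAG."}]}],
    inferenceConfig={"maxTokens": 256, "temperature": 0.5},
)
print(response["output"]["message"]["content"][0]["text"])
```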
The pressure is on for CIOs to deliver value from AI, but pressing ahead with AI implementations without the necessary workforce training in place is a recipe for falling short of their goals. For many IT leaders, being central to organization-wide training initiatives may be new territory. “At
Delta Lake: Fueling insurance AI. Centralizing data and creating a Delta Lakehouse architecture significantly enhances AI model training and performance, yielding more accurate insights and predictive capabilities. A critical consideration emerges regarding enterprise AI platform implementation.
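A minimal sketch of the centralization idea, assuming PySpark with the delta-spark package: raw records land once in a Delta table, and every training job reads that same versioned table. Bucket paths, column contents, and the pinned version are illustrative.

```python
# Sketch: write raw claims data to a Delta table, then read a pinned version for training.
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("claims-delta")
         .config("spark.jars.packages", "io.delta:delta-spark_2.12:3.2.0")  # match your Spark build
         .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
         .config("spark.sql.catalog.spark_catalog",
                 "org.apache.spark.sql.delta.catalog.DeltaCatalog")
         .getOrCreate())

claims = spark.read.json("s3://<bucket>/raw/claims/")  # raw landing zone (path is a placeholder)
claims.write.format("delta").mode("append").save("s3://<bucket>/lakehouse/claims")

# Training jobs read the same table, optionally pinned to a version for reproducibility.
training_df = (spark.read.format("delta")
               .option("versionAsOf", 0)
               .load("s3://<bucket>/lakehouse/claims"))
```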
Wetmur says Morgan Stanley has been using modern data science, AI, and machine learning for years to analyze data and activity, pinpoint risks, and initiate mitigation, noting that teams at the firm have earned patents in this space. “I am excited about the potential of generative AI, particularly in the security space,” she says.
Fine-tuning is a powerful approach in natural language processing (NLP) and generative AI, allowing businesses to tailor pre-trained large language models (LLMs) for specific tasks. The TAT-QA dataset has been divided into train (28,832 rows), dev (3,632 rows), and test (3,572 rows) splits.
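Assuming each TAT-QA split has been exported to a JSON Lines file (the file names below are invented for the sketch), the three splits can be loaded and their row counts sanity-checked with the Hugging Face datasets library before fine-tuning.

```python
# Sketch: load the three splits and confirm the row counts quoted above.
from datasets import load_dataset

splits = load_dataset("json", data_files={
    "train": "tatqa_train.jsonl",  # expected: 28,832 rows
    "dev": "tatqa_dev.jsonl",      # expected: 3,632 rows
    "test": "tatqa_test.jsonl",    # expected: 3,572 rows
})
for name, split in splits.items():
    print(name, len(split))
```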
Demystifying RAG and model customization: RAG (Retrieval Augmented Generation) is a technique to enhance the capability of pre-trained models by giving the model access to external, domain-specific data sources. Unlike fine-tuning, in RAG the model doesn’t undergo any training and the model weights aren’t updated to learn the domain knowledge.
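A toy sketch of that distinction: the retrieval step picks the most relevant passage and the prompt is augmented with it, while the model itself stays frozen. The tiny in-memory document list and the sentence-transformers encoder are stand-ins for whatever vector store and embedding model you actually use.

```python
# Sketch: retrieve the best-matching passage and build a grounded prompt; no weights change.
from sentence_transformers import SentenceTransformer, util

documents = [
    "Our warranty covers manufacturing defects for 24 months.",
    "Returns are accepted within 30 days with a receipt.",
]
encoder = SentenceTransformer("all-MiniLM-L6-v2")
doc_embeddings = encoder.encode(documents, convert_to_tensor=True)

def build_rag_prompt(question: str) -> str:
    q_emb = encoder.encode(question, convert_to_tensor=True)
    best = int(util.cos_sim(q_emb, doc_embeddings).argmax())
    return f"Answer using only this context:\n{documents[best]}\n\nQuestion: {question}"

# Send the returned prompt to any LLM client; the model only sees extra context.
print(build_rag_prompt("How long is the warranty?"))
```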
Tuning model architectures requires technical expertise: selecting training and fine-tuning parameters and managing distributed training infrastructure, among other tasks. However, customizing DeepSeek models effectively while managing computational resources remains a significant challenge.
The company has post-trained its new Llama Nemotron family of reasoning models to improve multistep math, coding, reasoning, and complex decision-making. Post-training is a set of processes and techniques for refining and optimizing a machine learning model after its initial training on a dataset.
Over the past several months, we drove several improvements in intelligent prompt routing based on customer feedback and extensive internal testing. In this blog post, we detail various highlights from our internal testing, explain how you can get started, and point out some caveats and best practices. Let’s dive in! v1, Haiku 3.5, Sonnet 3.5
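The managed feature learns the routing decision for you, but a toy heuristic makes the underlying idea concrete: cheap, simple prompts go to a lighter model and harder ones to a stronger model. The marker list and model labels below are invented for illustration.

```python
# Toy prompt router: pick a model tier from simple features of the prompt.
def route(prompt: str) -> str:
    hard_markers = ("step by step", "prove", "analyze", "compare and contrast")
    is_hard = len(prompt) > 500 or any(m in prompt.lower() for m in hard_markers)
    return "strong-model" if is_hard else "fast-model"

print(route("What is 2 + 2?"))                        # -> fast-model
print(route("Explain step by step how RAG works."))   # -> strong-model
```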
DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs.
What was once a preparatory task for training AI is now a core part of a continuous feedback and improvement cycle. Training compact, domain-specialized models that outperform general-purpose LLMs in areas like healthcare, legal, finance, and beyond. Today’s annotation tools are no longer just for labeling datasets.
But that’s exactly the kind of data you want to include when training an AI to give photography tips. Conversely, some of the other inappropriate advice found in Google searches might have been avoided if the origin of content from obviously satirical sites had been retained in the training set.
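One simple way to act on that observation, sketched below, is to keep provenance alongside each training example and filter by source domain before training; the domain list and example records are illustrative, not taken from any real pipeline.

```python
# Sketch: drop training examples that originate from known-satire domains.
SATIRE_DOMAINS = {"theonion.com", "clickhole.com"}  # illustrative blocklist

def keep(example: dict) -> bool:
    domain = example["source_url"].split("/")[2].removeprefix("www.")
    return domain not in SATIRE_DOMAINS

corpus = [
    {"text": "Use a fast shutter speed for moving subjects.",
     "source_url": "https://photo.example.com/tips"},
    {"text": "Obviously satirical photography advice.",
     "source_url": "https://www.theonion.com/some-article"},
]
training_set = [ex for ex in corpus if keep(ex)]
print(len(training_set))  # -> 1
```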
You can try these models with SageMaker JumpStart, a machine learning (ML) hub that provides access to algorithms and models that can be deployed with one click for running inference. Both pre-trained base and instruction-tuned checkpoints are available under the Apache 2.0 license.
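The same one-click deployment can be scripted with the SageMaker Python SDK; a minimal sketch follows, with the model ID left as a placeholder to copy from the JumpStart model card (deploying creates a billable endpoint, hence the cleanup call).

```python
# Sketch: deploy a JumpStart model programmatically and run one inference request.
from sagemaker.jumpstart.model import JumpStartModel

model = JumpStartModel(model_id="<jumpstart-model-id>")  # placeholder ID from the model card
predictor = model.deploy(accept_eula=True)               # provisions a real-time endpoint

print(predictor.predict({"inputs": "Summarize: SageMaker JumpStart provides one-click models."}))

predictor.delete_endpoint()  # clean up to avoid ongoing charges
```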
Automation: Maximizing tools and practices in delivery environments, such as IaC, CI/CD, DevOps, SecOps, and test automation, aligned with the technology and cloud provider stacks to enable sustainable agile delivery. This requires close attention to detail, auditing/testing, and upfront planning and design.
In 2013, I was fortunate to get into artificial intelligence (more specifically, deep learning) six months before it blew up internationally. It started when I took a course on Coursera called “Neural Networks for Machine Learning” by Geoffrey Hinton. It was like being lovestruck.
Amazon Bedrock provides two primary methods for preparing your training data: uploading JSONL files to Amazon S3 or using historical invocation logs. Tool specification format requirements: for agent function calling distillation, Amazon Bedrock requires that tool specifications be provided as part of your training data.
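As a rough template for the S3 route, the sketch below writes a small JSONL file and uploads it; the simple prompt/completion record shape is the basic fine-tuning form, and distillation jobs with tool specifications expect additional fields, so check the Bedrock documentation for the exact schema your job type requires.

```python
# Sketch: write JSONL training records and upload them to S3 for a Bedrock customization job.
import json
import boto3

records = [
    {"prompt": "Classify the sentiment: 'Great service!'", "completion": "positive"},
    {"prompt": "Classify the sentiment: 'Very slow delivery.'", "completion": "negative"},
]
with open("train.jsonl", "w") as f:
    for record in records:
        f.write(json.dumps(record) + "\n")

boto3.client("s3").upload_file("train.jsonl", "<your-bucket>", "bedrock/train.jsonl")
```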
Today it was FireEye’s turn, snagging Respond Software, a company that helps customers investigate and understand security incidents while reducing the need for highly trained (and scarce) security analysts. The acquisition gives them a quick influx of machine learning-fueled software.
The use of synthetic data to train AI models is about to skyrocket, as organizations look to fill in gaps in their internal data, build specialized capabilities, and protect customer privacy, experts predict. Gartner, for example, projects that by 2028, 80% of data used by AIs will be synthetic, up from 20% in 2024.
Smart Snippet Model in Coveo: the Coveo Machine Learning Smart Snippets model shows users direct answers to their questions on the search results page. Navigate to Recommendations: in the left-hand menu, click “Models” under the “Machine Learning” section.
Kakkar and his IT teams are enlisting automation, machine learning, and AI to facilitate the transformation, which will require significant innovation, especially at the edge. Kakkar’s litmus test for pursuing a project depends on whether it has a clear purpose, goal, and measurable objectives.
At the core of Union is Flyte, an open source tool for building production-grade workflow automation platforms with a focus on data, machine learning, and analytics stacks. But there was always friction between the software engineers and machine learning specialists.
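For readers unfamiliar with Flyte, a minimal sketch of its building blocks follows: typed tasks composed into a workflow. The toy data-preparation and “training” logic is a stand-in for a real pipeline.

```python
# Minimal Flyte sketch: two typed tasks composed into a workflow.
from typing import List
from flytekit import task, workflow

@task
def prepare(n: int) -> List[int]:
    return list(range(n))

@task
def train(data: List[int]) -> float:
    # Placeholder "metric"; a real task would train and evaluate a model here.
    return sum(data) / max(len(data), 1)

@workflow
def pipeline(n: int = 10) -> float:
    return train(data=prepare(n=n))

if __name__ == "__main__":
    print(pipeline(n=5))  # workflows can be executed locally for quick testing
```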
The generative AI playground is a UI provided to tenants where they can run their one-time experiments, chat with several FMs, and manually test capabilities such as guardrails or model evaluation for exploration purposes. This is itself a microservice, inspired by the Orchestrator Saga pattern in microservices.
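For context, the Orchestrator Saga pattern coordinates a sequence of steps and runs compensating actions in reverse order when a step fails; the toy sketch below is a generic illustration of that idea, not the playground’s actual implementation.

```python
# Toy saga orchestrator: execute steps in order, compensate in reverse on failure.
def run_saga(steps):
    compensations = []
    try:
        for action, compensate in steps:
            action()
            compensations.append(compensate)
    except Exception:
        for compensate in reversed(compensations):
            compensate()
        raise

run_saga([
    (lambda: print("reserve playground session"), lambda: print("release session")),
    (lambda: print("invoke model"),               lambda: print("log failed invocation")),
])
```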
Trained on broad, generic datasets spanning a wide range of topics and domains, LLMs use their parametric knowledge to perform increasingly complex and versatile tasks across multiple business use cases. This blog post is co-written with Moran Beladev, Manos Stergiadis, and Ilya Gusev from Booking.com.
Amazon SageMaker Canvas is a no-code machine learning (ML) service that empowers business analysts and domain experts to build, train, and deploy ML models without writing a single line of code. You can review the model status and test the model on the Predict tab.
However, any customer-facing genAI apps need to be extensively and continuously tested and trained to ensure accuracy and a high-quality experience. Creating a superior customer experience: Organizations can supercharge the customer experience with genAI analysis of customer feedback, personalized chatbots, and tailored engagement.
I don’t have any experience working with AI and machine learning (ML). We also read Grokking Deep Learning in the book club at work. Seeing a neural network that starts with random weights and, after training, is able to make good predictions is almost magical. These systems require labeled images for training.
“As a developer, you want to test an idea or build a new feature, and it can take weeks to get access to the data you need.” The first product is an open source, synthetic machine learning library for developers that strips out personally identifiable information. to train AI with synthetic data.
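A toy illustration of the PII-stripping idea (not the library’s actual implementation): mask obvious identifiers such as emails and phone numbers before a record is used for training or synthesis. The regexes are deliberately simplistic.

```python
# Toy PII scrubber: mask emails and US-style phone numbers in free text.
import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")
PHONE = re.compile(r"\b\d{3}[-.\s]?\d{3}[-.\s]?\d{4}\b")

def strip_pii(text: str) -> str:
    return PHONE.sub("<PHONE>", EMAIL.sub("<EMAIL>", text))

print(strip_pii("Contact jane.doe@example.com or 555-867-5309 for access."))
# -> Contact <EMAIL> or <PHONE> for access.
```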
We need to train the organization to leverage AI to solve business problems, not just to create something new. Navigating the AI and machine learning journey will become an even bigger focus for IT leaders over the next year, according to three-quarters of IT leader respondents. Direction is being set by the executive suite.
The Education and Training Quality Authority (BQA) plays a critical role in improving the quality of education and training services in the Kingdom of Bahrain. BQA oversees a comprehensive quality assurance process, which includes setting performance standards and conducting objective reviews of education and training institutions.
Refer to Supported models and Regions for fine-tuning and continued pre-training for updates on Regional availability and quotas. The required training dataset (and optional validation dataset) must be prepared and stored in Amazon Simple Storage Service (Amazon S3). As of writing this post, Meta Llama 3.2
And 20% of IT leaders say machine learning/artificial intelligence will drive the most IT investment. Insights gained from analytics and actions driven by machine learning algorithms can give organizations a competitive advantage, but mistakes can be costly in terms of reputation, revenue, or even lives.
Protect AI claims to be one of the few security companies focused entirely on developing tools to defend AI systems and machine learning models from exploits. “We have researched and uncovered unique exploits and provide tools to reduce risk inherent in [machine learning] pipelines.”
In our tests, we’ve seen substantial improvements in scaling times for generative AI model endpoints across various frameworks. AI and ML engineers can now move from model training to production deployment with unprecedented speed, reducing time-to-market for new AI features and improvements. gpu-py311-cu124-ubuntu22.04-sagemaker",
As companies increasingly move to take advantage of machine learning to run their business more efficiently, the fact is that it takes an abundance of energy to build, test, and run models in production. an energy-efficient solution for customers to build machine learning models using its solution.
“Ninety percent of the data is used as a training set, and 10% for algorithm validation and testing. In keeping with the data-centric AI approach, we attach great importance to the test sets, to be sure that they contain the best possible representation of signals from our clients. When a human interprets an ECG, they see a curve.”
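The quoted 90/10 split is straightforward to reproduce; a sketch with scikit-learn is below, where the random features and labels stand in for the real ECG data (stratifying keeps the label balance similar in both sets).

```python
# Sketch: 90/10 train/test split with stratification; X and y are synthetic placeholders.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.rand(1000, 12)       # e.g., features derived from a 12-lead ECG (illustrative)
y = np.random.randint(0, 2, 1000)  # binary labels (illustrative)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.10, stratify=y, random_state=42)
print(len(X_train), len(X_test))   # -> 900 100
```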
Finding the right learning platform can be difficult, especially as companies look to upskill and reskill their talent to meet demand for certain technological capabilities, like data science, machine learning, and artificial intelligence roles.
“The idea is to create a fictional version of a real dataset that can be used safely for a variety of purposes, including safeguarding confidential data, reducing bias, and also improving machine learning models,” he said. Programmatic synthetic data helps developers in many ways.
With offices in Tel Aviv and New York, Datagen “is creating a complete CV stack that will propel advancements in AI by simulating real world environments to rapidly train machine learning models at a fraction of the cost,” Vitus said. Investors that had backed Datagen’s $18.5