Agile, Data Engineering and Machine Learning

From legacy to lakehouse: Centralizing insurance data with Delta Lake

CIO

APRIL 23, 2025

I believe that the fundamental design principles behind these systems, being siloed, batch-focused, schema-rigid and often proprietary, are inherently misaligned with the demands of our modern, agile, data-centric and AI-enabled insurance industry. Features like time-travel allow you to review historical data for audits or compliance.

Insurance

Insurance Artificial Inteligence Data Architecture

Are you ready for MLOps? 🫵

Xebia

FEBRUARY 28, 2025

Universities have been pumping out Data Science grades in rapid pace and the Open Source community made ML technology easy to use and widely available. Both the tech and the skills are there: Machine Learning technology is by now easy to use and widely available. Dev ML teams work agile and experiment rapidly using PoC’s.

Technical Review

Technical Review Weak Development Team Artificial Inteligence Machine Learning

What is data architecture? A framework to manage data

CIO

DECEMBER 20, 2024

Invest in core functions that perform data curation such as modeling important relationships, cleansing raw data, and curating key dimensions and measures. Optimize data flows for agility. Limit the times data must be moved to reduce cost, increase data freshness, and optimize enterprise agility.

Architecture

Architecture Data Fractional CTO Technical Review

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Article: Agile Development Applied to Machine Learning Projects

InfoQ Culture Methods

FEBRUARY 25, 2021

Machine learning is a powerful new tool, but how does it fit in your agile development? Developing ML with agile has a few challenges that new teams coming up in the space need to be prepared for - from new roles like Data Scientists to concerns in reproducibility and dependency management. By Jay Palat.

Artificial Inteligence

Artificial Inteligence Machine Learning Agile Development

IT leaders: What’s the gameplan as tech badly outpaces talent?

CIO

MARCH 13, 2025

Gen AI-related job listings were particularly common in roles such as data scientists and data engineers, and in software development. Were building a department of AI engineering, mostly by bringing in people from data engineering and training them to work with gen AI and AI in general, says Daniel Avancini, Indiciums CDO.

Part-Time VPE

Part-Time VPE Weak Development Team Fractional VPE Fractional CTO

What is DataOps? Collaborative, cross-functional analytics

CIO

DECEMBER 22, 2022

DataOps (data operations) is an agile, process-oriented methodology for developing and delivering analytics. It brings together DevOps teams with data engineers and data scientists to provide the tools, processes, and organizational structures to support the data-focused enterprise. What is DataOps?

Analytics

Analytics Data Engineering Artificial Inteligence Machine Learning

Building a vision for real-time artificial intelligence

CIO

APRIL 12, 2023

Real-time AI involves processing data for making decisions within a given time frame. Real-time AI brings together streaming data and machine learning algorithms to make fast and automated decisions; examples include recommendations, fraud detection, security monitoring, and chatbots. It isn’t easy.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Machine Learning Agile

What is Data Engineering: Explaining Data Pipeline, Data Warehouse, and Data Engineer Role

Altexsoft

JUNE 25, 2019

Being at the top of data science capabilities, machine learning and artificial intelligence are buzzing technologies many organizations are eager to adopt. If we look at the hierarchy of needs in data science implementations, we’ll see that the next step after gathering your data for analysis is data engineering.

Data Engineering

Data Engineering Engineering Data Artificial Inteligence

Union.ai raises $10M to simplify AI and ML workflow orchestration

TechCrunch

APRIL 12, 2022

“Searching for the right solution led the team deep into machine learning techniques, which came with requirements to use large amounts of data and deliver robust models to production consistently … The techniques used were platformized, and the solution was used widely at Lyft.” ” Taking Flyte.

Artificial Inteligence

Artificial Inteligence Machine Learning Open Source Biotech

Make the leap to Hybrid with Cloudera Data Engineering

Cloudera

FEBRUARY 14, 2022

When we introduced Cloudera Data Engineering (CDE) in the Public Cloud in 2020 it was a culmination of many years of working alongside companies as they deployed Apache Spark based ETL workloads at scale. Each unlocking value in the data engineering workflows enterprises can start taking advantage of. Usage Patterns.

Data Engineering

Data Engineering Engineering Data Storage

AI Chihuahua! Part I: Why Machine Learning is Dogged by Failure and Delays

d2iq

FEBRUARY 19, 2021

Going from a prototype to production is perilous when it comes to machine learning: most initiatives fail , and for the few models that are ever deployed, it takes many months to do so. As little as 5% of the code of production machine learning systems is the model itself. Adapted from Sculley et al.

Artificial Inteligence

Artificial Inteligence Machine Learning Technical Review Software Review

Why a data scientist is not a data engineer

O'Reilly Media - Ideas

APRIL 9, 2019

A few months ago, I wrote about the differences between data engineers and data scientists. An interesting thing happened: the data scientists started pushing back, arguing that they are, in fact, as skilled as data engineers at data engineering. I agree; learn as much as you can.

Data Engineering

Data Engineering Engineering Data Technical Review

Predictive analytics helps Fresenius anticipate dialysis complications

CIO

OCTOBER 18, 2023

In September 2021, Fresenius set out to use machine learning and cloud computing to develop a model that could predict IDH 15 to 75 minutes in advance, enabling personalized care of patients with proactive intervention at the point of care. This shift in attitude and expectations needed to come top down and bottom up,” he says.

Artificial Inteligence

Artificial Inteligence Analytics Machine Learning Artificial Intelligence

When is data too clean to be useful for enterprise AI?

CIO

NOVEMBER 27, 2024

For AI, there’s no universal standard for when data is ‘clean enough.’ AI needs data cleaning that’s more agile, collaborative, iterative and customized for how data is being used, adds Carlsson. The great thing is we’re using data in lots of different ways we didn’t before,” he says.

Data

Data Enterprise Weak Development Team Software Review

What is data science? Transforming data into value

CIO

APRIL 22, 2022

Data science is a method for gleaning insights from structured and unstructured data using approaches ranging from statistical analysis to machine learning. Data science gives the data collected by an organization a purpose. TensorFlow: Developed by Google and licensed under Apache License 2.0,

Data

Data Artificial Inteligence Machine Learning Analytics

What CEOs really need from today’s CIOs

CIO

AUGUST 3, 2022

Modern delivery is product (rather than project) management , agile development, small cross-functional teams that co-create , and continuous integration and delivery all with a new financial model that funds “value” not “projects.”. Modern delivery. The cloud. The cloud is about more than managing costs.

Leadership

Leadership SDLC Artificial Inteligence Machine Learning

Top 8 IT certifications in demand today

CIO

OCTOBER 20, 2023

Certified Agile Leadership (CAL) The Certified Agile Leadership (CAL) certification is offered by ScrumAlliance and includes three certification modules, including CAL Essentials, CAL for Teams, and CAL for Organizations. Microsoft also offers certifications focused on fundamentals, specific job roles, or specialty use cases.

SCRUM

SCRUM Azure AWS Agile

Unlocking the Power of AI with a Real-Time Data Strategy

CIO

FEBRUARY 14, 2023

To succeed with real-time AI, data ecosystems need to excel at handling fast-moving streams of events, operational data, and machine learning models to leverage insights and automate decision-making. It’s also used to deploy machine learning models, data streaming platforms, and databases.

Artificial Inteligence

Artificial Inteligence Strategy Data Machine Learning

Building Custom Runtimes with Editors in Cloudera Machine Learning

Cloudera

AUGUST 24, 2022

Cloudera Machine Learning (CML) is a cloud-native and hybrid-friendly machine learning platform. It unifies self-service data science and data engineering in a single, portable service as part of an enterprise data cloud for multi-function analytics on data anywhere. References.

Artificial Inteligence

Artificial Inteligence Machine Learning Open Source Windows

Integrating Key Vault Secrets with Azure Synapse Analytics

Apiumhub

DECEMBER 9, 2024

Azure Synapse Analytics acts as a data warehouse using dedicated SQL pools, but it is also a comprehensive analytics platform designed to handle a wide range of data processing and analytics tasks on structured and unstructured data. Also combines data integration with machine learning.

Azure

Azure Analytics Storage Artificial Inteligence

Article: Innovation Startups Modeling Agile Culture

InfoQ Culture Methods

JULY 17, 2020

To mix the power of the data and the importance of people to offer business intelligence is a key point nowadays. To be agile is to adapt to today's market. The result is not only the most imporant thing, the way you do it more important. By Alejandro Ruiz.

Agile

Agile Innovation Culture Business Intelligence

Article: How I Contributed as a Tester to a Machine Learning System: Opportunities, Challenges and Learnings

InfoQ Culture Methods

FEBRUARY 16, 2023

Have you ever wondered about systems based on machine learning? In those cases, testing takes a backseat. And even if testing is done, it’s done mostly by developers itself. A tester’s role is not clearly portrayed. Testers usually struggle to understand ML-based systems and explore what contributions they can make.

Artificial Inteligence

Artificial Inteligence Machine Learning System Testing

Big Data Engineer: Role, Responsibilities, and Job Description

Altexsoft

AUGUST 25, 2020

That’s why a data specialist with big data skills is one of the most sought-after IT candidates. Data Engineering positions have grown by half and they typically require big data skills. Data engineering vs big data engineering. Big data processing. maintaining data pipeline.

Big Data

Big Data Data Engineering Engineering Data

CDP Data Visualization: Self-Service Data Visualization For The Full Data Lifecycle

Cloudera

OCTOBER 29, 2020

From our release of advanced production machine learning features in Cloudera Machine Learning, to releasing CDP Data Engineering for accelerating data pipeline curation and automation; our mission has been to constantly innovate at the leading edge of enterprise data and analytics.

Data

Data Artificial Inteligence Machine Learning Analytics

P&G turns to AI to create digital manufacturing of the future

CIO

OCTOBER 1, 2022

We do that by leveraging data, AI, and automation with agility and scale across all dimensions of our business, accelerating innovation and increasing productivity in everything we do.”. Another element to achieving agility at scale is P&G’s “composite” approach to building teams in the IT organization. The power of people.

Artificial Inteligence

Artificial Inteligence Azure IoT Analytics

Key Data Engineer responsibilities

Apiumhub

JANUARY 26, 2022

Data engineer roles have gained significant popularity in recent years. Number of studies show that the number of data engineering job listings has increased by 50% over the year. And data science provides us with methods to make use of this data. Who are data engineers?

Data Engineering

Data Engineering Engineering Data Artificial Inteligence

Top 7 Tips for Scaling Your Artificial Intelligence Strategy

OTS Solutions

MARCH 14, 2019

They have started pilot projects that are associated with machine learning algorithms and their role in improving certain aspects of their business such as customer relationships and cyber security. This investment in AI technology is expected to continue. Include Responsibility and Accountability. The promise of AI is exciting.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Strategy Technical Review

PepsiCo transforms for the digital era

CIO

DECEMBER 1, 2022

Tapped to guide the company’s digital journey, as she had for firms such as P&G and Adidas, Kanioura has roughly 1,000 data engineers, software engineers, and data scientists working on a “human-centered model” to transform PepsiCo into a next-generation company.

Analytics

Analytics IoT KPI Azure

Foote Partners: bonus disparities reveal tech skills most in demand in Q3

CIO

DECEMBER 16, 2022

Other non-certified skills attracting a pay premium of 19% included data engineering , the Zachman Framework , Azure Key Vault and site reliability engineering (SRE). Close behind and rising fast, though, were security auditing and bioinformatics, offering a pay premium of 19%, up 18.8% since March.

Technical Review

Technical Review Analytics AWS SCRUM

The IBM Press Release on Spark That Every Tech Leader Should Read

CTOvision

JUNE 15, 2015

They also launched a plan to train over a million data scientists and data engineers on Spark. As data and analytics are embedded into the fabric of business and society –from popular apps to the Internet of Things (IoT) –Spark brings essential advances to large-scale data processing.

Open Source

Open Source Artificial Inteligence Machine Learning Big Data

How Prompt-Based Development Revolutionizes Machine Learning Workflows

Mentormate

DECEMBER 12, 2023

In a previous blog post, we introduced a five-phase framework to plan out Artificial Intelligence (AI) and Machine Learning (ML) initiatives. The Traditional Machine Learning Workflow Initiating a traditional ML project begins with collecting data. Duplicated records are identified and rectified.

Artificial Inteligence

Artificial Inteligence Machine Learning Technical Review Development

Modernizing Data Pipelines using Cloudera Data Platform – Part 1

Cloudera

JUNE 2, 2021

As critical elements in supplying trusted, curated, and usable data for end-to-end analytic and machine learning workflows, the role of data pipelines is becoming indispensable. To keep up, data pipelines are being vigorously reshaped with modern tools and techniques.

Data

Data Data Engineering Artificial Inteligence Machine Learning

What you need to know about product management for AI

O'Reilly Media - Ideas

MARCH 31, 2020

If you’re already a software product manager (PM), you have a head start on becoming a PM for artificial intelligence (AI) or machine learning (ML). AI products are automated systems that collect and learn from data to make user-facing decisions. Machine learning adds uncertainty.

Product Management

Product Management Artificial Inteligence Machine Learning Weak Development Team

How to Pinpoint Where Your Organization Wins (and Loses) with Data

CIO

NOVEMBER 29, 2022

By George Trujillo, Principal Data Strategist, DataStax Innovation is driven by the ease and agility of working with data. Increasing ROI for the business requires a strategic understanding of — and the ability to clearly identify — where and how organizations win with data.

Organization

Organization Technical Review Data Artificial Inteligence

Remote Data Science: How to Make it Work

Dataiku

MAY 20, 2020

Collaboration across teams : Data projects are not only about data, but also require strong involvement from business teams to build experience, generate buy-in, and validate relevance. They also require data engineering and other teams to help with the operationalization steps.

Data

Data How To Artificial Inteligence Machine Learning

Data Science on Steroids: Productionised Machine Learning as a Value Driver for Business

OpenCredo

JULY 31, 2018

Machine Learning, alongside a mature Data Science, will help to bring IT and business closer together. By leveraging data for actionable insights, IT will increasingly drive business value. The Role of Data. The reason for this is the central role that data plays in machine learning.

Artificial Inteligence

Artificial Inteligence Machine Learning Data Continuous Delivery

What is the CIO’s role today? Redefining transformational IT leadership

CIO

OCTOBER 19, 2022

John Hill, CIDO of MSC Industrial Supply, spends less of his time thinking deeply about technology and more about bringing organizational digital agility to MSC. So, at Zebra, we created a hub-and-spoke model, where the hub is data engineering and the spokes are machine learning experts embedded in the business functions.

Leadership

Leadership CTO Coach Culture Fractional CTO

DataOps and Hitachi Vantara

Hu's Place - HitachiVantara

APRIL 11, 2019

Few if any data management frameworks are business focused, to not only promote efficient use of data and allocation of resources, but also to curate the data to understand the meaning of the data as well as the technologies that are applied to the data so that data engineers can move and transform the essential data that data consumers need.

Data Engineering

Data Engineering Artificial Inteligence Machine Learning Technical Review

How to Turn your Data Center into a True Private Cloud

Cloudera

OCTOBER 13, 2021

On-premises, traditional data and analytics clusters are monolithic deployments of tight coupled compute and storage, unable to cope with current business demands of fast and agile use case deployment with services that are statically provisioned to physical infrastructure. The solution is clear, but the path to it is less so.

Data Center

Data Center Cloud Data How To

Five Trends for 2019

Hu's Place - HitachiVantara

JANUARY 3, 2019

Public cloud, agile methodologies and devops, RESTful APIs, containers, analytics and machine learning are being adopted. ” Deployments of large data hubs have only resulted in more data silos that are not easily understood, related, or shared.

Trends

Trends Artificial Inteligence Machine Learning Data Center

Cloudera Named a Visionary in the Gartner MQ for Cloud DBMS

Cloudera

APRIL 1, 2024

Cloudera, a leader in big data analytics, provides a unified Data Platform for data management, AI, and analytics. Our customers run some of the world’s most innovative, largest, and most demanding data science, data engineering, analytics, and AI use cases, including PB-size generative AI workloads.

Cloud

Cloud Artificial Inteligence Generative AI Analytics

Why 87% of AI/ML Projects Never Make It Into Production—And How to Fix It

d2iq

MARCH 31, 2022

Going from prototype to production is perilous when it comes to artificial intelligence (AI) and machine learning (ML). However, many organizations struggle moving from a prototype on a single machine to a scalable, production-grade deployment. And for the few models that are ever deployed, it takes 90 days or more to get there.

Artificial Inteligence

Artificial Inteligence Machine Learning How To Artificial Intelligence

9 Tech Conferences Not to Be Missed in October

Apiumhub

SEPTEMBER 20, 2023

From software architecture to artificial intelligence and machine learning, these conferences offer unparalleled insights, networking opportunities, and a glimpse into the future of technology. Learn more about the speakers and check out their schedule by visiting their site here. Interested in attending?

Conference

Conference Artificial Inteligence UI/UX Machine Learning

Analytics Maturity Model: Levels, Technologies, and Applications

Altexsoft

DECEMBER 9, 2020

Diagnostic analytics identifies patterns and dependencies in available data, explaining why something happened. Predictive analytics creates probable forecasts of what will happen in the future, using machine learning techniques to operate big data volumes. Introducing data engineering and data science expertise.

Analytics

Analytics Technical Review Technology Applications

From legacy to lakehouse: Centralizing insurance data with Delta Lake

Are you ready for MLOps? 🫵

What is data architecture? A framework to manage data

Webinars

Article: Agile Development Applied to Machine Learning Projects

IT leaders: What’s the gameplan as tech badly outpaces talent?

What is DataOps? Collaborative, cross-functional analytics

Building a vision for real-time artificial intelligence

What is Data Engineering: Explaining Data Pipeline, Data Warehouse, and Data Engineer Role

Union.ai raises $10M to simplify AI and ML workflow orchestration

Make the leap to Hybrid with Cloudera Data Engineering

AI Chihuahua! Part I: Why Machine Learning is Dogged by Failure and Delays

Why a data scientist is not a data engineer

Predictive analytics helps Fresenius anticipate dialysis complications

When is data too clean to be useful for enterprise AI?

What is data science? Transforming data into value

What CEOs really need from today’s CIOs

Top 8 IT certifications in demand today

Unlocking the Power of AI with a Real-Time Data Strategy

Building Custom Runtimes with Editors in Cloudera Machine Learning

Integrating Key Vault Secrets with Azure Synapse Analytics

Article: Innovation Startups Modeling Agile Culture

Article: How I Contributed as a Tester to a Machine Learning System: Opportunities, Challenges and Learnings

Big Data Engineer: Role, Responsibilities, and Job Description

CDP Data Visualization: Self-Service Data Visualization For The Full Data Lifecycle

P&G turns to AI to create digital manufacturing of the future

Key Data Engineer responsibilities

Top 7 Tips for Scaling Your Artificial Intelligence Strategy

PepsiCo transforms for the digital era

Foote Partners: bonus disparities reveal tech skills most in demand in Q3

The IBM Press Release on Spark That Every Tech Leader Should Read

How Prompt-Based Development Revolutionizes Machine Learning Workflows

Modernizing Data Pipelines using Cloudera Data Platform – Part 1

What you need to know about product management for AI

How to Pinpoint Where Your Organization Wins (and Loses) with Data

Remote Data Science: How to Make it Work

Data Science on Steroids: Productionised Machine Learning as a Value Driver for Business

What is the CIO’s role today? Redefining transformational IT leadership

DataOps and Hitachi Vantara

How to Turn your Data Center into a True Private Cloud

Five Trends for 2019

Cloudera Named a Visionary in the Gartner MQ for Cloud DBMS

Why 87% of AI/ML Projects Never Make It Into Production—And How to Fix It

9 Tech Conferences Not to Be Missed in October

Analytics Maturity Model: Levels, Technologies, and Applications

Stay Connected