Architecture, Data and Data Engineering

What is data architecture? A framework to manage data

CIO

DECEMBER 20, 2024

Data architecture definition Data architecture describes the structure of an organizations logical and physical data assets, and data management resources, according to The Open Group Architecture Framework (TOGAF). An organizations data architecture is the purview of data architects.

Architecture

Architecture Data Fractional CTO Technical Review

The key to operational AI: Modern data architecture

CIO

NOVEMBER 27, 2024

From customer service chatbots to marketing teams analyzing call center data, the majority of enterprises—about 90% according to recent data —have begun exploring AI. For companies investing in data science, realizing the return on these investments requires embedding AI deeply into business processes.

Architecture

Architecture Artificial Inteligence Data Development Team Review

The future of data: A 5-pillar approach to modern data management

CIO

DECEMBER 11, 2024

In todays economy, as the saying goes, data is the new gold a valuable asset from a financial standpoint. A similar transformation has occurred with data. More than 20 years ago, data within organizations was like scattered rocks on early Earth.

Data

Data Technical Review Software Review Weak Development Team

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

CIO

NOVEMBER 19, 2024

The next phase of this transformation requires an intelligent data infrastructure that can bring AI closer to enterprise data. The challenges of integrating data with AI workflows When I speak with our customers, the challenges they talk about involve integrating their data and their enterprise AI workflows.

Artificial Inteligence

Artificial Inteligence Engineering Data Storage

The Next-Generation Cloud Data Lake: An Open, No-Copy Data Architecture

In an effort to be data-driven, many organizations are looking to democratize data. However, they often struggle with increasingly larger data volumes, reverting back to bottlenecking data access to manage large numbers of data engineering requests and rising data warehousing costs.

Architecture

What is a data engineer? An analytics role in high demand

CIO

SEPTEMBER 14, 2023

What is a data engineer? Data engineers design, build, and optimize systems for data collection, storage, access, and analytics at scale. They create data pipelines that convert raw data into formats usable by data scientists, data-centric applications, and other data consumers.

Data Engineering

Data Engineering Analytics Engineering Data

What is a data engineer? An analytics role in high demand

CIO

AUGUST 9, 2022

What is a data engineer? Data engineers design, build, and optimize systems for data collection, storage, access, and analytics at scale. They create data pipelines used by data scientists, data-centric applications, and other data consumers. The data engineer role.

Data Engineering

Data Engineering Analytics Engineering Data

The evolution of data science, data engineering, and AI

O'Reilly Media - Data

MAY 24, 2018

The O’Reilly Data Show Podcast: A special episode to mark the 100th episode. This episode of the Data Show marks our 100th episode. We had a collection of friends who were key members of the data science and big data communities on hand and we decided to record short conversations with them.

Data Engineering

Data Engineering Engineering Data Artificial Inteligence

Fundamentals of Data Engineering

Xebia

JANUARY 19, 2023

The following is a review of the book Fundamentals of Data Engineering by Joe Reis and Matt Housley, published by O’Reilly in June of 2022, and some takeaway lessons. This book is as good for a project manager or any other non-technical role as it is for a computer science student or a data engineer.

Data Engineering

Data Engineering Engineering Data Technical Review

IT leaders: What’s the gameplan as tech badly outpaces talent?

CIO

MARCH 13, 2025

Hes seeing the need for professionals who can not only navigate the technology itself, but also manage increasing complexities around its surrounding architectures, data sets, infrastructure, applications, and overall security. There are data scientists, but theyre expensive, he says.

Part-Time VPE

Part-Time VPE Weak Development Team Fractional VPE Fractional CTO

What is a data architect? Skills, salaries, and how to become a data framework master

CIO

OCTOBER 13, 2023

Data architect role Data architects are senior visionaries who translate business requirements into technology requirements and define data standards and principles, often in support of data or digital transformations. Data architects are frequently part of a data science team and tasked with leading data system projects.

Data

Data Database Administration Data Engineering Artificial Inteligence

What is Data Engineering: Explaining Data Pipeline, Data Warehouse, and Data Engineer Role

Altexsoft

JUNE 25, 2019

Being at the top of data science capabilities, machine learning and artificial intelligence are buzzing technologies many organizations are eager to adopt. However, they often forget about the fundamental work – data literacy, collection, and infrastructure – that must be done prior to building intelligent data products.

Data Engineering

Data Engineering Engineering Data Artificial Inteligence

How AI orchestration has become more important than the models themselves

CIO

DECEMBER 10, 2024

As many companies that have already adopted off-the-shelf GenAI models have found, getting these generic LLMs to work for highly specialized workflows requires a great deal of customization and integration of company-specific data. million on inference, grounding, and data integration for just proof-of-concept AI projects.

Artificial Inteligence

Artificial Inteligence Off-The-Shelf Insurance Analytics

RudderStack raises $56M for its customer data platform

TechCrunch

FEBRUARY 2, 2022

RudderStack , a platform that focuses on helping businesses build their customer data platforms to improve their analytics and marketing efforts, today announced that it has raised a $56 million Series B round led by Insight Partners, with previous investors Kleiner Perkins and S28 Capital also participating. Image Credits: RudderStack.

Data

Data Machine Learning Artificial Inteligence Architecture

Cloudera Data Engineering 2021 Year End Review

Cloudera

DECEMBER 21, 2021

Since the release of Cloudera Data Engineering (CDE) more than a year ago , our number one goal was operationalizing Spark pipelines at scale with first class tooling designed to streamline automation and observability. Data pipelines are composed of multiple steps with dependencies and triggers.

Data Engineering

Data Engineering Technical Review Software Review Engineering

Ready to transform how your IT organization drives business outcomes with AIOps?

CIO

JANUARY 3, 2025

Today, IT encompasses site reliability engineering (SRE), platform engineering, DevOps, and automation teams, and the need to manage services across multi-cloud and hybrid-cloud environments in addition to legacy systems. At the same time, the scale of observability data generated from multiple tools exceeds human capacity to manage.

Organization

Organization Artificial Intelligence Artificial Inteligence DevOps

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning - AI

NOVEMBER 20, 2024

In today’s data-intensive business landscape, organizations face the challenge of extracting valuable insights from diverse data sources scattered across their infrastructure. The solution combines data from an Amazon Aurora MySQL-Compatible Edition database and data stored in an Amazon Simple Storage Service (Amazon S3) bucket.

Data

Data AWS Groups Knowledge Base

How Cloudera Data Flow Enables Successful Data Mesh Architectures

Cloudera

OCTOBER 7, 2021

In this blog, I will demonstrate the value of Cloudera DataFlow (CDF) , the edge-to-cloud streaming data platform available on the Cloudera Data Platform (CDP) , as a Data integration and Democratization fabric. Introduction to the Data Mesh Architecture and its Required Capabilities.

Architecture

Architecture Data Security Technical Review

Meroxa raises $15M Series A for its real-time data platform

TechCrunch

APRIL 13, 2021

Meroxa , a startup that makes it easier for businesses to build the data pipelines to power both their analytics and operational workflows, today announced that it has raised a $15 million Series A funding round led by Drive Capital. “Honestly, people come to us as a real-time FiveTran or real-time data warehouse sink. .

Data

Data Software Engineering Open Source Engineering

Make the leap to Hybrid with Cloudera Data Engineering

Cloudera

FEBRUARY 14, 2022

When we introduced Cloudera Data Engineering (CDE) in the Public Cloud in 2020 it was a culmination of many years of working alongside companies as they deployed Apache Spark based ETL workloads at scale. It’s no longer driven by data volumes, but containerization, separation of storage and compute, and democratization of analytics.

Data Engineering

Data Engineering Engineering Data Storage

Liberty Mutual CIO Monica Caldas on developing a digital-savvy workforce

CIO

NOVEMBER 7, 2024

We are continuously deploying new data capabilities and insights, we are pushing forward with our digital progression agenda, and we’re also building these generative AI capabilities internally to help our employees have more productivity in their day to day. What’s your mindset when it comes to data? We’re modernizing our ecosystem.

Artificial Inteligence

Artificial Inteligence Development Generative AI Artificial Intelligence

3 promises every CIO should keep in 2025

CIO

JANUARY 22, 2025

The trouble is, when people in the business do their own thing, IT loses control, and protecting against loss of data and intellectual property becomes an even bigger concern. AI models will be developed differently for different industries, and different data will be used to train for the healthcare industry than for logistics, for example.

Weak Development Team

Weak Development Team Education Meeting Data

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

Cloudera

OCTOBER 23, 2024

In August, we wrote about how in a future where distributed data architectures are inevitable, unifying and managing operational and business metadata is critical to successfully maximizing the value of data, analytics, and AI.

Data

Data Analytics Systems Review Architecture

Data Scientist vs Data Engineer: Differences and Why You Need Both

Altexsoft

OCTOBER 30, 2021

Explaining the difference, especially when they both work with something intangible such as data , is difficult. If you’re an executive who has a hard time understanding the underlying processes of data science and get confused with terminology, keep reading. Data science vs data engineering.

Data Engineering

Data Engineering Engineering Data Artificial Inteligence

Porsche Carrera Cup Brasil gets real-time data boost

CIO

MAY 21, 2024

In the annual Porsche Carrera Cup Brasil, data is essential to keep drivers safe and sustain optimal performance of race cars. Until recently, getting at and analyzing that essential data was a laborious affair that could take hours, and only once the race was over. The process took between 30 minutes and two hours.

Data

Data Azure Engineering Analytics

Firebolt, a data warehouse startup, raises $100M at a $1.4B valuation for faster, cheaper analytics on large data sets

TechCrunch

JANUARY 26, 2022

Israeli startup Firebolt has been taking on Google’s BigQuery, Snowflake and others with a cloud data warehouse solution that it claims can run analytics on large datasets cheaper and faster than its competitors. Big data is at the heart of how a lot of applications, and a lot of business overall, works these days.

Analytics

Analytics Data Big Data Business Intelligence

Scala returning to its origins: A tale of 4 chapters

Xebia

APRIL 9, 2025

For example, events such as Twitters rebranding to X, and PySparks rise in the data engineering realm over Spark have all contributed to this decline. The initial excitement that once propelled the language into the limelight during the mid-2010s has diminished over the last 15 years.

Systems Review

Systems Review Programming Technical Review Engineering

Manta, a data observability startup, raises $35M to grow its workforce

TechCrunch

MAY 26, 2022

Over the last decade, the rate at which organizations create data has accelerated as it becomes cheaper to store, access, and process data. But as data continues to grow in scale and complexity, it’s becoming scattered across apps and platforms — often leading to problems where it concerns data quality.

Data

Data Weak Development Team Enterprise Analysis

Breaking down data silos for digital success

CIO

NOVEMBER 7, 2023

For years, IT and business leaders have been talking about breaking down the data silos that exist within their organizations. In fact, as companies undertake digital transformations , usually the data transformation comes first, and doing so often begins with breaking down data — and political — silos in various corners of the enterprise.

Data

Data Artificial Inteligence Architecture Analytics

The rise of the data lakehouse: A new era of data value

CIO

AUGUST 18, 2022

To find out, he queried Walgreens’ data lakehouse, implemented with Databricks technology on Microsoft Azure. “We Previously, Walgreens was attempting to perform that task with its data lake but faced two significant obstacles: cost and time. Enter the data lakehouse. Lakehouses redeem the failures of some data lakes.

Data

Data Technical Advisors Technical Review Artificial Inteligence

Data collection and data markets in the age of privacy and machine learning

O'Reilly Media - Data

JULY 18, 2018

While models and algorithms garner most of the media coverage, this is a great time to be thinking about building tools in data. In this post I share slides and notes from a keynote I gave at the Strata Data Conference in London at the end of May. Economic value of data.

Artificial Inteligence

Artificial Inteligence Machine Learning Data Marketing

The Modern Data Lakehouse: An Architectural Innovation

Cloudera

SEPTEMBER 9, 2022

The promise of a modern data lakehouse architecture. Imagine having self-service access to all business data, anywhere it may be, and being able to explore it all at once. Imagine quickly answering burning business questions nearly instantly, without waiting for data to be found, shared, and ingested.

Architecture

Architecture Innovation Data Open Source

1. Streamlining Membership Data Engineering at Netflix with Psyberg

Netflix Tech

NOVEMBER 14, 2023

By Abhinaya Shetty , Bharath Mummadisetty At Netflix, our Membership and Finance Data Engineering team harnesses diverse data related to plans, pricing, membership life cycle, and revenue to fuel analytics, power various dashboards, and make data-informed decisions. What is late-arriving data? Let’s dive in!

Data Engineering

Data Engineering Engineering Data Systems Review

Remember when developers reigned supreme? The market for software coding goes soft

CIO

APRIL 1, 2025

Job titles like data engineer, machine learning engineer, and AI product manager have supplanted traditional software developers near the top of the heap as companies rush to adopt AI and cybersecurity professionals remain in high demand. Demand for developers is simply growing at a slower rate than other IT roles.

Marketing

Marketing Software Development Software Development

6 strategic imperatives for your next data strategy

CIO

JUNE 23, 2023

According to the MIT Technology Review Insights Survey, an enterprise data strategy supports vital business objectives including expanding sales, improving operational efficiency, and reducing time to market. The problem is today, just 13% of organizations excel at delivering on their data strategy.

Strategy

Strategy Technical Review Data Software Review

A Recap of the Data Engineering Open Forum at Netflix

Netflix Tech

JUNE 20, 2024

A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024 The Data Engineering Open Forum at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.

Data Engineering

Data Engineering Engineering Data Generative AI

Databand raises $14.5M led by Accel for its data pipeline observability tools

TechCrunch

DECEMBER 1, 2020

DevOps continues to get a lot of attention as a wave of companies develop more sophisticated tools to help developers manage increasingly complex architectures and workloads. And as data workloads continue to grow in size and use, they continue to become ever more complex. ” Not a great scenario. .”

Tools

Tools Data Weak Development Team Big Data

Accelerate Your Data Mesh in the Cloud with Cloudera Data Engineering and Modak NabuTM

Cloudera

OCTOBER 11, 2021

Modak, a leading provider of modern data engineering solutions, is now a certified solution partner with Cloudera. Customers can now seamlessly automate migration to Cloudera’s Hybrid Data Platform — Cloudera Data Platform (CDP) to dynamically auto-scale cloud services with Cloudera Data Engineering (CDE) integration with Modak Nabu.

Data Engineering

Data Engineering Engineering Data Cloud

You still don’t need a feature store

Xebia

MARCH 13, 2025

The implementation was a over-engineered custom Feast implementation using unsupported backend data stores. The engineer that implemented it had left the company by the time I joined. Mind, data lineage and discoverability become paramount when collaborating on features. You have complete access to all historical data.

Training

Training Artificial Inteligence Machine Learning Data

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Cloudera

DECEMBER 2, 2024

Cloudera is committed to providing the most optimal architecture for data processing, advanced analytics, and AI while advancing our customers’ cloud journeys. Together, Cloudera and AWS empower businesses to optimize performance for data processing, analytics, and AI while minimizing their resource consumption and carbon footprint.

Sustainability

Sustainability AWS Analytics Infrastructure

Coalesce lands fresh capital to transform data at ‘enterprise scale’

TechCrunch

SEPTEMBER 29, 2022

Coalesce is a startup that offers data transformation tools geared mainly toward enterprise customers. Petrossian met Coalesce’s other co-founder, Satish Jayanthi, at WhereScape, where the two were responsible for solving data warehouse problems for large organizations. (In

Enterprise

Enterprise Data Business Intelligence Analytics

Heartex raises $25M for its AI-focused, open source data labeling platform

TechCrunch

MAY 18, 2022

Heartex, a startup that bills itself as an “open source” platform for data labeling, today announced that it landed $25 million in a Series A funding round led by Redpoint Ventures. We agreed that the only viable solution was to have internal teams with domain expertise be responsible for annotating and curating training data.

Open Source

Open Source Weak Development Team Data Artificial Inteligence

How to Pinpoint Where Your Organization Wins (and Loses) with Data

CIO

NOVEMBER 29, 2022

By George Trujillo, Principal Data Strategist, DataStax Innovation is driven by the ease and agility of working with data. Increasing ROI for the business requires a strategic understanding of — and the ability to clearly identify — where and how organizations win with data.

Organization

Organization Technical Review Data Artificial Inteligence

What is DataOps? Collaborative, cross-functional analytics

CIO

DECEMBER 22, 2022

DataOps (data operations) is an agile, process-oriented methodology for developing and delivering analytics. It brings together DevOps teams with data engineers and data scientists to provide the tools, processes, and organizational structures to support the data-focused enterprise. What is DataOps?

Analytics

Analytics Data Engineering Artificial Inteligence Machine Learning

What is data architecture? A framework to manage data

The key to operational AI: Modern data architecture

Webinars

Trending Sources

The future of data: A 5-pillar approach to modern data management

Webinars

Comprehensive data management for AI: The next-gen data management engine that will drive AI to new heights

The Next-Generation Cloud Data Lake: An Open, No-Copy Data Architecture

What is a data engineer? An analytics role in high demand

What is a data engineer? An analytics role in high demand

The evolution of data science, data engineering, and AI

Fundamentals of Data Engineering

IT leaders: What’s the gameplan as tech badly outpaces talent?

What is a data architect? Skills, salaries, and how to become a data framework master

What is Data Engineering: Explaining Data Pipeline, Data Warehouse, and Data Engineer Role

How AI orchestration has become more important than the models themselves

RudderStack raises $56M for its customer data platform

Cloudera Data Engineering 2021 Year End Review

Ready to transform how your IT organization drives business outcomes with AIOps?

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

How Cloudera Data Flow Enables Successful Data Mesh Architectures

Meroxa raises $15M Series A for its real-time data platform

Make the leap to Hybrid with Cloudera Data Engineering

Liberty Mutual CIO Monica Caldas on developing a digital-savvy workforce

3 promises every CIO should keep in 2025

Cloudera and Snowflake Partner to Deliver the Most Comprehensive Open Data Lakehouse

Data Scientist vs Data Engineer: Differences and Why You Need Both

Porsche Carrera Cup Brasil gets real-time data boost

Firebolt, a data warehouse startup, raises $100M at a $1.4B valuation for faster, cheaper analytics on large data sets

Scala returning to its origins: A tale of 4 chapters

Manta, a data observability startup, raises $35M to grow its workforce

Breaking down data silos for digital success

The rise of the data lakehouse: A new era of data value

Data collection and data markets in the age of privacy and machine learning

The Modern Data Lakehouse: An Architectural Innovation

1. Streamlining Membership Data Engineering at Netflix with Psyberg

Remember when developers reigned supreme? The market for software coding goes soft

6 strategic imperatives for your next data strategy

A Recap of the Data Engineering Open Forum at Netflix

Databand raises $14.5M led by Accel for its data pipeline observability tools

Accelerate Your Data Mesh in the Cloud with Cloudera Data Engineering and Modak NabuTM

You still don’t need a feature store

Cloudera and AWS Partner to Deliver Cost-Efficient and Sustainable Infrastructure for AI and Analytics

Coalesce lands fresh capital to transform data at ‘enterprise scale’

Heartex raises $25M for its AI-focused, open source data labeling platform

How to Pinpoint Where Your Organization Wins (and Loses) with Data

What is DataOps? Collaborative, cross-functional analytics

Stay Connected