Data Engineering, Demo and Machine Learning

New Applied ML Prototypes Now Available in Cloudera Machine Learning

Cloudera

NOVEMBER 17, 2021

You know the one, the mathematician / statistician / computer scientist / data engineer / industry expert. Some companies are starting to segregate the responsibilities of the unicorn data scientist into multiple roles (data engineer, ML engineer, ML architect, visualization developer, etc.),

Artificial Inteligence

Artificial Inteligence Machine Learning Hotels Data Engineering

Machine Learning with Python, Jupyter, KSQL and TensorFlow

Confluent

FEBRUARY 6, 2019

Building a scalable, reliable and performant machine learning (ML) infrastructure is not easy. It takes much more effort than just building an analytic model with Python and your favorite machine learning framework. Impedance mismatch between data scientists, data engineers and production engineers.

Artificial Inteligence

Artificial Inteligence Machine Learning Scalability Data Engineering

Next Stop – Predicting on Data with Cloudera Machine Learning

Cloudera

APRIL 9, 2021

The second blog dealt with creating and managing Data Enrichment pipelines. The third video in the series highlighted Reporting and Data Visualization. Specifically, we’ll focus on training Machine Learning (ML) models to forecast ECC part production demand across all of its factories. Data Collection – streaming data.

Artificial Inteligence

Artificial Inteligence Machine Learning Data Data Engineering

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Building a Machine Learning Application With Cloudera Data Science Workbench And Operational Database, Part 3: Productionization of ML models

Cloudera

JANUARY 20, 2021

In this last installment, we’ll discuss a demo application that uses PySpark.ML to make a classification model based off of training data stored in both Cloudera’s Operational Database (powered by Apache HBase) and Apache HDFS. Machine learning is now being used to solve many real-time problems. Background / Overview.

Artificial Inteligence

Artificial Inteligence Machine Learning Applications Data

Simplify your workflow deployment with Databricks Asset Bundles: Part I

Xebia

DECEMBER 26, 2024

Databricks is now a top choice for data teams. Its user-friendly, collaborative platform simplifies building data pipelines and machine learning models. Many data practitioners, myself included, have faced various deployment and resource management strategies. You must build a data ingestion app.

Resources

Resources Testing Infrastructure Applications

What you need to know about product management for AI

O'Reilly Media - Ideas

MARCH 31, 2020

If you’re already a software product manager (PM), you have a head start on becoming a PM for artificial intelligence (AI) or machine learning (ML). AI products are automated systems that collect and learn from data to make user-facing decisions. Machine learning adds uncertainty.

Product Management

Product Management Artificial Inteligence Machine Learning Weak Development Team

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

AWS Machine Learning - AI

APRIL 1, 2025

With App Studio, technical professionals such as IT project managers, data engineers, enterprise architects, and solution architects can quickly develop applications tailored to their organizations needswithout requiring deep software development skills.

AWS

AWS Software Review Technical Review Generative AI

Happy Birthday, CDP Public Cloud

Cloudera

OCTOBER 13, 2020

In the beginning, CDP ran only on AWS with a set of services that supported a handful of use cases and workload types: CDP Data Warehouse: a kubernetes-based service that allows business analysts to deploy data warehouses with secure, self-service access to enterprise data. Predict – Data Engineering (Apache Spark).

Cloud

Cloud Artificial Inteligence Machine Learning Data Engineering

Forget the Rules, Listen to the Data

Hu's Place - HitachiVantara

MAY 10, 2019

Rule-based fraud detection software is being replaced or augmented by machine-learning algorithms that do a better job of recognizing fraud patterns that can be correlated across several data sources. DataOps is required to engineer and prepare the data so that the machine learning algorithms can be efficient and effective.

Data

Data Artificial Inteligence Machine Learning Weak Development Team

What I have been working on: Modal

Erik Bernhardsson

DECEMBER 6, 2022

We've been focusing a lot on machine learning recently, in particular model inference — Stable Diffusion is obviously the coolest thing right now, but we also support a wide range of other things: Using OpenAI's Whisper model for transcription , Dreambooth , object detection (with a webcam demo!).

CTO Coach

CTO Coach Fractional CTO Software Engineering Serverless

9 Great Reasons to Join the DataRobot AI Experience Virtual Event Jun 7-8

DataRobot

JUNE 1, 2022

As a partner of the McLaren Formula 1 Team , DataRobot is excited to share an exclusive view of how McLaren uses machine learning and AI. Learn how the McLaren Formula 1 Team is delivering AI-powered predictions and insights to maximize performance and optimize simulations. New DataRobot AI Cloud Product Announcements.

Virtualization

Virtualization Artificial Inteligence Machine Learning Healthcare

Advancing AI Cloud with Release 7.2

DataRobot

SEPTEMBER 14, 2021

As AI continues to advance at such an aggressive pace, solutions built on machine learning are quickly becoming the new norm. Data scientists and data engineers want full control over every aspect of their machine learning solutions and want coding interfaces so that they can use their favorite libraries and languages.

Cloud

Cloud Artificial Inteligence Machine Learning Data Engineering

An A-Z Data Adventure on Cloudera’s Data Platform

Cloudera

DECEMBER 21, 2020

In this blog we will take you through a persona-based data adventure, with short demos attached, to show you the A-Z data worker workflow expedited and made easier through self-service, seamless integration, and cloud-native technologies. Company data exists in the data lake. The Data Scientist.

Data

Data Virtualization Banking Data Engineering

Why 87% of AI/ML Projects Never Make It Into Production—And How to Fix It

d2iq

MARCH 31, 2022

Going from prototype to production is perilous when it comes to artificial intelligence (AI) and machine learning (ML). However, many organizations struggle moving from a prototype on a single machine to a scalable, production-grade deployment. And for the few models that are ever deployed, it takes 90 days or more to get there.

Artificial Inteligence

Artificial Inteligence Machine Learning How To Artificial Intelligence

Data Innovation Summit with Gema Parreño – lead data scientist at Apiumhub

Apiumhub

JUNE 22, 2021

Data Innovation Summit topics. Same as last year, the event offers six workshops (crash-course) themes, each dedicated to a unique domain area: Data-driven Strategy, Analytics & Visualisation, Machine Learning, IoT Analytics & Data Management, Data Management and Data Engineering.

Innovation

Innovation Data Technical Review Artificial Inteligence

The Third Generation of XDR Has Arrived!

Palo Alto Networks

AUGUST 23, 2021

We wanted to provide a modern cloud-based platform leveraging the latest in machine learning, analytics and automation to fight the many cyber attacks businesses face every day. Cortex XDR’s Third-Party Data Engine Now Delivers the Ability to Ingest, Normalize, Correlate, Query and Analyze Data from Virtually Any Source.

Cloud

Cloud Artificial Inteligence Machine Learning Analytics

Digital Transformation is a Data Journey From Edge to Insight

Cloudera

JANUARY 20, 2021

Predictive Analytics – predictive analytics based upon AI and machine learning (Fraud detection, predictive maintenance, demand based inventory optimization as examples). Security & Governance – an integrated set of security, management and governance technologies across the entire data lifecycle.

Data

Data Artificial Inteligence Analytics Machine Learning

Announcing Cloudera’s Enterprise Artificial Intelligence Partnership Ecosystem

Cloudera

DECEMBER 20, 2023

The data management platform, models, and end applications are powered by cloud infrastructure and/or specialized hardware. In a stack including Cloudera Data Platform the applications and underlying models can also be deployed from the data management platform via Cloudera Machine Learning.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Enterprise Machine Learning

Simplify your workflow deployment with Databricks Asset Bundles: Part I

Xebia

DECEMBER 26, 2024

Databricks is now a top choice for data teams. Its user-friendly, collaborative platform simplifies building data pipelines and machine learning models. Many data practitioners, myself included, have faced various deployment and resource management strategies. You must build a data ingestion app.

Resources

Resources Testing Infrastructure Applications

Simplify your workflow deployment with Databricks Asset Bundles: Part I

Xebia

DECEMBER 26, 2024

Databricks is now a top choice for data teams. Its user-friendly, collaborative platform simplifies building data pipelines and machine learning models. Many data practitioners, myself included, have faced various deployment and resource management strategies. You must build a data ingestion app.

Resources

Resources Testing Infrastructure Applications

The Good and the Bad of Databricks Lakehouse Platform

Altexsoft

MARCH 30, 2023

What is Databricks Databricks is an analytics platform with a unified set of tools for data engineering, data management , data science, and machine learning. Besides that, it’s fully compatible with various data ingestion and ETL tools. How data engineering works in 14 minutes.

Weak Development Team

Weak Development Team Artificial Inteligence Machine Learning Software Review

Educating ChatGPT on Data Lakehouse

Cloudera

MARCH 17, 2023

At Cloudera, we also provide machine learning as part of our lakehouse, so data scientists get easy access to reliable data in the data lakehouse to quickly launch new machine learning projects and build and deploy new models for advanced analytics.

ChatGPT

ChatGPT Education Data Comparison

DataOps Uncovered: A Bold New Approach to Telemetry and Network Visibility

Kentik

APRIL 12, 2023

Data scientists play a critical role in the DataOps ecosystem, leveraging advanced analytics and machine learning techniques to gain insights from large and complex data sets. DataOps team roles In a DataOps team, several key roles work together to ensure the data pipeline is efficient, reliable, and scalable.

Network

Network Data Engineering Artificial Inteligence Machine Learning

From Hive Tables to Iceberg Tables: Hassle-Free

Cloudera

JULY 14, 2023

While these instructions are carried out for Cloudera Data Platform (CDP), Cloudera Data Engineering, and Cloudera Data Warehouse, one can extrapolate them easily to other services and other use cases as well. Watch our webinar Supercharge Your Analytics with Open Data Lakehouse Powered by Apache Iceberg.

Backup

Backup Data Engineering Engineering Data

3 Major Trends at Strata New York 2017

DataRobot

OCTOBER 3, 2017

Enterprise data architects, data engineers, and business leaders from around the globe gathered in New York last week for the 3-day Strata Data Conference , which featured new technologies, innovations, and many collaborative ideas. DataRobot Data Prep. free trial. Try now for free.

Trends

Trends Azure Conference Media

Delivering the Next Generation of AI with DataRobot AI Cloud

DataRobot

SEPTEMBER 14, 2021

AI Cloud brings together any type of data, from any source, giving you a unique, global view of insights that drive your business. All of this is part of a unified, integrated platform spanning data engineering, machine learning, decision intelligence, and continuous AI – the entire AI lifecycle.

Cloud

Cloud Artificial Inteligence Machine Learning Data Center

An Overview of the Top Text Annotation Tools For Natural Language Processing

John Snow Labs

MAY 24, 2023

Almost 90% of the machine learning models encounter delays and never make it into production. Developing a machine learning model requires a big amount of training data. Therefore, the data needs to be properly labeled/categorized for a particular use case.

Tools

Tools Artificial Inteligence Machine Learning Software Review

Accenture’s Smart Data Transition Toolkit Now Available for Cloudera Data Platform

Cloudera

AUGUST 31, 2021

It outperforms other data warehouses on all sizes and types of data, including structured and unstructured, while scaling cost-effectively past petabytes. Running on CDW is fully integrated with streaming, data engineering, and machine learning analytics. Demo Video. Solution brief. Contributors: .

Data

Data Analytics Cloud Technical Review

12 Times Faster Query Planning With Iceberg Manifest Caching in Impala

Cloudera

JULY 13, 2023

Watch our webinar Supercharge Your Analytics with Open Data Lakehouse Powered by Apache Iceberg. It includes a live demo recording of Iceberg capabilities. Try Cloudera Data Warehouse (CDW), Cloudera Data Engineering (CDE), and Cloudera Machine Learning (CML) by signing up for a 60 day trial , or test drive CDP.

Weak Development Team

Weak Development Team Engineering Analytics Storage

Apiumhub among top IT industry leaders in Code Europe event

Apiumhub

AUGUST 12, 2021

Gema Parreño Piqueras – Lead Data Science @Apiumhub Gema Parreno is currently a Lead Data Scientist at Apiumhub, passionate about machine learning and video games, with three years of experience at BBVA and later at Google in ML Prototype. Twitter: [link] Linkedin: [link]. Twitter: ??

Industry

Industry Technical Advisors CTO Coach Azure

Data Governance: Concept, Models, Framework, Tools, and Implementation Best Practices

Altexsoft

MARCH 2, 2023

Source: McKinsy&Company For example, a data science team may spend 70 to 80 percent of their time preparing data for machine learning projects , with a prevailing part of this time being spent on data cleansing alone. Learn how data is prepared for machine learning in our dedicated video.

Government

Government Tools Data Weak Development Team

Five Takeaways from HashiConf US 2019: Building Infrastructure in a Multi-* World

Daniel Bryant

SEPTEMBER 13, 2019

What was worth noting was that (anecdotally) even engineers from large organisations were not looking for full workload portability (i.e. There were also two patterns of adoption of HashiCorp tooling I observed from engineers that I chatted to: Infrastructure-driven?—?in

Infrastructure

Infrastructure Azure Software Engineering Cloud

A brave new (generative) world – The future of generative software engineering

Capgemini

MARCH 31, 2024

The AI evolution: Transforming software engineering In the past year, the landscape of tech has seen unprecedented upheaval. Generative AI (GenAI) has catapulted data science, machine learning, and AI into the limelight, sparking conversations and at all levels of business and democratizing access to the power of AI.

Software Engineering

Software Engineering Engineering Software Generative AI

TIBCO’s Innovation Streak Continues with Exciting New Product Announcements and Enhancements

TIBCO - Connected Intelligence

DECEMBER 20, 2021

TIBCO DQ will become the new data quality product family, through an evolution of our current data quality offerings, significantly enhancing current capabilities available throughout the TIBCO data fabric with built-in AI and ML to automate quality, detection, monitoring, and anomaly resolution.

Innovation

Innovation Analytics Virtualization Cloud

Technology Trends for 2022

O'Reilly Media - Ideas

JANUARY 25, 2022

A quick look at bigram usage (word pairs) doesn’t really distinguish between “data science,” “data engineering,” “data analysis,” and other terms; the most common word pair with “data” is “data governance,” followed by “data science.” But these topics are relatively small and narrow.

Trends

Trends Technical Review Technology Artificial Inteligence

The death of Agile?

O'Reilly Media - Ideas

MARCH 2, 2020

It’s reasonable to have something to demo in two weeks (or whatever interval you choose). This year’s growth in Python usage was buoyed by its increasing popularity among data scientists and machine learning (ML) and artificial intelligence (AI) engineers. Data quality might get worse before it gets better.

Agile

Agile Artificial Inteligence Weak Development Team SCRUM

The Good and the Bad of Apache Airflow Pipeline Orchestration

Altexsoft

NOVEMBER 7, 2022

You can hardly compare data engineering toil with something as easy as breathing or as fast as the wind. The platform went live in 2015 at Airbnb, the biggest home-sharing and vacation rental site, as an orchestrator for increasingly complex data pipelines. How data engineering works. What is Apache Airflow?

Weak Development Team

Weak Development Team Technical Review Software Review Data Engineering

Cost Conscious Data Warehousing with Cloudera Data Platform

Cloudera

DECEMBER 10, 2020

These file formats not only help avoid data duplication into proprietary storage formats but also provide highly efficient storage formats. Multiple analytical engines (data warehousing, machine learning, data engineering, and so on) can operate on the same data in these file formats.

Data

Data Technical Review Storage Systems Review

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Cloudera

DECEMBER 3, 2024

Setup Ranger Policy to allow “rest-demo” access for sharing: Create a policy that will allow the “rest-demo” role to have read access to the Carriers table, but will have no access to read the Airports table. In this case I’m using a role named – “UnitedAirlinesRole” that I can use to share data.

Data

Data Disaster Recovery Airlines Policies

Build agentic systems with CrewAI and Amazon Bedrock

AWS Machine Learning - AI

MARCH 31, 2025

Our use case demo implements a specialized team of three agents, each with distinct responsibilities that mirror roles you might find in a professional security consulting firm: Infrastructure mapper Acts as our system architect, methodically documenting AWS resources and their configurations.

Systems Review

Systems Review System Artificial Inteligence AWS

New Applied ML Prototypes Now Available in Cloudera Machine Learning

Machine Learning with Python, Jupyter, KSQL and TensorFlow

Next Stop – Predicting on Data with Cloudera Machine Learning

Webinars

Building a Machine Learning Application With Cloudera Data Science Workbench And Operational Database, Part 3: Productionization of ML models

Simplify your workflow deployment with Databricks Asset Bundles: Part I

What you need to know about product management for AI

AWS App Studio introduces a prebuilt solutions catalog and cross-instance Import and Export

Happy Birthday, CDP Public Cloud

Forget the Rules, Listen to the Data

What I have been working on: Modal

9 Great Reasons to Join the DataRobot AI Experience Virtual Event Jun 7-8

Advancing AI Cloud with Release 7.2

An A-Z Data Adventure on Cloudera’s Data Platform

Why 87% of AI/ML Projects Never Make It Into Production—And How to Fix It

Data Innovation Summit with Gema Parreño – lead data scientist at Apiumhub

The Third Generation of XDR Has Arrived!

Digital Transformation is a Data Journey From Edge to Insight

Announcing Cloudera’s Enterprise Artificial Intelligence Partnership Ecosystem

Simplify your workflow deployment with Databricks Asset Bundles: Part I

Simplify your workflow deployment with Databricks Asset Bundles: Part I

The Good and the Bad of Databricks Lakehouse Platform

Educating ChatGPT on Data Lakehouse

DataOps Uncovered: A Bold New Approach to Telemetry and Network Visibility

From Hive Tables to Iceberg Tables: Hassle-Free

3 Major Trends at Strata New York 2017

Delivering the Next Generation of AI with DataRobot AI Cloud

An Overview of the Top Text Annotation Tools For Natural Language Processing

Accenture’s Smart Data Transition Toolkit Now Available for Cloudera Data Platform

12 Times Faster Query Planning With Iceberg Manifest Caching in Impala

Apiumhub among top IT industry leaders in Code Europe event

Data Governance: Concept, Models, Framework, Tools, and Implementation Best Practices

Five Takeaways from HashiConf US 2019: Building Infrastructure in a Multi-* World

A brave new (generative) world – The future of generative software engineering

TIBCO’s Innovation Streak Continues with Exciting New Product Announcements and Enhancements

Technology Trends for 2022

The death of Agile?

The Good and the Bad of Apache Airflow Pipeline Orchestration

Cost Conscious Data Warehousing with Cloudera Data Platform

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Build agentic systems with CrewAI and Amazon Bedrock

Stay Connected