Modern data architectures must be designed to take advantage of technologies such as AI, automation, and the internet of things (IoT). According to data platform Acceldata, there are three core principles of data architecture: scalability, data governance and compliance, and scalable data pipelines.
We developed clear governance policies that outlined how we define AI and generative AI in our business, principles for responsible AI use, a structured governance process, and compliance standards across different regions (because AI regulations vary significantly between Europe and the U.S.).
The following is a review of the book Fundamentals of Data Engineering by Joe Reis and Matt Housley, published by O’Reilly in June of 2022, and some takeaway lessons. This book is as good for a project manager or any other non-technical role as it is for a computer science student or a data engineer.
Our Databricks Practice holds FinOps as a core architectural tenet, but sometimes compliance overrules cost savings. There is a catch once we consider data deletion within the context of regulatory compliance: in regulated industries, default implementations may introduce compliance risks that must be addressed.
By integrating Azure Key Vault Secrets with Azure Synapse Analytics, organizations can securely access external data sources and manage credentials centrally. This integration not only improves security by ensuring that secrets are never exposed in code or configuration files, but also improves compliance with regulatory standards.
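For illustration, here is a minimal sketch of how such a lookup might look inside a Synapse Spark notebook, assuming the notebookutils helpers available in Synapse pools; the vault, secret, linked-service, server, and table names are placeholders, not values from the article.

```python
# A minimal sketch, assuming a Synapse Spark notebook where `spark` and the
# notebookutils helpers are available; every name below is a placeholder.
from notebookutils import mssparkutils

# Resolve the credential from Azure Key Vault at runtime so it never sits
# in code or configuration files.
sql_password = mssparkutils.credentials.getSecret(
    "my-key-vault",           # Key Vault name (placeholder)
    "external-sql-password",  # secret name (placeholder)
    "AzureKeyVaultLS",        # linked service granting access to the vault (placeholder)
)

# Use the secret to read from an external source via JDBC.
jdbc_url = "jdbc:sqlserver://example-server.database.windows.net:1433;database=sales"
orders_df = (
    spark.read.format("jdbc")
    .option("url", jdbc_url)
    .option("user", "etl_reader")       # placeholder login
    .option("password", sql_password)   # secret fetched above, never hard-coded
    .option("dbtable", "dbo.orders")    # placeholder table
    .load()
)
```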
A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.
The solution had to adhere to compliance, privacy, and ethics regulations as well as brand standards, and had to use existing compliance-approved responses without additional summarization. It was important for Principal to maintain fine-grained access controls and make sure all data and sources remained secure within its environment.
How will organizations wield AI to seize greater opportunities, engage employees, and drive secure access without compromising data integrity and compliance? While it may sound simplistic, the first step towards managing high-quality data and right-sizing AI is defining the GenAI use cases for your business.
Breaking down silos has been a drumbeat of data professionals since Hadoop, but this SAP <-> Databricks initiative may help to solve one of the more intractable data engineering problems out there. SAP has a large, critical data footprint in many large enterprises. However, SAP has an opaque data model.
That amount of data is more than twice the data currently housed in the U.S. Nearly 80% of hospital data is unstructured and most of it has been underutilized until now. To build effective and scalable generative AI solutions, healthcare organizations will have to think beyond the models that are visible at the surface.
Database developers should have experience with NoSQL databases, Oracle Database, big data infrastructure, and big data engines such as Hadoop. The role requires strong skills in complex project management and the ability to juggle design requirements while ensuring the final product is scalable, maintainable, and efficient.
There’s an ever-growing need for technical pros who can handle the rapid pace of technology, ensuring businesses keep up with industry standards, compliance regulations, and emerging or disruptive technologies. The demand for specialized skills has boosted salaries in cybersecurity, data engineering, development, and program management.
When it comes to financial technology, data engineers are the most important architects. As fintech continues to change the way traditional financial services are delivered, the data engineer’s job becomes more and more important in shaping the future of the industry.
Platform engineering: purpose and popularity. Platform engineering teams are responsible for creating and running self-service platforms for internal software developers to use. “Platform engineering teams work closely with both IT and business teams, fostering collaboration within the organization,” he says.
This makes it hard to combine them, especially with growing data volumes. Unfortunately, unharmonized data is not fit for use in customer analytics or risk and compliance, and data engineers and scientists end up building some sort of rule- or heuristic-based system to manage it.
Building applications with RAG requires a portfolio of data (company financials, customer data, data purchased from other sources) that can be used to build queries, and data scientists know how to work with data at scale. Data engineers build the infrastructure to collect, store, and analyze data.
This includes Apache Hadoop , an open-source software that was initially created to continuously ingest data from different sources, no matter its type. Cloud data warehouses such as Snowflake, Redshift, and BigQuery also support ELT, as they separate storage and compute resources and are highly scalable. Compliance.
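As a rough illustration of the ELT pattern those warehouses enable, the sketch below loads raw JSON into Snowflake and transforms it in place; it assumes the snowflake-connector-python package, an existing stage, and a raw.events table with a single VARIANT column named payload, and all identifiers are invented for the example.

```python
# A minimal ELT sketch, assuming the snowflake-connector-python package, an
# existing external stage, and a raw.events table with one VARIANT column
# named payload; all identifiers are invented for this example.
import snowflake.connector

conn = snowflake.connector.connect(
    user="ETL_USER", password="***", account="my_account",
    warehouse="ETL_WH", database="ANALYTICS", schema="RAW",
)
cur = conn.cursor()

# Load first: copy the raw JSON files into the warehouse untouched.
cur.execute("COPY INTO raw.events FROM @landing_stage FILE_FORMAT = (TYPE = JSON)")

# Transform second, inside the warehouse, where compute scales separately from storage.
cur.execute("""
    CREATE OR REPLACE TABLE analytics.daily_events AS
    SELECT payload:user_id::STRING AS user_id,
           DATE_TRUNC('day', payload:ts::TIMESTAMP) AS event_day,
           COUNT(*) AS events
    FROM raw.events
    GROUP BY 1, 2
""")
conn.close()
```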
According to Gartner, Inc. analyst Sumit Pal, in “Exploring Lakehouse Architecture and Use Cases,” published January 11, 2022: “Data lakehouses integrate and unify the capabilities of data warehouses and data lakes, aiming to support AI, BI, ML, and data engineering on a single platform.”
While there are clear reasons SVB collapsed, which can be reviewed here, my purpose in this post isn’t to rehash the past but to present some of the regulatory and compliance challenges financial (and to some degree insurance) institutions face and how data plays a role in mitigating and managing risk.
As the variety of data explodes, on-premises options fail to handle it. Apart from the lack of scalability and flexibility offered by modern databases, traditional ones are costly to implement and maintain. At the moment, cloud-based data warehouse architectures provide the most effective use of data warehousing resources.
In this blog post, we want to tell you about our recent effort to do metadata-driven data masking in a way that is scalable, consistent and reproducible. Using dbt to define and document data classifications and Databricks to enforce dynamic masking, we ensure that access is controlled automatically based on metadata.
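The post itself describes the approach in detail; purely as a hedged sketch of what metadata-driven masking can look like, the snippet below reads column classifications from a dbt manifest and applies Databricks Unity Catalog column masks. The governance.mask_pii function, the classification meta tag, and the table naming are assumptions for illustration, not the authors' implementation.

```python
# A hedged sketch only: walk a dbt manifest, look for a "classification" meta
# tag on each column, and apply a Unity Catalog column mask in Databricks.
# The mask function, tag name, and table naming are assumptions; `spark` is
# the session available in a Databricks notebook or job.
import json

MASK_FUNCTIONS = {"pii": "governance.mask_pii"}  # classification -> masking UDF (assumed)

with open("target/manifest.json") as f:          # produced by `dbt compile` / `dbt build`
    manifest = json.load(f)

for node in manifest["nodes"].values():
    if node["resource_type"] != "model":
        continue
    table = f'{node["schema"]}.{node["name"]}'   # prepend a catalog name if your setup requires it
    for col_name, col in node.get("columns", {}).items():
        classification = col.get("meta", {}).get("classification")
        mask_fn = MASK_FUNCTIONS.get(classification)
        if mask_fn:
            # Unity Catalog evaluates the mask per query, based on the caller's group membership.
            spark.sql(f"ALTER TABLE {table} ALTER COLUMN {col_name} SET MASK {mask_fn}")
```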
John Snow Labs’ Medical Language Models library is an excellent choice for leveraging the power of large language models (LLM) and natural language processing (NLP) in Azure Fabric due to its seamless integration, scalability, and state-of-the-art accuracy on medical tasks.
Because the lab and the factory setting differ in character, a request from a Data Scientist to the Data Engineer to productionise an advanced analytics model can be quite a labor-intensive activity with many iterations and handovers. Data & Analytics adopting DevOps principles.
Custom and off-the-shelf microservices handle the complexity of security, scalability, and data isolation, and integrate into complex workflows through orchestration. That said, there’s still a significant data engineering effort to safely and securely aggregate and cleanse the data in the warehouse.
According to Gartner, by 2023 65% of the world’s population will have their personal data covered under modern privacy regulations. As a result, growing global compliance and regulations for data are top of mind for enterprises that conduct business worldwide. People selling information. Infrastructure.
Security: Data privacy and security are often afterthoughts during the process of model creation but are critical in production. It satisfies the organization’s security and compliance requirements, thus minimizing operational friction and meeting the needs of all teams involved in a successful ML project.
Performance and scalability. Cloudera developed unique features in CDP for Iceberg query performance and scalability for large data sets including I/O caching, dynamic partition pruning, vectorization, Z-ordering, parquet page indexes, and manifest caching. Read why the future of data lakehouses is open.
Today’s general availability announcement covers Iceberg running within key data services in the Cloudera Data Platform (CDP), including Cloudera Data Warehousing (CDW), Cloudera Data Engineering (CDE), and Cloudera Machine Learning (CML). Read why the future of data lakehouses is open.
Three types of data migration tools. Automation scripts can be written by data engineers or ETL developers in charge of your migration project. This makes sense when you move a relatively small amount of data and deal with simple requirements. Use cases: moving data from on-premises to cloud or between cloud environments.
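To make the idea concrete, here is a minimal, hedged sketch of the kind of hand-rolled migration script described above; the connection strings, table name, and chunk size are illustrative assumptions, and pandas, SQLAlchemy, and the relevant database drivers are presumed to be installed.

```python
# A minimal sketch of a hand-rolled migration script; the connection strings,
# table name, and chunk size are placeholders, and the relevant drivers
# (e.g. psycopg2) plus pandas and SQLAlchemy are assumed to be installed.
import pandas as pd
from sqlalchemy import create_engine

source = create_engine("postgresql://reader:***@onprem-db:5432/sales")             # placeholder
target = create_engine("postgresql://loader:***@cloud-db.example.com:5432/sales")  # placeholder

# Stream the table in chunks so memory stays bounded even for larger tables.
for chunk in pd.read_sql_table("orders", source, chunksize=50_000):
    chunk.to_sql("orders", target, if_exists="append", index=False)
```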
To achieve their goals of digital transformation and becoming data-driven, companies need more than just a better data warehouse or BI tool. They need a range of analytical capabilities, from data engineering to data warehousing to operational databases and data science. Governing for compliance.
Legacy data sharing involves proliferating copies of data, creating data management and security challenges. Data quality issues deter trust and hinder accurate analytics. Disparate systems create issues with transparency and compliance. Deploying modern data architectures.
Automation and Scalability Operationalization normally involves automating processes and workflows to enable scalability and efficiency. By automating data processes, organizations can ensure that insights and models are consistently applied to new data and operational decisions, reducing manual effort and improving responsiveness.
This operation requires a massively scalable records system with backups everywhere, reliable access functionality, and the best security in the world. The platform can absorb data streams in real time, then pass them on to the right database or distributed file system. A shared catalog of data and metadata aids compliance requirements.
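As a toy illustration of that absorb-and-route pattern (not the platform's actual implementation), the sketch below consumes a stream with kafka-python and routes each record to a sink; the topic, broker, record fields, and sink helpers are all invented for the example.

```python
# A toy sketch of the absorb-and-route pattern with kafka-python; the topic,
# broker, record fields, and sink helpers below are invented for illustration.
import json
from kafka import KafkaConsumer

def write_to_object_store(record: dict) -> None:
    """Hypothetical sink standing in for the distributed file system."""
    print("to object store:", record)

def write_to_warehouse(record: dict) -> None:
    """Hypothetical sink standing in for the analytical database."""
    print("to warehouse:", record)

consumer = KafkaConsumer(
    "patient-events",                    # placeholder topic
    bootstrap_servers=["broker:9092"],   # placeholder broker
    value_deserializer=lambda b: json.loads(b.decode("utf-8")),
)

for message in consumer:
    record = message.value
    # Route each record to the appropriate sink based on a field in the payload.
    if record.get("type") == "imaging":
        write_to_object_store(record)
    else:
        write_to_warehouse(record)
```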
In general, a data infrastructure is a system of hardware and software tools used to collect, store, transfer, prepare, analyze, and visualize data. Check our article on data engineering to get a detailed understanding of the data pipeline and its components. Big data infrastructure in a nutshell.
Percona Live 2023 was an exciting open-source database event that brought together industry experts, database administrators, data engineers, and IT leadership. The top factors leading respondents to choose proprietary databases included greater stability (68%), more security (63%), and regulatory compliance (61%).
To work effectively, data scientists need agility in the form of access to enterprise data, streamlined tooling, and infrastructure that just works. Agility and enterprise security, compliance, and governance are often at odds. Now you can take full advantage of the scale and elasticity of your Snowflake instance.
Individuals in an associate solutions architect role have 1+ years of experience designing available, fault-tolerant, scalable, and, most importantly, cost-efficient distributed systems on AWS. They must prove knowledge of deploying, operating, and managing highly available, scalable, and fault-tolerant systems on AWS.
Technologies Behind Data Lake Construction Distributed Storage Systems: When building data lakes, distributed storage systems play a critical role. These systems ensure high availability and facilitate the storage of massive data volumes. Data Ingestion Tools: The journey of constructing a data lake starts with data ingestion.
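As a small, hedged example of that first ingestion step, the snippet below lands a raw file in a date-partitioned prefix on object storage with boto3; the bucket name, prefix layout, and dataset name are assumptions for illustration, and configured AWS credentials are presumed.

```python
# A small sketch of landing raw files in a data lake's ingestion zone on S3;
# the bucket, prefix layout, and dataset name are assumptions, and boto3 plus
# valid AWS credentials are presumed to be configured.
import datetime
import boto3

s3 = boto3.client("s3")

def land_file(local_path: str, dataset: str) -> str:
    """Upload a raw file into a date-partitioned landing prefix and return its key."""
    today = datetime.date.today()
    filename = local_path.rsplit("/", 1)[-1]
    key = f"landing/{dataset}/dt={today:%Y-%m-%d}/{filename}"
    s3.upload_file(local_path, "my-data-lake", key)   # placeholder bucket
    return key

land_file("exports/orders_2024_05_01.csv", "orders")
```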
Drawing on more than a decade of experience in building and deploying massive-scale data platforms on economical budgets, Cloudera has designed and delivered a cost-cutting cloud-native solution: Cloudera Data Warehouse (CDW), part of the new Cloudera Data Platform (CDP). Watch this video to get an overview of CDW.
In fact, the ability to account for the fairness and transparency of these predictive models has been mandated for legal compliance. At DataScience.com , where I’m a lead data scientist, we feel passionately about the ability of practitioners to use models to ensure safety, non-discrimination, and transparency.
The data that your procurement management software generates can help you analyze potential suppliers’ performance by comparing their KPIs, prices, compliance, and other variables. Data engineers work with technologies, setting up and managing data pipelines to extract, store, and transform data for further usage.
release, can support unlimited scalability and enterprise security requirements, and can communicate with the data hub for content storage and indexing natively. It enabled the ingestion of over 1 PB of unstructured content into the data hub with a peak rate of over two million documents per hour. compliance reporting.
So, we’ll only touch on its most vital aspects, instruments, and areas of interest, namely data quality, patient identity, database administration, and compliance with privacy regulations. Cloud capabilities and HIPAA compliance out of the box. What is health information management: a brief introduction to the HIM landscape.
Tereza needs an interface/platform that allows her to connect to different data sources and have a singular view. The platform also has the capability to pre-process the data and engineer features, basically getting the data ready for modeling.
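A tiny pandas sketch of what that "single view plus feature engineering" step might look like is below; the files, columns, and aggregations are invented for illustration and are not from the article.

```python
# An invented pandas sketch of the "single view plus feature engineering" step;
# the files, columns, and aggregations are placeholders, not Tereza's actual data.
import pandas as pd

# Pull two different sources into one frame to get a singular view of the customer.
orders = pd.read_csv("orders.csv")               # e.g. export from a transactional system
profiles = pd.read_parquet("profiles.parquet")   # e.g. extract from a CRM
view = orders.merge(profiles, on="customer_id", how="left")

# Pre-process and engineer features so the data is ready for modeling.
view["order_date"] = pd.to_datetime(view["order_date"])
features = (
    view.groupby("customer_id")
    .agg(total_spend=("amount", "sum"),
         order_count=("order_id", "count"),
         last_order=("order_date", "max"))
    .reset_index()
)
```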