article thumbnail

Select Star raises seed to automatically document datasets for data scientists

TechCrunch

With many of these tools, “they don’t do the work of connecting and building the relationship,” between data she said, adding that “documentation is still important, but being able to automatically generate [metadata] allows data teams to get value right away.”. Photo via Select Star.

Data 271
article thumbnail

Fishtown Analytics raises $29.5M Series B for its data engineering platform

TechCrunch

Fishtown Analytics , the Philadelphia-based company behind the dbt open-source data engineering tool, today announced that it has raised a $29.5 The company is building a platform that allows data analysts to more easily create and disseminate organizational knowledge. million Series A round in April.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

How FiveStars re-engineered its data engineering stack

CIO

It shows in his reluctance to run his own servers but it’s perhaps most obvious in his attitude to data engineering, where he’s nearing the end of a five-year journey to automate or outsource much of the mundane maintenance work and focus internal resources on data analysis. It’s not a good use of our time either.”

article thumbnail

The future of data: A 5-pillar approach to modern data management

CIO

This approach is repeatable, minimizes dependence on manual controls, harnesses technology and AI for data management and integrates seamlessly into the digital product development process. Operational errors because of manual management of data platforms can be extremely costly in the long run.

Data 167
article thumbnail

Beyond the hype: 4 use cases that show what’s actually working with gen AI

CIO

Baker says productivity is one of the main areas of gen AI deployment for the company, which is now available through Office 365, and allows employees to do such tasks as summarize emails, or help with PowerPoint and Excel documents. With these paid versions, our data remains secure within our own tenant, he says.

article thumbnail

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

AWS Machine Learning - AI

Traditional keyword-based search mechanisms are often insufficient for locating relevant documents efficiently, requiring extensive manual review to extract meaningful insights. This solution improves the findability and accessibility of archival records by automating metadata enrichment, document classification, and summarization.

article thumbnail

Ducklake: A journey to integrate DuckDB with Unity Catalog

Xebia

Dbt is a popular tool for transforming data in a data warehouse or data lake. It enables data engineers and analysts to write modular SQL transformations, with built-in support for data testing and documentation. This makes dbt a natural choice for the Ducklake setup.