Remove data-engineering-glossary
article thumbnail

Data News — December 2023

Christophe Blefari

Before moving on to the Data News, a bit of personal news, in December, I took part in the MotherDuck meetup in Berlin. End of January, on the 31st I'll speak at a Modern Data Stack conf in Paris, still about DuckDB, but this time in French. Enjoy this last 2023 Data News. We're going to get to know each other.

Data 100
article thumbnail

Data Engineering Weekly #105

Data Engineering Weekly

Data Engineering Weekly Is Brought to You by RudderStack RudderStack provides data pipelines that make it easy to collect data from every application, website, and SaaS platform, then activate it in your warehouse and business tools. The highlights are that 59% of folks think data catalogs are sometimes helpful.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

What Should I Look For in a Data Catalog Tool?

phData: Data Engineering

In our previous blog in this series , we spent a lot of time exploring why a data catalog is valuable and who you might need to support it. With that background information in mind, we’re ready to take a look at some actual tools and properly uncover what’s the best data catalog for your business. Where is your metadata stored?

article thumbnail

Data Quality + Data Lineage = ???

Datakin

Blog Data Quality + Data Lineage = Written by Peter Hicks on Sep 2, 2021 In a prior life, I dwelled in the day-to-day cycles of an e-commerce platform. We had both an application engineering team and a data team consuming our data and building dashboards at various cadences and informing business decisions at large.

Bytes 52
article thumbnail

Data Engineering Weekly #111

Data Engineering Weekly

Data Engineering Weekly Is Brought to You by RudderStack RudderStack provides data pipelines that make it easy to collect data from every application, website, and SaaS platform, then activate it in your warehouse and business tools. It's the year's end, so there is plenty of 2023 predictions in Data Engineering.

article thumbnail

A guide to Generative AI terminology by Colin Eberhardt

Scott Logic

I find it such a useful reference, I thought I’d share it in this blog post. Training - a process whereby large quantities of data are presented to the neural network, with the quality of its output evaluated in some way. These steps may also involve collecting additional data via web searches.

article thumbnail

What’s a Data Catalog and How to Choose the Right One

phData: Data Engineering

Your business might be moving to the cloud, just completed, or have been established with it for a little while, and you are likely wondering, “what data catalog tool is best for me?” How to set goals for your data catalog How to establish business drivers for your data catalog What value does a data catalog provide your business?