article thumbnail

Data Engineering Annotated Monthly – May 2022

Big Data Tools

DataHub 0.8.36 – Metadata management is a big and complicated topic. DataHub is a completely independent product by LinkedIn, and the folks there definitely know what metadata is and how important it is. If you haven’t found your perfect metadata management system just yet, maybe it’s time to try DataHub!

article thumbnail

Data Engineering Annotated Monthly – May 2022

Big Data Tools

DataHub 0.8.36 – Metadata management is a big and complicated topic. DataHub is a completely independent product by LinkedIn, and the folks there definitely know what metadata is and how important it is. If you haven’t found your perfect metadata management system just yet, maybe it’s time to try DataHub!

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

DAG Dependencies in Apache Airflow: The Ultimate Guide

Marc Lamberti

The three DAGs on the left are still doing the same stuff that produces metadata (XComs, task instances, etc). The DAG on the right is in charge of cleaning this metadata as soon as one DAG on the left completes. I tend to use it, especially for cleaning metadata generated by DAG Runs over time.

Metadata 130
article thumbnail

How to get started with dbt

Christophe Blefari

You can also add metadata on models (in YAML). docs — in dbt you can add metadata on everything, some of the metadata is already expected by the framework and thank to it you can generate a small web page with your light catalog inside: you only need to do dbt docs generate and dbt docs serve.

article thumbnail

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Data Science Blog: Data Engineering

Support for Various Data Warehouses and Databases : AnalyticsCreator supports MS SQL Server 2012-2022, Azure SQL Database, Azure Synapse Analytics dedicated, and more. Versioning: AnalyticsCreator maintains a version of history of metadata changes. Data Lakes : It supports MS Azure Blob Storage. Mixed approach of DV 2.0

article thumbnail

Ensuring the Successful Launch of Ads on Netflix

Netflix Tech

By Jose Fernandez , Ed Barker , Hank Jacobs Introduction In November 2022, we introduced a brand new tier —  Basic with ads. It also included metadata about ads, such as ad placement and impression-tracking events. As we were gearing up for launch, we wanted to ensure it would go as smoothly as possible.

Algorithm 136
article thumbnail

All of Netflix’s HDR video streaming is now dynamically optimized

Netflix Tech

We A/B tested HDR-DO encodes in production in Q3-Q4 2021, followed by improving the ladder generation algorithm further in early 2022. We started backfilling HDR-DO encodes for existing titles from Q2 2022. and based on content characteristics and/or metadata signaled in the bitstream. The graphic below (Fig.

Metadata 100