Data Engineering Annotated Monthly – May 2022

Big Data Tools

DataHub 0.8.36 – Metadata management is a big and complicated topic. DataHub is an open-source metadata platform originally built at LinkedIn, and the folks there definitely know what metadata is and how important it is. If you haven’t found your perfect metadata management system just yet, maybe it’s time to try DataHub!

All of Netflix’s HDR video streaming is now dynamically optimized

Netflix Tech

As noted in an earlier blog post, we began developing an HDR variant of VMAF; let’s call it HDR-VMAF. We A/B tested HDR-DO encodes in production in Q3-Q4 2021, followed by improving the ladder generation algorithm further in early 2022. We started backfilling HDR-DO encodes for existing titles from Q2 2022.

Ensuring the Successful Launch of Ads on Netflix

Netflix Tech

By Jose Fernandez, Ed Barker, and Hank Jacobs. In November 2022, we introduced a brand new tier: Basic with ads. It also included metadata about ads, such as ad placement and impression-tracking events. As we were gearing up for launch, we wanted to ensure it would go as smoothly as possible.

Data Engineering Weekly #104

Data Engineering Weekly

of my dear data friends say that they use @data_weekly to keep up with the data engineering landscape ❤️❤️❤️ (8:04 PM, Oct 18, 2022). The top of my mind for this week is Data Catalog. It is almost two years since we published the metadata edition, but I keep thinking back.

Why Data Governance Is Crucial for All Enterprise-Level Businesses

Cloudera

Data analytics and machine learning can become a business and a compliance risk if data security, governance, lineage, metadata management, and automation are not holistically applied across the entire data lifecycle and all environments. From Bad to Worse. One possible solution is to adopt a hybrid cloud strategy.

Apache Ozone Powers Data Science in CDP Private Cloud

Cloudera

In this blog post, we will ingest a real-world dataset into Ozone, create a Hive table on top of it, and analyze the data to study the correlation between new vaccinations and new cases per country using a Spark ML Jupyter notebook in CML. Learn more about the impacts of global data sharing in this blog, The Ethics of Data Exchange.