A Cost-Effective Data Warehouse Solution in CDP Public Cloud – Part1

Cloudera

A typical approach that we have seen in customers’ environments is that ETL applications pull data every few minutes and land it in HDFS storage as an additional Hive table partition file. In this way, the analytic applications can turn the latest data into instant business insights.

Implementing the Netflix Media Database

Netflix Tech

A fundamental requirement for any lasting data system is that it should scale along with the growth of the business applications it wishes to serve. NMDB is built to be a highly scalable, multi-tenant, media metadata system that can serve a high volume of write/read throughput as well as support near real-time queries.

Modern Data Engineering

Towards Data Science

Back in October, I wrote about the rise of the Data Engineer: the role, its challenges, responsibilities, daily routine, and how to become successful in this field. The data engineering landscape is constantly changing, but major trends seem to remain the same. So here are a few things to consider that can help us answer these questions.