Launching the Engineering Blog

Zalando Engineering

Our Engineering Blog was launched in June 2020, after a long hiatus following the previous tech blog. This post covers the customizations we applied to design the blog and the publishing process. Static Site Generator: Our previous tech blog used a CMS that only a limited number of people had access to. So which static site generator should we choose?

Data Engineering Annotated Monthly – May 2022

Big Data Tools

DataHub 0.8.36 – Metadata management is a big and complicated topic. DataHub is a completely independent product by LinkedIn, and the folks there definitely know what metadata is and how important it is. If you haven’t found your perfect metadata management system just yet, maybe it’s time to try DataHub!

HDFS Data Encryption at Rest on Cloudera Data Platform

Cloudera

To prevent the management of these keys (which can run in the millions) from becoming a performance bottleneck, the encryption key itself is stored in the file metadata. Each file has an encrypted data encryption key (EDEK) stored in its metadata.

yum install rng-tools # For CentOS/RHEL 6, 7+ systems.

For CentOS/RHEL 7+ systems.

How We Use RocksDB at Rockset

Rockset

In this blog post, I'll describe how we use RocksDB at Rockset and how we tuned it to get the most performance out of it. Let me quickly describe where the RocksDB storage nodes fall in the overall system architecture. For more details on leaf nodes, please refer to the Aggregator Leaf Tailer blog post or the Rockset white paper.

Improving Efficiency Of Goku Time Series Database at Pinterest (Part 1)

Pinterest Engineering

In this first blog post, we share a short summary of the GokuS and GokuL architecture, the data format for Goku Long Term, and how we improved the bootstrap time for our storage and serving components. Generally defined, a shard is a logical subset of data used for distributing data across distributed systems.

Netflix Cloud Packaging in the Terabyte Era

Netflix Tech

Our previous tech blog post, "Packaging award-winning shows with award-winning technology", detailed our packaging technology deployed on the streaming side. The inspection stage examines the input media for compliance with Netflix’s delivery specifications and generates rich metadata.
