The Rise of Unstructured Data

Cloudera

The International Data Corporation (IDC) estimates that by 2025 the sum of all data in the world will be on the order of 175 zettabytes (one zettabyte is 10^21 bytes). Seagate Technology forecasts that enterprise data will double from approximately 1 to 2 petabytes (one petabyte is 10^15 bytes) between 2020 and 2022.

97 things every data engineer should know

Grouparoo

39 How to Prevent a Data Mutiny. Key trends: modular architecture, declarative configuration, automated systems.
40 Know the Value per Byte of Your Data. Check if you are actually using your data.
41 Know Your Latencies. Key questions: How old is the data? Increase visibility. How fast are queries? How many concurrent queries can we handle?

Why are database columns 191 characters?

Grouparoo

In this post, we’ll look at the historical reasons for the 191 character limit as a default in most relational databases. The first question you might ask is why limit the length of the strings you can store in a database at all? Why varchar and not text? [...] 255 makes a lot more sense than 191. How did we get to 191?
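
The arithmetic usually given for 191 can be sketched as follows. This rests on assumptions the excerpt itself does not state: that the number comes from the 767-byte index key prefix limit of older MySQL InnoDB row formats, combined with the utf8mb4 character set, which can need up to 4 bytes per character. The function name and constants are illustrative only.

```python
# Assumed constants (not stated in the excerpt above).
INDEX_PREFIX_LIMIT_BYTES = 767  # index key prefix limit in older InnoDB row formats
MAX_BYTES_PER_CHAR = {
    "utf8mb3": 3,  # MySQL's legacy 3-byte "utf8"
    "utf8mb4": 4,  # full UTF-8, up to 4 bytes per character
}


def max_indexed_chars(charset: str, limit_bytes: int = INDEX_PREFIX_LIMIT_BYTES) -> int:
    """Longest string length whose worst-case encoding still fits in the index limit."""
    return limit_bytes // MAX_BYTES_PER_CHAR[charset]


print(max_indexed_chars("utf8mb3"))  # 255
print(max_indexed_chars("utf8mb4"))  # 191
```

Under these assumptions, 255 is the largest indexable length with a 3-byte character set, and it drops to 191 once every character may take 4 bytes.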