Data Quality Testing: 7 Essential Tests
Monte Carlo
DECEMBER 19, 2022
Too much data Too much data might not sound like a problem (it is called big data afterall), but when rows populate out of proportion, it can slow model performance and increase compute costs. Essentially, does this data reflect reality? In this case, the SLI would be something like “hours since dataset refreshed.”
Let's personalize your content