Remove Business Intelligence Remove Data Governance Remove Data Lake Remove Data Workflow
article thumbnail

Testing Data Applications is Hard

Meltano

Testing a data application is similar to testing any software application in many ways, just with a strong focus on testing data-related issues. But testing problems like failing data workflows, mismatches in data reconciliation after ETL, and data quality issues means that you’re not only testing the code but also the data itself.

Data 52
article thumbnail

Data Pipeline Architecture Explained: 6 Diagrams and Best Practices

Monte Carlo

The modern data stack era , roughly 2017 to present data, saw the widespread adoption of cloud computing and modern data repositories that decoupled storage from compute such as data warehouses, data lakes, and data lakehouses. Zero ETL is a bit of a misnomer.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data Migration Risks and the Checklist You Need to Avoid Them

Monte Carlo

Sure, terabytes or even petabytes of data are involved, but generally it’s not the size of the data but everything surrounding the data–workflows, access permissions, layers of dependencies–that pose data migration risks. Data governance, compliance and access management Moving a table is relatively simple.

article thumbnail

DataOps: What Is It, Core Principles, and Tools For Implementation

phData: Data Engineering

This commonly introduces: Database or Data Warehouse API/EDI Integrations ETL software Business intelligence tooling By leveraging off-the-shelf tooling, your company separates disciplines by technology. One of our customers needed the ability to export/import data between systems and create data products from this source data.

IT 52
article thumbnail

The Modern Data Stack: What It Is, How It Works, Use Cases, and Ways to Implement

AltexSoft

Built around a cloud data warehouse, data lake, or data lakehouse. Modern data stack tools are designed to integrate seamlessly with cloud data warehouses such as Redshift, Bigquery, and Snowflake, as well as data lakes or even the child of the first two — a data lakehouse.

IT 59