Remove tag dbt
article thumbnail

Data News — Week 24.02

Christophe Blefari

Fast News ⚡️ dbt related stuff Download artifacts from you dbt Cloud job runs — a tutorial from a CLI tool to generate ERD diagrams for dbt Cloud projects. Unit testing dbt models — Using a dbt-unit-testing package Matthieu showcases how you can easily test your models.

article thumbnail

How to get started with dbt

Christophe Blefari

This article is meant to be a resource hub in order to understand dbt basics and to help get started your dbt journey. When I write dbt, I often mean dbt Core. dbt Core is an open-source framework that helps you organise data warehouse SQL transformation. First let's understand why dbt exists.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Improving the code quality of your dbt models with unit tests and TDD

Towards Data Science

How to improve the code quality of your dbt models with unit tests and TDD All you need to know to start unit testing your dbt SQL models Photo by Christin Hume on Unsplash If you are a data or analytics engineer, you are probably comfortable writing SQL models and testing for data quality with dbt tests.

Coding 72
article thumbnail

A Complete Guide to Scale Your Data Pipelines and Data Products with Contract Testing and Dbt

Towards Data Science

Monolithic data architecture is still the norm — even though there are clear drawbacks.” —  dbt labs In this article, I will introduce one of those techniques: contract testing. I will show how you can use dbt to create simple contract tests for your upstream sources and your dbt models’ public interfaces.

article thumbnail

Now Featuring: Orchestration Lineage

Monte Carlo

For Airflow lineage, Monte Carlo relies on query tagging to ingest DAGs and tasks related to tables. This means leveraging functions like Snowflake query tags, BigQuery labels, query comments, cluster policies or dbt macros.

BI 52
article thumbnail

Upgrade your Modern Data Stack

Christophe Blefari

A lot of companies started with Postgres + dbt + Metabase, don't be ashamed. Find, tag and remove what is useless, what can be factorised. Your data stack won't explode if you don't use dbt. DuckDB can help saving tons of money. data debt —You might have too many dashboards or tables, workflows spaghetti.

article thumbnail

How to identify your business-critical data

Towards Data Science

Identifying your business-critical data models With many dbt projects exceeding hundreds or thousands of data models, it’s important to know which ones are business-critical so you know when you should prioritise a run or test failure, or build extra robust tests. You may already know which ones these are. Source: synq.io

BI 78