
How we shaved 90 minutes off our longest running model

dbt Developer Hub

When running a job that has over 1,700 models, how do you know what a "good" runtime is? While there are many possible answers depending on dataset size, modeling complexity, and historical run times, the crux of the matter is usually "did you hit your SLAs?" The model fct_dbt_invocations takes, on average, 1.5 hours to run.
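The teaser doesn't show how that average was tracked, but as a rough illustration of the SLA check it describes, here is a minimal Python sketch with hypothetical model names and timings:

```python
# Minimal sketch (hypothetical data): flag models whose average runtime breaches an SLA.
from statistics import mean

SLA_MINUTES = 90  # assumed SLA for the longest-running model

# Hypothetical per-run timings in minutes, e.g. pulled from dbt run results
run_history = {
    "fct_dbt_invocations": [88, 95, 92, 101],
    "stg_events": [4, 5, 3, 4],
}

for model, timings in run_history.items():
    avg = mean(timings)
    status = "OK" if avg <= SLA_MINUTES else "BREACH"
    print(f"{model}: avg {avg:.1f} min -> {status}")
```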


How to Speed up Local Development of a Docker Application running on AWS

DoorDash Engineering

While most engineering tooling at DoorDash is focused on making safe incremental improvements to existing systems, in part by testing in production (learn more about our end-to-end testing strategy), this is not always the best approach when launching an entirely new business line.



How I Study Open Source Community Growth with dbt

dbt Developer Hub

That's why I built a mini-warehouse for studying community growth. My models process the data so that it's easy to perform analysis and spot trends. Here are the tools I chose to use: Google BigQuery acts as the main database, holding all the source data, intermediate models, and data marts.
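The post itself walks through the dbt setup; as a sketch of how one might query a finished data mart in that BigQuery warehouse from Python, here is a minimal example using the google-cloud-bigquery client (the project, dataset, and table names are hypothetical):

```python
# Minimal sketch, assuming a data mart table built by dbt in BigQuery.
# Requires: pip install google-cloud-bigquery and configured GCP credentials.
from google.cloud import bigquery

client = bigquery.Client()  # uses default project and credentials

# Hypothetical mart summarizing community activity per week
query = """
    SELECT week_start, new_contributors, total_commits
    FROM `my_project.community_marts.fct_weekly_activity`
    ORDER BY week_start DESC
    LIMIT 12
"""

for row in client.query(query).result():
    print(row.week_start, row.new_contributors, row.total_commits)
```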


Dat: Distributed Versioned Data Sharing with Danielle Robinson and Joe Hand - Episode 16

Data Engineering Podcast

Happening April 29th to the 30th in New York, it will give you a solid understanding of the latest breakthroughs and best practices in AI for business.


What is ETL Pipeline? Process, Considerations, and Examples

ProjectPro

This guide provides definitions, a step-by-step tutorial, and a few best practices to help you understand ETL pipelines and how they differ from data pipelines. When working on real-time business problems, data scientists build models using various Machine Learning or Deep Learning algorithms.
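The guide covers the details; as a bare-bones illustration of the extract-transform-load shape it describes, here is a hedged Python sketch with made-up file names and columns:

```python
# Minimal ETL sketch (hypothetical files and columns): extract from CSV,
# transform in memory, load into a local SQLite table.
import csv
import sqlite3

def extract(path: str) -> list[dict]:
    with open(path, newline="") as f:
        return list(csv.DictReader(f))

def transform(rows: list[dict]) -> list[tuple]:
    # Keep only completed orders and cast the amount to a float
    return [
        (r["order_id"], float(r["amount"]))
        for r in rows
        if r.get("status") == "completed"
    ]

def load(records: list[tuple], db_path: str = "warehouse.db") -> None:
    con = sqlite3.connect(db_path)
    con.execute("CREATE TABLE IF NOT EXISTS orders (order_id TEXT, amount REAL)")
    con.executemany("INSERT INTO orders VALUES (?, ?)", records)
    con.commit()
    con.close()

if __name__ == "__main__":
    load(transform(extract("orders.csv")))
```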


The Ultimate Modern Data Stack Migration Guide

phData: Data Engineering

Why Migrate to a Modern Data Stack? Business-Focused Operation Model: Teams can shed countless hours of managing long-running and complex ETL pipelines that do not scale. Transparent Pricing Model: Say goodbye to tedious cost adjustments for hardware, software, platform maintenance, upgrade costs, etc.