Remove learn incremental-model-using-dbt-snowflake
article thumbnail

Upgrade your Modern Data Stack

Christophe Blefari

Over the years Cloudera logo has been replaced by Snowflake and Databricks ones. We jumped from HDFS to Cloud Storage (S3, GCS) for storage and from Hadoop, Spark to Cloud warehouses (Redshift, BigQuery, Snowflake) for processing. The modern data stack has always been nice words to bundle a philosophy used to build data platform.

article thumbnail

Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable

Data Engineering Podcast

Datafold integrates with dbt, the modern data stack, and seamlessly plugs in your data CI for team-wide and automated testing. Learn more about Datafold by visiting dataengineeringpodcast.com/datafold You shouldn't have to throw away the database to build with fast-changing data. How can you get the best results for your use case?

Process 182
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 23.09

Christophe Blefari

This week I've published an compact article about how to get started with dbt. The idea behind this article is to define every dbt concept and objects from the CLI to the Jinja templating or models and sources. Rad my article — How to get started with dbt Machine Learning Saturday 🤖 Was it a boost ride?

article thumbnail

How to Master Data Transformations with DBT Materializations?

Workfall

But then, a game-changer emerged – DBT (Data Build Tool). With DBT’s materializations, our data transformations underwent a magical transformation themselves. In this blog, we’ll whisk you away on an enchanting journey through DBT materializations. In this blog, we will cover: What is DBT? DBT , the Data Build Tool.

article thumbnail

Optimizing Materialized Views with dbt

dbt Developer Hub

I was a kitten-only household, and dbt Labs was still Fishtown Analytics. A enterprise customer I was working with, Jetblue, asked me for help running their dbt models every 2 minutes to meet a 5 minute SLA. After getting over the initial terror, we talked through the use case and soon realized there was a better option.

article thumbnail

Data News — December 2023

Christophe Blefari

I also took part in my friend's podcast where we discussed 3 trends in data : data modeling, real-time analytics and DataOps. He even quickly explain how you can run a model on your computer. When it comes to generative they currently have 3 models: tiny, small and medium, which are performing well against GPT-3.5.

Data 100
article thumbnail

Charting A Path For Streaming Data To Fill Your Data Lake With Hudi

Data Engineering Podcast

With more real-time requirements and the increasing use of streaming data there has been a struggle to merge fast, incremental updates with large, historical analysis. It builds your customer data warehouse and your identity graph on your data warehouse, with support for Snowflake, Google BigQuery, Amazon Redshift, and more.

Data Lake 130