Remove docs build python-models
article thumbnail

How to get started with dbt

Christophe Blefari

a model — a model is a select statement that can be materialised as a table or as a view. The models are most the important dbt object because they are your data assets. All your business logic will be in the model select statements. You can also add metadata on models (in YAML). We call this a DAG.

article thumbnail

Data News — Week 23.37

Christophe Blefari

It brings diversity to Foundation models usually coming from US. Dimensional data modeling with dbt — A great 6-steps process to create a simple dim-fact model with dbt. Factory Patterns in Python — It remembers me Java design patterns classes at the engineering school. A bittersweet feeling.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

dbt Core, Snowflake, and GitHub Actions: pet project for Data Engineers

Towards Data Science

dbt (data build tool) facilitates modularization of SQL queries, enabling the reuse and version control of SQL workflows, just like software code is typically managed. dbt Cloud is a paid cloud-based version of the service and dbt Core is a python package providing all functionality you can use for free. You need to create a workflow

article thumbnail

Data Engineering Weekly #123

Data Engineering Weekly

Contribute to the Rudderstack Transformations Library, Win $1000 RudderStack Transformations lets you customize event data in real time with your own JavaScript or Python code. Data Engineering Weekly Functional Data Engineering - A Blueprint The Rise of Data Modeling Data modeling has been one of the hot topics in Data LinkedIn.

article thumbnail

Charting A Path For Streaming Data To Fill Your Data Lake With Hudi

Data Engineering Podcast

By adding support for small, incremental inserts into large table structures, and building support for arbitrary update and delete operations the Hudi project brings the best of both worlds together. Write some Python scripts to automate it? Go to dataengineeringpodcast.com/97things today to get your copy! Then what do you do?

Data Lake 130
article thumbnail

dbt + Machine Learning: What makes a great baton pass?

dbt Developer Hub

Special Thanks: Emilie Schario, Matt Winkler dbt has done a great job of building an elegant, common interface between data engineers, analytics engineers, and any data-y role, by uniting our work on SQL. Should we increase the SQL surface area to build ML models? Should we leave that to non-SQL interfaces(Python/Scala/etc.)?

article thumbnail

Now in Public Preview: Processing Files and Unstructured Data with Snowpark for Python

Snowflake

Announced at Summit, we’ve recently added to Snowpark the ability to process files programmatically, with Python in public preview and Java generally available. Processing files in a Python UDF and Stored Procedure has piqued the interest of our data scientists and paves the way for automation of new, complex data pipelines.”