Remove learn open-source-etl-tools
article thumbnail

Seamless SQL And Python Transformations For Data Engineers And Analysts With SQLMesh

Data Engineering Podcast

SQLMesh was designed as a unifying tool that is simple to work with but powerful enough for large-scale transformations and complex projects. Their SDKs make event streaming from any app or website easy, and their extensive library of integrations enable you to automatically send data to hundreds of downstream tools.

article thumbnail

Building ETL Pipelines With Generative AI

Data Engineering Podcast

Summary Artificial intelligence applications require substantial high quality data, which is provided through ETL pipelines. Now that AI has reached the level of sophistication seen in the various generative models it is being used to build new ETL workflows. When is AI the wrong choice for ETL applications?

Building 162
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 24.09

Christophe Blefari

I'll give a talk—in French—about how to put in production machine learning at a small scale. Final comment, with these 2 announcement Mistral left the open side to go commercial / closed. In 2024 we are more than ever tools to move data from sources to destinations. Mistral perdant.

Data 162
article thumbnail

Data News — Week 23.14

Christophe Blefari

I've tried to run open-source models locally on my own laptop. The only normalisation I did was back at the engineering school while learning SQL with Normal Forms. A data engineer should still be a software engineer working with data, empowering others with tooling and apps. But here is your usual Data News.

article thumbnail

Data News — Week 13.14

Christophe Blefari

I've tried to run open-source models locally on my own laptop. The only normalisation I did was back at the engineering school while learning SQL with Normal Forms. A data engineer should still be a software engineer working with data, empowering others with tooling and apps. But here is your usual Data News.

article thumbnail

Mapping The Data Infrastructure Landscape As A Venture Capitalist

Data Engineering Podcast

Join The RudderStack Transformation Challenge today for a chance to win a $1,000 cash prize just by submitting a Transformation to the open-source RudderStack Transformation library. The rapid growth and proliferation of data tools helped establish the "Modern Data Stack" as a de-facto architectural paradigm.

Hadoop 130
article thumbnail

Modern Data Engineering

Towards Data Science

Platform Specific Tools and Advanced Techniques Photo by Christopher Burns on Unsplash The modern data ecosystem keeps evolving and new data tools emerge now and then. Image by author It also might be a datalake in the center and it depends on the type of our data platform and tools we use.