Remove Blog Remove Data Remove Engineering Remove Systems
article thumbnail

I asked ChatGPT to write a blog post about Data Engineering. Here it is.

Confessions of a Data Guy

Data engineering is a vital field within the realm of data science that focuses on the practical aspects of collecting, storing, and processing large amounts of data. appeared first on Confessions of a Data Guy. Here it is.

article thumbnail

How to learn data engineering

Christophe Blefari

Learn data engineering, all the references ( credits ) This is a special edition of the Data News. But right now I'm in holidays finishing a hiking week in Corsica 🥾 So I wrote this special edition about: how to learn data engineering in 2024. Who are the data engineers?

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data Engineering Weekly #171

Data Engineering Weekly

This year, we added some additional features to bring the data community together. Gloss Genius: How We Migrated From dbt Cloud and Scaled Our Data Development Gloss Genius describes its migration journey from dbt cloud to Airflow + custom Github actions. At scale, it becomes impossible to enrich the data assets manually.

article thumbnail

GPT and LLMs from a Data Engineering Perspective

Jesse Anderson

There has been quite a bit of writing covering GPT and LLMs from data science and business perspectives. I haven’t seen much from the data engineering side. Let me share my perspective, having been in data and AI for a while and using LLMs before they became popular. that summarizes blog posts using LLMs.

article thumbnail

Data Engineering Weekly #166

Data Engineering Weekly

dbt: 2024 State of Analytics Engineering The 2024 dbt’s state of analytical engineering report is out. Poor data quality and unlcear data ownership remains the top challenges for the data teams. Data Mesh continuously gaining popularity among the enterprises.

article thumbnail

Data Engineering Weekly #161

Data Engineering Weekly

RudderStack is the Warehouse Native CDP, built to help data teams deliver value across the entire data activation lifecycle, from collection to unification and activation. Editor’s Note: Chennai, India Meetup - March-08 Update We are thankful to Ideas2IT to host our first Data Hero’s meetup.

article thumbnail

Brief History of Data Engineering

Jesse Anderson

Google looked over the expanse of the growing internet and realized they’d need scalable systems. They were the first companies to commercialize open source big data technologies and pushed the marketing and commercialization of Hadoop. Apache Spark came in 2009 and gave a unified batch and streaming engine.