article thumbnail

How to learn data engineering

Christophe Blefari

Learn data engineering, all the references ( credits ) This is a special edition of the Data News. But right now I'm in holidays finishing a hiking week in Corsica 🥾 So I wrote this special edition about: how to learn data engineering in 2024. What is Hadoop? Who are the data engineers?

article thumbnail

Brief History of Data Engineering

Jesse Anderson

Doug Cutting took those papers and created Apache Hadoop in 2005. They were the first companies to commercialize open source big data technologies and pushed the marketing and commercialization of Hadoop. Hadoop was hard to program, and Apache Hive came along in 2010 to add SQL. They eventually merged in 2012.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Mr. Pavan’s Data Engineering Journey Drives Business Success

Analytics Vidhya

He is an experienced data engineer with a passion for problem-solving and a drive for continuous growth. Thus, providing valuable insights into the field of data engineering. Introduction We had an amazing opportunity to learn from Mr. Pavan.

article thumbnail

Most Essential 2023 Interview Questions on Data Engineering

Analytics Vidhya

Introduction Data engineering is the field of study that deals with the design, construction, deployment, and maintenance of data processing systems. The goal of this domain is to collect, store, and process data efficiently and efficiently so that it can be used to support business decisions and power data-driven applications.

article thumbnail

Reflecting On The Past 6 Years Of Data Engineering

Data Engineering Podcast

In that time there have been a number of generational shifts in how data engineering is done. Go to [dataengineeringpodcast.com/materialize]([link] Support Data Engineering Podcast Summary This podcast started almost exactly six years ago, and the technology landscape was much different than it is now.

article thumbnail

Data Engineering Weekly #173

Data Engineering Weekly

[link] Tweeq: Tweeq Data Platform: Journey and Lessons Learned: Clickhouse, dbt, Dagster, and Superset Tweeq writes about its journey of building a data platform with cloud-agnostic open-source solutions and some integration challenges. It is refreshing to see an open stack after the Hadoop era.

article thumbnail

How to Become a Data Engineer in 2024?

Knowledge Hut

Data Engineering is typically a software engineering role that focuses deeply on data – namely, data workflows, data pipelines, and the ETL (Extract, Transform, Load) process. What is Data Science? What are the roles and responsibilities of a Data Engineer? And many more. And many more.