article thumbnail

A Detailed Guide of Interview Questions on Apache Kafka

Analytics Vidhya

Introduction Apache Kafka is an open-source publish-subscribe messaging application initially developed by LinkedIn in early 2011. It is a message broker application and a logging service that is distributed, segmented, and […] The post A Detailed Guide of Interview Questions on Apache Kafka appeared first on Analytics Vidhya.

Kafka 201
article thumbnail

Unapologetically Technical Episode 10 – Michael Drogalis

Jesse Anderson

In this episode, I interview Michael Drogalis, the founder and CEO of ShadowTraffic where we talked about the early Hadoop era and how he saw the need for Kafka in the industry. And just like that, we’re down to the 10th episode of Unapologetically Technical!

Hadoop 100
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Unapologetically Technical Episode 8 – Tom Scott

Jesse Anderson

We discuss the key features and how they enable analytics uses of data stored in Kafka. We go in-depth into Streambased. We cover how it works and the ease of use. Don’t forget to subscribe to my YouTube channel to get the latest on Unapologetically Technical!

Kafka 100
article thumbnail

Big Data Technologies that Everyone Should Know in 2024

Knowledge Hut

If you pursue the MSc big data technologies course, you will be able to specialize in topics such as Big Data Analytics, Business Analytics, Machine Learning, Hadoop and Spark technologies, Cloud Systems etc. There are a variety of big data processing technologies available, including Apache Hadoop, Apache Spark, and MongoDB.

article thumbnail

Brief History of Data Engineering

Jesse Anderson

Doug Cutting took those papers and created Apache Hadoop in 2005. They were the first companies to commercialize open source big data technologies and pushed the marketing and commercialization of Hadoop. Hadoop was hard to program, and Apache Hive came along in 2010 to add SQL. We lacked a scalable pub/sub system.

article thumbnail

How to learn data engineering

Christophe Blefari

Hadoop initially led the way with Big Data and distributed computing on-premise to finally land on Modern Data Stack — in the cloud — with a data warehouse at the center. In order to understand today's data engineering I think that this is important to at least know Hadoop concepts and context and computer science basics.

article thumbnail

Why you should not learn everything in Data Science

Team Data Science

and then all of a sudden you have Spark 3, or Kafka - Kafka Streaming, Kafka Connect and so on. So, let's bring Hadoop into play here. Everyone suddenly started talking about Hadoop. Everyone should learn Hadoop. There was a time when people said, "Okay, let's look at Hadoop and become a Hadoop expert.