12 Big Data Project Topics with Source Code 2023

Knowledge Hut

This article offers current suggestions for your next big data project. You can also check out the best Big Data courses to get an in-depth understanding of big data tools and technologies and prepare for a job in the domain. The top big data projects you shouldn't miss are listed below.

20+ Data Engineering Projects for Beginners with Source Code

ProjectPro

Data professionals who work with raw data, such as data engineers, data analysts, machine learning scientists, and machine learning engineers, also play a crucial role in any data science project. Of these professions, this blog discusses the data engineering role.

Data Engineering Annotated Monthly – January 2022

Big Data Tools

The one remaining free tool I’m aware of is Arenadata Cluster Manager, but the free version doesn’t allow the user to do certain things, like deploy HA name nodes. Apache Hop 1.1: The number of no-code tools is snowballing. We all know Apache NiFi, a stream processing tool with its own processing engine.

Data Engineering Annotated Monthly – October 2021

Big Data Tools

Spark: Constraint Propagation code causes OOM issues or increasing compilation time to hours – Under certain conditions, constraint propagation code may be suboptimal or even cause an application to crash with an OutOfMemoryError. That wraps up October’s Data Engineering Annotated.

20 Solved End-to-End Big Data Projects with Source Code

ProjectPro

Ace your big data interview by adding some unique and exciting Big Data projects to your portfolio. This blog lists over 20 big data projects you can work on to showcase your big data skills and gain hands-on experience in big data tools and technologies.