article thumbnail

12 Big Data Project Topics with Source Code 2023

Knowledge Hut

This project is a Lambda Architecture program that tracks Chicago's streets' traffic conditions, including congestion and safety. Anomaly detection in Cloud Servers As cloud computing has grown in popularity, many people and businesses have turned to cloud storage solutions.

article thumbnail

20+ Data Engineering Projects for Beginners with Source Code

ProjectPro

Then, the Yelp dataset downloaded in JSON format is connected to Cloud SDK, following connections to Cloud storage which is then connected with Cloud Composer. Cloud composer and PubSub outputs are Apache Beam and connected to Google Dataflow. The Yelp dataset JSON stream is published to the PubSub topic.