article thumbnail

20+ Data Engineering Projects for Beginners with Source Code

ProjectPro

This project is an opportunity for data enthusiasts to engage in the information produced and used by the New York City government. The real-time data will be processed using Spark structured streaming API and analyzed using Spark MLib to get the sentiment of every tweet. MongoDB stores the processed and aggregated results.