article thumbnail

20+ Data Engineering Projects for Beginners with Source Code

ProjectPro

The current architecture is called Lambda architecture, where you can handle both real-time streaming data and batch data. Log files are pushed to Kafka topic using NiFi, and this Data is Analyzed and stored in Cassandra DB for real-time analytics. MongoDB stores the processed and aggregated results.